CN113496255A

CN113496255A - Power distribution network hybrid observation point distribution method based on deep learning and decision tree driving

Info

Publication number: CN113496255A
Application number: CN202110617672.7A
Authority: CN
Inventors: 刘友波; 赵亮; 高红均; 向月; 刘俊勇
Original assignee: Sichuan University
Current assignee: Sichuan University
Priority date: 2021-05-31
Filing date: 2021-05-31
Publication date: 2021-10-12
Anticipated expiration: 2041-05-31
Also published as: CN113496255B

Abstract

The invention discloses a power distribution network hybrid observation point distribution method based on deep learning and decision tree driving, which relates to the technical field of power system optimization, and is characterized by firstly constructing a decision tree model to quantitatively analyze the influence importance of each node characteristic of a power distribution network on the topology observability of the power distribution network, and selecting part of node measurement characteristics with high importance and measurement data to form a hybrid measurement sample set for deep learning model training according to the sequence of the node characteristic importance from large to small; and then analyzing the topology observability change performance index of the power distribution network under different mixed measurement schemes by adopting a PCA-DBN coupling deep learning model, and finally determining the optimal point distribution scheme of the mixed measurement device according to the selected nodes when the power distribution network is completely observable and the mixed measurement scheme. The method breaks through the planning idea of the traditional measuring device optimization point distribution method, and adopts data driving methods such as deep learning and decision trees to realize an economical and efficient hybrid observation point distribution planning scheme for active power distribution network topology identification.

Description

Power distribution network hybrid observation point distribution method based on deep learning and decision tree driving

Technical Field

The invention relates to the technical field of power system optimization, in particular to a power distribution network hybrid observation and point distribution method based on deep learning and decision tree driving.

Background

With the development of distribution network automation, more and more AMI and PMU equivalent measurement devices are put into a distribution network, and data support is provided for solving complex nonlinear problems such as active distribution network state estimation and topology identification by adopting a data science method represented by deep learning. However, the data-based data science method has higher requirements on the quality and quantity of sample data, and particularly in a large-scale power distribution network, because high-precision automatic power distribution measurement devices such as AMI (advanced metering infrastructure) and PMU (power management unit) are expensive, the measurement devices are very limited in configuration quantity, the quantity and precision of real-time measurement data still cannot meet the requirements of data required by the current algorithm, and pseudo measurement data based on prediction has to be used in most scenes; although more advanced measurement devices are already put into part of the power grid, the distribution positions of the power grid are not reasonable enough, the collected measurement data contain more redundant information, the cost of each measurement device is different, and data incompatibility caused by data delay, different data periods, large data precision difference and the like among the measurement data is a completely considerable important factor influencing the power distribution network. In order to consider the economy and reliability of power data sources and improve the observability of a power distribution network under the power distribution network state estimation and topology identification scenes, a new measurement system needs to be planned and configured in a power distribution network system or an existing measurement system needs to be improved, and the observability of the power distribution network topology is realized by fully utilizing mixed measurement data. Under the current environment of rapid development of the power distribution network, powerful support of related technologies is urgently needed to guide topology identification hybrid observation point distribution and planning of the active power distribution network.

Disclosure of Invention

In view of the technical defects, the invention provides a power distribution network hybrid observation point distribution method based on deep learning and decision tree driving.

In order to achieve the purpose, the technical scheme of the invention is as follows:

a power distribution network hybrid observation point distribution method based on deep learning and decision tree driving comprises the following steps:

s1, obtaining an active power distribution network running state offline sample T;

s2, constructing a decision tree model based on the off-line samples of the running state of the active power distribution network, inputting the off-line samples obtained in S1 into the decision tree model for analysis and calculation to obtain the feature importance of the nodes, arranging the feature importance according to the descending order, selecting the measured data of the nodes corresponding to the first n feature importance of the sequence, and selecting a candidate mixed observation stationing sample set;

s3, inputting the alternative mixed observation stationing sample set obtained in the step S2 into a PCA-DBN coupling topology recognition model, analyzing the observability of the topology of the active power distribution network, judging whether the topology of the active power distribution network is completely observable, if the topology of the active power distribution network is completely observable, entering the step S4, if the topology of the active power distribution network is not completely observable, selecting the measurement data of the nodes corresponding to the first n +1 feature importance degrees of the sequence to form an alternative mixed observation stationing scheme, and inputting the alternative mixed observation stationing scheme into the PCA-DBN coupling topology recognition model again;

and S4, obtaining a mixed observation point distribution scheme.

Preferably, the process of obtaining the offline sample of the operation state of the active power distribution network in S1 is as follows:

s10, selecting a power distribution network operation topological structure set G meeting three operation characteristics of radial, loop-free and island-free of an active power distribution network_T＝(G_T1,G_T2,…,G_Tm)；

S11, from the feasible topology set G_TIn select topology G_Ta；

S12, based on feasible topology G_TaGenerating different active power distribution network operation scenes according to a typical daily load fluctuation curve and a photovoltaic output curve and considering photovoltaic and ZIP loads;

and S13, calculating the load flow distribution under different active traveling scenes, and recording the active distribution network operation state data to form an active distribution network operation state offline sample, wherein the offline sample comprises the active power P of each node of the active distribution network, the node voltage V and the voltage phase angle difference theta as the input of the sample, and the correspondingly input distribution network tie switch and section switch states as the sample output.

Preferably, the step of constructing the decision tree model in S2 includes the following steps:

carrying out supervised learning on an active power distribution network running state offline sample T which takes X as an input variable and Y as an output variable, wherein the active power distribution network running state offline sample is as follows: t { (x)₁,y₁),(x₂,y₂),…,(x_n,y_n)}；

The mathematical expression of the kini coefficient of the active power distribution network running state off-line sample T is as follows:

the active power distribution network running state offline sample is provided with K categories C_kThe number of samples of the Kth category;

the decision tree model is a binary decision tree recursively constructed from a root node through an active power distribution network running state offline sample based on a minimum criterion of a kini coefficient, and the decision tree model is established in the following process:

s20, inputting an active power distribution network running state off-line sample T and a Gini coefficient threshold value g_ini；

S21, calculating each characteristic of the active power distribution network operation state offline sample and the loss function of the division node, and randomly selecting the jth characteristic x of the active power distribution network operation state offline sample T_jCut variable as sample and x_jThe value s is used as a dividing node of the sample, and the off-line sample T of the running state of the active power distribution network is divided into two subdata sets T_s1And T_s2The mathematical expression of the partition principle is as follows:

the loss function being after divisionTwo subdata sets T_s1And T_s2The mean square error of (a), the corresponding feature segmentation variable x when the loss function reaches the set minimum value_jAnd a dividing node s, the calculation formula of which is as follows:

in the formula c₁Representing a sub data set T_s1Average value of output, c₂Representing a sub data set T_s2Average value of the output, wherein

S22, respectively calculating the damping coefficients of the two division nodes, and judging whether the damping coefficients of the two division nodes are smaller than a damping coefficient threshold value g_iniWhen the damping coefficients of the two divided nodes are not less than the damping coefficient threshold value g_iniIf so, the two divided nodes respectively return to the step S21 to continue the node recursive division; when the Gini coefficient of one of the divided nodes is smaller than the threshold value g of the Gini coefficient_iniThen the node stops the node recursive partitioning, and the node is not smaller than the threshold value g of the Gini coefficient_iniThe other divided node returns to step S21 to continue the node recursive division until the kini coefficient recursively reaches the other divided node and is less than the threshold value g of the kini coefficient_iniObtaining K subspaces of all divided nodes;

s23, dividing the input active power distribution network operation state offline samples into K subspaces, wherein each subspace comprises partial sample data and an average value c of subspace output values_KAnd obtaining a decision tree model, wherein the mathematical expression of the decision tree model is as follows:

in the formula I (x is equal to T)_sK) An indicator function representing the model.

Preferably, the offline samples obtained in step S1 are input to the decision tree model for analysis and calculation in step S2 as follows:

in the formula, N represents the total number of off-line samples of the running state of the active power distribution network, and N_tNumber of offline samples, N, representing active distribution network operating state of current node_tRRepresenting the left sub-tree, N, of the current node_tLThe number of the active power distribution network running state offline samples of the left sub-tree of the current node is represented, H represents the purity of the current node, and H represents the purity of the current node_rightRepresenting the left sub-tree of the current node, H_leftIndicating the degree of purity of the left sub-tree of the current node.

Preferably, in step S3, the alternative mixed observation point distribution sample set obtained in step S2 is input into the PCA-DBN coupled topology recognition model, and the process of analyzing the observability of the topology of the active power distribution network is as follows:

s30, preprocessing the alternative mixed observation stationing sample set;

s31, selecting principal component variables meeting the requirement of topology identification accuracy by adopting PCA, and simultaneously reducing the noise and data dimensionality of node voltage data;

s32, constructing a DBN network, and applying the DBN to train and learn the nonlinear relation between the node voltage of the sample and the network topology;

s33, after the training is finished, storing the PCA-DBN coupling model after the training;

and S34, inputting the preprocessed alternative mixed observation stationing sample set into the trained PCA-DBN coupling model topology recognizer for power distribution network topology identification analysis, and obtaining an analysis result of the topology observability of the active power distribution network.

The invention has the beneficial effects that: analyzing the influence degree of node measurement data on the observability of the power distribution network quantitatively through a data-driven decision tree model, preliminarily screening the positions of mixed observation distribution points, and formulating different mixed observation distribution points according to the importance degree of the nodes;

then based on a mixed observation stationing scheme, a PCA-DBN deep learning method is adopted to carry out observability analysis on the topology of the active power distribution network, a principal component analysis method is used to adaptively select the principal characteristics in the measured data, the measured redundant information is removed, and the learning efficiency of the DBN can be effectively improved;

the DBN is adopted to process non-sequence sample data, and the problem of complex nonlinearity in active power distribution network topology identification is effectively solved. And finally, determining an economical and efficient mixed observation point arrangement planning scheme according to observability analysis results of different mixed observation point arrangement schemes.

Drawings

FIG. 1 is a schematic view; the invention provides; an IEEE33 node standard power distribution system;

FIG. 2 is a schematic view; the invention provides; system node feature importance percentage;

FIG. 3 is a schematic view; the invention provides; system node feature importance;

FIG. 4 is a schematic view; the invention provides; the measurement device distribution quantity influences the topology observability of the power distribution network;

FIG. 5 is a schematic view; the invention provides; a system mixed observation stationing planning scheme;

FIG. 6 is a schematic view; the invention provides; the flow chart is schematic.

Detailed Description

The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the accompanying drawings, and other advantages and effects of the present invention will be readily apparent to those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

s2, constructing a decision tree model based on the off-line samples of the running state of the active power distribution network, inputting the off-line samples obtained in S1 into the decision tree model for analysis and calculation to obtain the feature importance of the nodes, arranging the feature importance according to the descending order, selecting the measured data of the nodes corresponding to the first n feature importance of the sequence, and forming an alternative mixed observation stationing sample set;

and S4, obtaining a mixed observation point distribution scheme.

s10, selecting a power distribution network operation topological structure set satisfying three operation characteristics of radial, loop-free and island-free of an active power distribution network as G_T＝(G_T1,G_T2,…,G_Tm)；

S11, from the feasible topology set G_TIn select topology G_Ta；

S12, based on feasible topology G_TaGenerating different power distribution network operation scenes according to the typical daily load fluctuation curve and the photovoltaic output curve; meanwhile, in order to simulate more real operation conditions of the active power distribution network system, large-scale photovoltaic access power distribution networks are considered, and all load types are ZIP loads.

The input reasons for selecting the active power P of the node, the node voltage V and the voltage phase angle difference theta as samples are as follows:

for any distribution network determined by the network topology structure, the steady-state operation state of the power grid can also be determined by solving the power flow. The power flow equation of the power system is shown as follows:

in the formula P_iAnd Q_iRespectively representing injected active and reactive power, V, of node i_iRepresenting the sum voltage magnitude, θ, of node i_ijRepresenting the voltage angle difference between node i and node j. According to the power flow equation, the other two variables can be determined for the power distribution network determined by the network topology as long as any two of P, Q, V and theta are known.

carrying out supervised learning on an active power distribution network operation state offline sample T which takes X as an input variable and Y as an output variable, wherein the active power distribution network operation state offline sample T { (X)₁,y₁),(x₂,y₂),…,(x_n,y_n) And (4) assuming that K types of off-line samples of the running state of the active power distribution network exist, wherein the number of the Kth type samples is C_k. In the process of establishing the decision tree, the operation state offline samples of the active power distribution network need to be divided, and the jth characteristic x of the operation state offline samples T of the active power distribution network is usually selected_jAnd the value s is used as a segmentation variable and a division node of the sample, and the off-line sample of the running state of the active power distribution network is divided into twoSubdata set T_s1And T_s2The division principle is as follows:

the present invention selects features based on the criterion of minimum kini coefficients. The expression of the Keyny coefficient of the active power distribution network operation state off-line sample T is as follows:

the magnitude of the kiney coefficient represents the purity of the model, and a smaller kiney coefficient indicates a better selected feature of the model. The decision tree model is a binary decision tree recursively constructed by taking the minimum kini coefficient of off-line sample data of the active power distribution network operation state as a criterion from a root node, and the construction process of the decision tree model is as follows:

S21, calculating a loss function of each characteristic of the active power distribution network running state offline sample and the division node, wherein the loss function is divided into two sub data sets T_s1And T_s2Selecting a corresponding feature segmentation variable xj and a corresponding segmentation node s when the loss function is minimum, wherein a calculation formula is shown as the following formula:

in the formula c₁And c₂Respectively representing two sub data sets T_s1And T_s2Average value of the output, wherein

S22, dividing the input active distribution network operation state offline sample into two sub-samples through the step S21Data set T_s1And T_s2；

S23, respectively calculating the damping coefficients of the two division nodes, and judging whether the damping coefficients of the two division nodes are smaller than a damping coefficient threshold value g_iniWhen the damping coefficients of the two divided nodes are not less than the damping coefficient threshold value g_iniIf so, the two divided nodes respectively return to the step S21 to continue the node recursive division; when the Gini coefficient of one of the divided nodes is smaller than the threshold value g of the Gini coefficient_iniThen the node stops the node recursive partitioning, and the node is not smaller than the threshold value g of the Gini coefficient_iniThe other divided node returns to step S21 to continue the node recursive division until the kini coefficient recursively reaches the other divided node and is less than the threshold value g of the kini coefficient_iniObtaining K subspaces of all divided nodes;

s24, finally dividing the input active power distribution network operation state offline samples into K subspaces, wherein each subspace comprises partial sample data and an average value c of subspace output values_KThe final decision tree model can be expressed as

For a sample set containing multiple features, the decision tree can calculate a division standard value of each feature in the sample, the division standard value is used as an importance degree calculation index of each feature, and the contribution degree of each feature in the sample to a target variable can be determined through the feature importance degree.

Preferably, in step S2, the feature importance of each node of the active power distribution network is calculated by the decision tree model as follows:

in which N and N_tRespectively representing the total quantity of the off-line samples of the running state of the active power distribution network and the quantity, N, of the off-line samples of the running state of the active power distribution network of the current node_tRAnd N_tLRespectively representing the quantity of offline samples of the operating states of the active power distribution network of the current node left sub-tree and the current node left sub-tree. H represents the purity of the current node, the impurity is the sum of squares of errors of off-line samples of the running state of the active power distribution network of the node, and H_rightAnd H_leftAnd respectively representing the impurity degrees of the left sub-tree of the current node and the left sub-tree of the current node, wherein the smaller the impurity degree H of the output node is, the higher the feature importance degree of the node is.

and S30, preprocessing the sample set of the alternative mixed observation stationed points.

The preprocessing method for the alternative mixed observation stationing sample set is as follows:

data quality issues mainly include data loss, erroneous data, and high levels of measurement noise. When missing data is processed, a k neighbor algorithm is adopted to complement and modify sample data. According to the algorithm, k adjacent samples of the missing data samples are selected according to the characteristic of high similarity between adjacent data of the samples, and the missing data is corrected by using the number of bits of the k adjacent samples. The use of median values also enables us to deal with bad data. The euclidean distance is used to determine the distance between sample points as shown in the following equation:

in the formula X_i＝(x_i1,x_i2,…,x_im) Representing the first m-dimensional data of sample i, element x_irRepresenting the attribute of the r-th dimension of sample i. The anomalous data in the sample will be represented by the mean absolute deviation described belowThe index (MAD) was subjected to hypothesis testing as shown below:

wherein MAD is mean (| x)_i-x_median|). With the robust statistical test method based on MAD, we can identify bad data, and these identified data will be replaced by the values obtained by the k-nearest neighbor algorithm. Meanwhile, because the types of the operation data of the power grid are more, the dimensions of the operation data are different and distributed, and the operation data need to be standardized firstly. Suppose the operating data obtained from the grid is X ═ X (X)¹,x²,x³,…,x^m) And has been subjected to a standardization process. The normalization process is shown below:

and S31, adopting PCA to select principal component variables meeting the requirement of the topology identification accuracy, and simultaneously reducing the noise of the node voltage data and the data dimension.

The main characteristics in the measured data are selected in a self-adaptive mode by applying a principal component analysis method, redundant information of the measured data is removed, and the learning efficiency of the DBN can be effectively improved. The principle component variable mode which meets the requirement of the topological identification accuracy is selected by adopting PCA as follows:

Z＝XX^T

Z＝λω

Y＝Xω

wherein Z is a covariance matrix of the power grid operation data X, and the eigenvalue and eigenvector of the covariance matrix Z are respectively lambda ═ lambda₁,λ₂,…,λ_m) And ω ═ ω (ω)₁,ω₂,…,ω_m) And a characteristic value λ₁≥λ₂≥…≥λ_mArranged from large to small, and Y is the main component of the sample data.

Calculating the variance contribution rate delta of the kth principal component variable_kAnd cumulative varianceThe contribution φ is shown as:

selecting proper magnitude of the cumulative variance contribution rate according to actual requirements, determining the number of the principal components F according to the set cumulative variance contribution rate, and expressing the selected principal component set as F ═ { F ═ F₁,f₂,f₃,…,f_pAnd p is the number of the selected principal components.

S32, constructing a DBN network, and applying the DBN to train and learn the nonlinear relation between the node voltage of the sample and the network topology.

Constructing a DBN network, and applying the DBN to train and learn the nonlinear relation mode of the node voltage of the sample and the network topology as follows:

the DBN is a probabilistic generative model composed of multiple Restricted Boltzmann Machine (RBM) stacks. Each RBM comprises a visual layer V and an implied layer H, the neurons of the visual layer and the implied layer are connected, and the neurons in the visual layer and the neurons in the implied layer are not connected. The joint probability, joint probability density, activation probability of nerve units and model parameters of the visual layer and the hidden layer in the RBM in the DBN model are shown as follows:

wherein E (v, h | theta) represents the joint probability of the visible layer and the hidden layer in the RBM, P (v, h | theta) is the joint probability density, and P (h | theta)_j1| v) is the activation probability of the cell, L (θ) is the maximum likelihood function of the model parameter θ, v_iRepresenting the status of the nodes of the visual layer, h_jRepresenting the state of the hidden layer node, a_iAnd b_jRespectively representing the corresponding bias values, w, of the nodes of the visual layer and the hidden layer_ijAnd representing the connection weight between the visual layer and the hidden layer, wherein p is the number of the main components.

The parameter θ is updated using a Nesterov acceleration adaptive moment estimation (Nadam) optimizer. The adaptive moment estimation (Adam) algorithm is improved by a Nesterov Acceleration Gradient (NAG) algorithm. Adam's update rules are as follows:

m_t＝β₁m_t-1+(1-β₁)g_t

where eta is the learning rate, t is the number of updates, g_tIs a gradient, m_tIs an estimate of the first moment of the gradient, v_tIs an estimate of the second moment of the gradient,

and

correcting the estimate, beta, for deviations in the momentum vector₁，β₂And epsilon is a correction parameter,

the estimate is corrected for the current momentum vector bias.

The structure of the DBN is mainly determined by the number of neurons per layer and the number of hidden layers. The scale and the precision of the deep belief network are determined by the number of hidden layer neurons and the number of hidden layer neurons, and too many hidden layer neurons can cause the problems of overlarge DBN scale, large calculation amount, long model training time and the like; while too few hidden layer neurons reduce prediction accuracy. The invention determines the structure of the DBN by a grid search method.

And S33, stopping training after reaching the set standard, and storing the PCA-DBN coupling model after training.

The PCA-DBN coupling model training setting standard adopts reconstruction errors to carry out quantitative analysis, and the definition and the judgment standard of the reconstruction errors are shown as the following formula:

where n is the number of samples, pr is the calculated value of the network, d is the true value,

and calculating an average value of the network, wherein delta is a preset reconstruction error value, and L is the number of layers of the hidden layer.

And finally, carrying out observability analysis on the topology of the active power distribution network under different mixed observation stationing schemes by using a PCA-DBN deep learning method, and determining the optimal mixed observation stationing planning scheme by taking the complete observability of the active power distribution network as a target.

And S34, inputting the real-time measurement data into the trained PCA-DBN coupling model topology recognizer for power distribution network topology identification analysis.

As shown in fig. 1, an IEEE33 node standard distribution network is described as an example. The types of loads in the system are all ZIP loads, and the proportion of constant power, constant impedance and constant current loads is 50%, 30% and 20% respectively. Three photovoltaic power stations with a rated capacity of 0.3MW are placed at node 17, node 21 and node 32. 350 topological structures meeting the running characteristics of radial, ring-free and island-free operation of the power distribution network are selected in total, 35000 data samples are formed, and the sample data comprises node voltage amplitude, phase angle and injection power.

Firstly, according to the proposed decision tree optimization model, the importance of the node characteristics of the power distribution network is analyzed and calculated, and the importance percentage of the node characteristics is shown in fig. 2. According to the graph result, the amplitude and the phase angle difference of the node voltage have large influence on the observability of the power distribution network topological structure, and the injection power of the node has small influence on the observability of the power distribution network topological structure. From the analysis of the operation angle of the power system, the voltage and the reactive power of the power grid are strongly related, and the phase angle and the active power are strongly related, so that the operation topology of the power grid can be reflected by selecting two variables of the node voltage amplitude and the phase angle, and the topology observability of the power distribution network is realized.

Further analysis, the characteristic importance indexes of each node are sequentially arranged from large to small according to the importance of the node, as shown in fig. 3, it can be seen from the graph that the difference of the characteristic importance of different nodes is large, for example, the importance of the voltage amplitude and the phase angle of the 0 th node is 0, because the node 0 is a balanced node, and the voltage amplitude and the phase angle are not changed; secondly, the importance of the voltage amplitude characteristic and the voltage phase angle characteristic of each node are different, for example, the importance of the voltage amplitude characteristic of the node 7 is smaller than that of the phase angle, and the importance of the voltage amplitude characteristic of the node 21 and the node 18 is larger than that of the phase angle.

The number of nodes is reduced from all the nodes according to the descending order of the importance degree of the nodes, the measured data of the corresponding nodes are selected to form a training sample set, the training sample set is input into a PCA-DBN coupling topology recognition model and other machine learning methods, the influence of the distribution position and the distribution number of the measuring device on the topology observability of the power distribution network is analyzed, and the comparison result is shown in FIG. 4. The comparison shows that the PCA-DBN model requires the least number of the arranged measuring devices under the condition that the topology of the power distribution network is completely considerable, and when the number of the arranged measuring devices is more than 11, the identification rate of the topology of the power distribution network is stabilized to be more than 99%, and the topology of the power distribution network can be considered to be completely considerable at the moment; when the number of the installed measuring devices is less than 11, the descending range of the topology identification rate of the power distribution network is large.

The distribution quantity and the topology observability of the power distribution network measuring device are considered comprehensively, the first 11 nodes with the maximum node importance degree are selected as the optimal distribution positions of the high-precision PMU measuring device of the IEEE33 node system, and the selected nodes are combined into P_case33I.e., {7,21,18,17,24,4,3,6,9,11,32 }. Further, the voltage and phase angle data of the

nodes

7, 17, 3,6,9,11 and 32 and the voltage data of the

nodes

21,18, 24 and 4 are selected as the input of the model, the topology of the power distribution network still can reach 98% of the accuracy of topology identification, and the observability of the topology of the power distribution network is ensured. Considering both economy and topology observability, the optimal configuration scheme is to arrange high-precision PMU measuring devices capable of measuring node voltages and phase angles only at

nodes

7, 17, 3,6,9,11 and 32, and arrange voltage mutual inductance devices PT with lower price at

nodes

21,18, 24 and 4, as shown in FIG. 5.

The foregoing is illustrative of the preferred embodiments of this invention, and it is to be understood that the invention is not limited to the precise form disclosed herein and that various other combinations, modifications, and environments may be resorted to, falling within the scope of the concept as disclosed herein, either as described above or as apparent to those skilled in the relevant art. And that modifications and variations may be effected by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A power distribution network hybrid observation point distribution method based on deep learning and decision tree driving is characterized by comprising the following steps:

s3, inputting the alternative mixed observation stationing sample set obtained in the step S2 into a PCA-DBN coupling topology recognition model, analyzing the observability of the topology of the active power distribution network, judging whether the topology of the active power distribution network is completely observable, if the topology of the active power distribution network is completely observable, entering the step S4, if the topology of the active power distribution network is not completely observable, selecting the measurement data of the nodes corresponding to the first n +1 feature importance degrees of the sequence to form an alternative mixed observation stationing sample set, and inputting the alternative mixed observation stationing sample set into the PCA-DBN coupling topology recognition model again;

and S4, obtaining a mixed observation point distribution scheme.

2. The deep learning and decision tree driving-based hybrid observation and stationing method for the power distribution network according to claim 1, wherein the process of obtaining the offline samples of the operation status of the active power distribution network in S1 is as follows:

S11, from the feasible topology set G_TIn select topology G_Ta；

and S13, calculating load flow distribution under different active traveling scenes, and recording the active distribution network operation state data to form an active distribution network operation state offline sample, wherein the offline sample comprises active power P of each node of the active distribution network, node voltage V and voltage phase angle difference theta as sample inputs, and correspondingly input distribution network tie switch and section switch states as sample outputs.

3. The deep learning and decision tree driving based power distribution network hybrid observation stationing method according to claim 1, wherein the step of constructing the decision tree model in S2 comprises the steps of:

S21, calculating the running state of the active power distribution network to be off-lineEach characteristic of the samples and the loss function of the divided nodes are selected randomly, and the jth characteristic x of the active power distribution network operation state offline sample T is selected randomly_jCut variable as sample and x_jThe value s is used as a dividing node of the sample, and the off-line sample T of the running state of the active power distribution network is divided into two subdata sets T_s1And T_s2The mathematical expression of the partition principle is as follows:

the loss function is divided into two sub data sets T_s1And T_s2The mean square error of (a), the corresponding feature segmentation variable x when the loss function reaches the set minimum value_jAnd a dividing node s, the calculation formula of which is as follows:

S22, respectively calculating the damping coefficients of the two division nodes, and judging whether the damping coefficients of the two division nodes are smaller than a damping coefficient threshold value g_iniWhen the damping coefficients of the two divided nodes are not less than the damping coefficient threshold value g_iniIf so, the two divided nodes respectively return to the step S21 to continue the node recursive division; when the Gini coefficient of one of the divided nodes is smaller than the threshold value g of the Gini coefficient_iniThen the node stops the node recursive partitioning, and the node is not smaller than the threshold value g of the Gini coefficient_iniReturns to step S21 to continue the recursive division of the nodes until recursion to the kiney system of another divided nodeNumber less than the threshold value g of the kini coefficient_iniObtaining K subspaces of all divided nodes;

4. The deep learning and decision tree driving based hybrid observation point distribution method for the power distribution network according to claim 1, wherein the offline samples obtained in step S1 are input into the decision tree model for analysis and calculation in the following manner in step S2:

5. The deep learning and decision tree driving-based power distribution network hybrid observation stationing method according to claim 1, wherein the step S3 is implemented by inputting the candidate hybrid observation stationing sample set obtained in the step S2 into a PCA-DBN coupled topology recognition model, and the process of analyzing the observability of the topology of the active power distribution network is as follows:

s30, preprocessing the alternative mixed observation stationing sample set;