Background
The identification of the power utilization behaviors of the users is a necessary measure for realizing intelligent energy comprehensive service under the background of the power Internet of things. When the big data are used for analyzing the electricity consumption behaviors of the users in an artificial intelligence mode, the data set samples are often unbalanced, the classification of the electricity consumption data is inaccurate, and the subsequent data analysis and processing are problematic, so that the expansion of the small sample data through the small sample user electricity consumption data generation method has important significance in balancing the data sets. The generation of the countermeasure network is a deep learning model, is good at generating the expansion data which is similar to the input data distribution, and is a new method for solving the expansion problem of the small sample data.
On one hand, the existing method for expanding the power consumption data of the small sample user is a simple copy of a sample in terms of a generation algorithm and has no diversity, so that the generalization capability of subsequent analysis is not strong; on the other hand, the classification is performed by considering the time domain, and the characteristics of the user electricity data in the frequency domain are not sufficiently considered. Aiming at the limitation of the existing small sample user electricity consumption data generation method, the invention acquires daily electricity consumption load data of the small sample user by collecting on the basis of fully considering the frequency domain characteristics of the user electricity consumption data, calculates the frequency domain mapping vector of the daily electricity consumption load data of the small sample user, adopts a probabilistic distribution variance minimum generation confrontation network method to expand and generate frequency domain user candidate data, screens the frequency domain user electricity consumption data according to the minimum sample cumulative cosine divergence criterion, and calculates the expanded time domain small sample user electricity consumption data based on frequency domain inverse transformation; the consistency of the power consumption expansion data of the small sample user and the original data on the frequency domain distribution is effectively improved.
Disclosure of Invention
The invention aims to provide a small sample user electricity consumption data expansion method with frequency domain distribution consistency, which comprises the following steps:
s1, acquiring daily electricity load data of a small sample electricity user;
s2, calculating a frequency domain mapping vector of daily electric load data of the small sample user;
s3, expanding and generating frequency domain user data to be selected by adopting a method of generating a confrontation network by using a probability distribution variance minimum;
s4, screening frequency domain user power utilization data according to a sample accumulated cosine divergence minimum criterion;
and S5, calculating the expanded time domain small sample user power utilization data based on the frequency domain inverse transformation.
Compared with the general technology, the method has the advantages that on the basis of fully considering the frequency domain characteristics of the user electricity utilization data, the daily electricity utilization load data of the small sample electricity utilization user are acquired through collection, the frequency domain mapping vector of the daily electricity utilization load data of the small sample electricity utilization user is calculated, the confrontation network method is expanded by adopting the minimum probability distribution variance to generate the frequency domain user data to be selected, the frequency domain user electricity utilization data are screened according to the minimum sample cumulative cosine divergence criterion, and the expanded time domain small sample user electricity utilization data are calculated on the basis of frequency domain inverse transformation; the consistency of the power consumption expansion data of the small sample user and the original data on the frequency domain distribution is effectively improved.
Detailed Description
The preferred embodiments will be described in detail below with reference to the accompanying drawings. It should be emphasized that the following description is merely exemplary in nature and is not intended to limit the scope of the invention or its application. Referring to FIG. 1, a flow chart of the method of the present invention is shown, and the method of the present invention is further described with reference to specific examples.
The specific implementation application scenario is shown in fig. 2, the total number of small sample power users is 2, daily power load data of the small sample users after classification is obtained through an intelligent electric meter, the sampling interval is 0.5 hour, frequency domain mapping quantity is calculated through Fourier transform, frequency domain expansion data are generated through a generation antagonistic network, high-quality frequency domain data are screened based on the minimum rule of cosine distance between distributions, and time domain small sample expansion data are obtained through frequency domain inverse transformation. The obtained expansion data can be used as the predicted load data of the power consumption of the original small sample user, and can also be combined for data analysis, so that various problems caused by the similar unbalance phenomenon are solved.
The invention provides a small sample user electricity consumption data expansion method with consistent frequency domain distribution, which comprises the following steps:
s1, acquiring daily electricity load data of small sample electricity users
The total number of the power users of the small sample is set to be K (the value range of K is 1-100), and the multiple of the expanded large sample is set to be Z (the value range is5-10), and the expansion factor is lambda (the value range is 2-5); daily electricity load data of each small sample electricity user K (the value range of K is 1-K) is acquired by using a household intelligent electricity meter, and electricity data D in the ith (the value range of i is 1-50) statistical time of the small sample electricity user are periodically acquired according to a sampling interval T (unit hour)
k(i) Acquiring a daily electrical load data set { D
k(i) (} (i ═ 1.., N), where
floor (×) denotes the round-down operation.
In the example, the total number of the small sample power users is K ═ 2, the expansion large sample multiple is Z ═ 2, and the expansion factor is λ ═ 2; acquiring daily electricity load data of each small sample electricity user K (the value range of K is 1-K ()) by using a household intelligent electricity meter, and periodically acquiring electricity data D of the small sample electricity user in the ith (the value range of i is 1-50) statistical time according to the sampling interval T of 0.5 (unit hour)
k(i) Acquiring a daily electrical load data set { D
k(i) (} (i ═ 1.., N), where
floor (×) denotes the round-down operation.
S2, calculating a frequency domain mapping vector of daily electric load data of the small sample user
For each small sample electricity user k, according to the formula
Calculating daily electric load data frequency domain mapping quantity X of small sample electric user k
k(s) (s ═ 1.. said., N), obtaining daily electricity data frequency domain mapping quantity vector of all users
In the example, for each small sample, the electricity user k is used according to the formula
Calculating daily electric load data frequency domain mapping quantity X of small sample electric user k
k(s) (s ═ 1.. said., N), obtaining daily electricity data frequency domain mapping quantity vector of all users
S3, adopting a method of generating a countermeasure network by minimum probability distribution variance to expand and generate frequency domain user candidate data
The daily electricity load data frequency domain mapping vector of K small sample electricity users is used as data input, a gradient descent method is adopted, the countermeasure network is subjected to data expansion training according to the minimum probability distribution variance criterion, P lambda Z frequency domain user data to be selected are generated through expansion, wherein the P (P1
In the example, daily electricity load data frequency domain mapping vectors of K small sample electricity users are used as data input, a gradient descent method is adopted, a countermeasure network is generated according to a probability distribution variance minimum criterion for data expansion training, P lambda Z frequency domain user data to be selected are generated through expansion, wherein the P (P1, the
S4, screening frequency domain user electricity utilization data according to the minimum criterion of sample accumulated cosine divergence
For the p-th user day frequency domain data to be selected generated by each expansion, according to a formula
Calculating the sample accumulated cosine divergence Q (p) of the data to be selected; according to the value of Q (p), the daily frequency of the user to be selected is increased from small to largeCarrying out sequencing operation on the domain data vectors; selecting the front Z user data to be selected as the power consumption daily frequency domain data of the extended user
In the example, the data of the p-th frequency domain of the user to be selected generated by each expansion is processed according to a formula
Calculating the sample accumulated cosine divergence Q (p) of the data to be selected; sequencing the day frequency domain data vectors of the users to be selected according to the value Q (p) from small to large; selecting the front Z user data to be selected as the power consumption daily frequency domain data of the extended user
S5, time domain small sample user electricity consumption data based on frequency domain inverse transformation calculation expansion
For each z th extended user, the power consumption daily frequency domain data
According to the formula
Sequentially calculating q ═ 1.. and N time domain components H of the expansion data
z(q) (q ═ 1.., N), constituting extended daily electricity load data { H @
z(q) } (q 1.., N), completing the operation of expanding the electricity data by the small sample user.
In the example, the electricity utilization daily frequency domain data of every z th expansion user
According to the formula
Sequentially calculating q ═ 1.. and N time domain components H of the expansion data
z(q) (q ═ 1.., N), constituting extended daily electricity load data { H @
z(q)}(q=1,...,N),And finishing the electricity consumption data expansion operation of the small sample user.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.