CN111753252A - Nataf transformation-based random variable sample generation method and system - Google Patents

Nataf transformation-based random variable sample generation method and system Download PDF

Info

Publication number
CN111753252A
CN111753252A CN202010384994.7A CN202010384994A CN111753252A CN 111753252 A CN111753252 A CN 111753252A CN 202010384994 A CN202010384994 A CN 202010384994A CN 111753252 A CN111753252 A CN 111753252A
Authority
CN
China
Prior art keywords
random variable
correlation coefficient
random
distribution domain
gaussian distribution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010384994.7A
Other languages
Chinese (zh)
Inventor
唐俊杰
许丹
乐彦婷
戴赛
崔晖
丁强
黄国栋
蔡帜
燕京华
闫翠会
张传成
孙振
李伟刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University
State Grid Corp of China SGCC
China Electric Power Research Institute Co Ltd CEPRI
State Grid Fujian Electric Power Co Ltd
Original Assignee
Chongqing University
State Grid Corp of China SGCC
China Electric Power Research Institute Co Ltd CEPRI
State Grid Fujian Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University, State Grid Corp of China SGCC, China Electric Power Research Institute Co Ltd CEPRI, State Grid Fujian Electric Power Co Ltd filed Critical Chongqing University
Priority to CN202010384994.7A priority Critical patent/CN111753252A/en
Publication of CN111753252A publication Critical patent/CN111753252A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Algebra (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Public Health (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Water Supply & Treatment (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Complex Calculations (AREA)

Abstract

The invention relates to a random variable sample generation method and system based on Nataf transformation, wherein the method comprises the following steps: establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation; determining a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain; and generating sample data of the random variable in the original distribution domain by using a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain. According to the technical scheme provided by the invention, the conversion relation of the correlation coefficient of the random variable in the original distribution domain and the standard Gaussian distribution domain in the Nataf transformation is simplified by using the radial basis function neural network, the solving speed of the correlation coefficient of the random variable in the standard Gaussian distribution domain in the Nataf transformation is accelerated, and the generation speed of the random variable sample is accelerated.

Description

Nataf transformation-based random variable sample generation method and system
Technical Field
The invention relates to the technical field of new energy grid connection technology and optimal power flow analysis, in particular to a random variable sample generation method and system based on Nataf transformation.
Background
With the development of new energy technology, more and more new energy power plants are incorporated into the power network. For different kinds of new energy power plants, most of primary energy sources of the new energy power plants have strong random uncertainty, such as wind speed of a wind power plant, solar radiation intensity of a photovoltaic power plant, tidal flow rate of a tidal power plant and the like. In the case of large-scale grid connection of new energy, great uncertainty is brought to a power system by the uncertain sources.
Probabilistic power flow and probabilistic optimal power flow have been proposed as basic analysis methods capable of effectively handling power systems with probabilistic uncertainty sources. For a general probabilistic power flow and probabilistic optimal power flow algorithm, the algorithm mainly comprises three steps of probability modeling, probabilistic (optimal) power flow calculation, analysis of probability calculation results and application. As the scale of the power grid increases, the number of uncertain sources contained in the power system increases, and the probability analysis of the whole network will be a very time-consuming process (under the condition of ensuring reasonable analysis accuracy). The time of probability analysis is mainly consumed in probability modeling and probability (optimal) power flow calculation, and the time of probability modeling is often ignored by people.
Generally, probabilistic modeling is divided into two blocks — edge distribution modeling and correlation modeling. Edge distribution modeling is generally based on actually measured data, and probability density functions obeyed by variables are estimated through a mathematical statistics method, and the process needs a larger sample length. For correlation modeling, it is common to describe the correlation between random variables using pearson correlation coefficients, i.e., linear correlation coefficients. Similarly, the linear correlation coefficient is calculated from measured data corresponding to the variable.
However, in probabilistic (optimal) power flow analysis, in order to perform a probabilistic calculation, it is first necessary to generate a sample that satisfies a probabilistic model. Practice has shown, however, that a considerable number of random variables are subject to edge distribution types that are not gaussian, whereas non-gaussian samples with correlations cannot be generated directly. Therefore, when dealing with random variables with linear correlations, other methods are needed to generate appropriate samples, such as the Nataf transform.
On the premise of knowing the cumulative distribution function of the random variable or the inverse function thereof, the Nataf transformation effectively connects the standard Gaussian distribution domain and the original distribution domain of the variable by an equal probability method. Thus, the generation of the original domain samples translates into the problem of generating standard gaussian distributed samples with a specific correlation, which can be easily implemented. However, the problem is that the Nataf transform is a monotonous nonlinear transformation process, and the linear correlation coefficient values of two variables are changed before and after the nonlinear transformation based on the properties of the linear correlation coefficient. Therefore, it is an important step to calculate the correlation coefficient of the corresponding variable in the standard gaussian domain according to the distribution information of the variable in the original distribution domain, such as the edge distribution and the correlation coefficient. From the knowledge of probability theory, it can be found that the solving process is an implicit solving problem of a nonlinear equation containing double integral.
For the problem of solving the correlation coefficient of the original domain in the Nataf transformation, the traditional Simpson method and the dichotomy method can be used for calculating more accurately, but the calculation speed is very time-consuming. Practice has shown that this method takes up to minutes for the calculation of a correlation coefficient. The correlation coefficient calculations involved may take as long as several days for a large scale system that may contain hundreds or thousands of random variables. Therefore, in a computing scenario with high real-time requirements, such a method is very undesirable. Later through algorithm development, a calculation method based on a Gauss-Hermite product method and an interpolation method is developed, the method has considerable calculation advantages in speed, however, the numerical integration and the interpolation calculation both introduce extra errors, and improper parameter selection can have great influence on calculation results. Therefore, based on such a conventional correlation coefficient calculation method, the practical applicability of the method may not be high in the application of real-time probabilistic optimal power flow calculation of a large power grid containing high-dimensional random sources.
Disclosure of Invention
Aiming at the defects of the prior art, the invention aims to provide a random variable sample generation method based on Nataf transformation, which utilizes a radial basis function neural network to simplify the conversion relation of correlation coefficients of random variables in an original distribution domain and a standard Gaussian distribution domain in the Nataf transformation, accelerates the solving speed of the correlation coefficients of the random variables in the standard Gaussian distribution domain in the Nataf transformation, and further accelerates the generation speed of the random variable sample.
The purpose of the invention is realized by adopting the following technical scheme:
the invention provides a random variable sample generation method based on Nataf transformation, and the improvement is that the method comprises the following steps:
establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation;
determining a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and generating sample data of the random variable in the original distribution domain by using a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
The invention provides a random variable sample generation system based on Nataf transformation, and the improvement is that the system comprises:
the establishing module is used for establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation;
the determining module is used for determining the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and the generating module is used for generating sample data of the random variable in the original distribution domain by using the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
Compared with the closest prior art, the invention has the following beneficial effects:
according to the technical scheme provided by the invention, the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain is established according to Nataf transformation; determining a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain; and generating sample data of the random variable in the original distribution domain by using a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain. The method simplifies the conversion relation of the correlation coefficients of the random variables in the original distribution domain and the standard Gaussian distribution domain in the Nataf transformation by using the radial basis function neural network, accelerates the solving speed of the correlation coefficients of the random variables in the standard Gaussian distribution domain in the Nataf transformation, and further accelerates the generation speed of random variable samples.
Drawings
FIG. 1 is a flow chart of a method for generating random variable samples based on Nataf transformation;
FIG. 2 is a probability density diagram of the cost of power generation in an embodiment of the invention;
fig. 3 is a block diagram of a random variable sample generation system based on a Nataf transform.
Detailed Description
The following describes embodiments of the present invention in further detail with reference to the accompanying drawings.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention provides a random variable sample generation method based on Nataf transformation, which comprises the following steps of:
step 101, establishing a conversion relation between a correlation coefficient matrix of a random variable in an original distribution domain and a correlation coefficient matrix of a random variable in a standard Gaussian distribution domain according to Nataf transformation;
102, determining a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and 103, generating sample data of the random variable in the original distribution domain by using the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
Specifically, the step 101 includes:
step 101-1, forming an N-dimensional vector Z by using N random variables which obey standard Gaussian distribution, and determining a joint density function between a random variable i and a random variable j in the vector Z;
step 101-2, determining correlation coefficients ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domain based on the principle that the cumulative distribution function value of the random variables in the original distribution domain is equal to the cumulative distribution function value of the random variables in the gaussian distribution domain (Nataf transform) and the joint density function between the random variable i and the random variable j in the vector ZXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relationship between them;
in the preferred embodiment of the present invention, the matrix C of correlation coefficients of random variables in the original distribution domainXMatrix C of correlation coefficients with random variables in the standard Gaussian distribution domainZAre all N × order N matrices, and are all strictly symmetric real number square matrices (rho)Xij=ρXji、ρZij=ρZji),CXAnd CZAll main diagonals ofThe elements are all 1, and correlation coefficient calculation formulas of the random variable i and the random variable j in the original distribution domain are determined by using a Pearson correlation coefficient method.
Wherein i, j ∈ (1-N), N is the number of random variables in the power system, and the joint density function between the random variable i and the random variable j in the vector Z
Figure BDA0002483466080000041
ziIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
In a preferred embodiment of the invention, the random variables of the power system may include: wind speed of a wind power plant, illumination intensity of a photovoltaic power plant, load, tidal energy speed of a tidal power plant, branch parameters of a power system and the like.
Further, the step 101-2 specifically includes:
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domain according to the following formulaXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between:
Figure BDA0002483466080000042
in the formula (I), the compound is shown in the specification,
Figure BDA0002483466080000051
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure BDA0002483466080000052
to be phi (z)j) Substituting into the inverse function of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, Φ (z)i) The argument for the random variable i in the normalized Gaussian distribution domain is ziCumulative score of random variable i in time-corresponding standard Gaussian distribution domainValue of the distribution function, Φ (z)j) The argument for the random variable j in the normalized Gaussian distribution domain is zjThe value of the cumulative distribution function of the random variable j in the standard Gaussian distribution domain, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjIs the variance of a random variable j after synchronously sampling N random variables of the power system in the original distribution domain.
In a specific embodiment of the present invention, the correlation coefficients ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between the random variables is based on the joint density function between the random variable i and the random variable j in the vector Z
Figure BDA0002483466080000053
Conversion relation of random variable i in original distribution domain and random variable i in standard Gaussian distribution domain
Figure BDA0002483466080000054
Conversion relation between random variable j in original distribution domain and random variable j in standard Gaussian distribution domain
Figure BDA0002483466080000055
And the correlation coefficients of the random variable i and the random variable j in the original distribution domain
Figure BDA0002483466080000056
μijTo be the mean value of the product of the random variable i and the random variable j after synchronously sampling the N random variables of the power system in the original distribution domain,
Figure BDA0002483466080000057
as a function of the cumulative distribution of the random variable i in the standard gaussian distribution domain,
Figure BDA0002483466080000058
as a function of the cumulative distribution of the random variable i in the original distribution domain,
Figure BDA0002483466080000059
as a function of the cumulative distribution of the random variable j in the standard gaussian distribution domain,
Figure BDA00024834660800000510
is a cumulative distribution function of the random variable j in the original distribution domain.
In the preferred embodiment of the present invention, the process of obtaining the cumulative distribution function of the random variable i to the power system in the original distribution domain may be:
analyzing sampling data of a random variable i of the power system in the original distribution domain by adopting a mathematical statistics method, and acquiring a probability distribution function of the random variable i of the power system in the original distribution domain; and determining the cumulative distribution function of the random variable i of the power system in the original distribution domain according to the probability distribution function of the random variable i of the power system in the original distribution domain.
Solving the correlation coefficient matrix C of random variables in the standard Gaussian distribution domain in the inventionZI.e. solving for all off-diagonal element values therein. According to the symmetry of the correlation coefficient matrix, the elements that need to be calculated are only the upper triangular elements (or the lower triangular elements) and do not include the main diagonal elements. On the premise of N random variables, the number of elements to be calculated is 0.5N (N-1), and correlation coefficients rho of the random variables i and j in a correlation coefficient matrix of the power system in a standard Gaussian distribution domain are calculatedZijMay be illustrated by step 102, wherein step 102 comprises:
step 102-1, synchronously sampling N for random variables i and j of the power system in a standard Gaussian distribution domainsThen, obtain NsVector Z formed by sampling data of random variable i and random variable j in sub-samplingij,SAnd will move towardsQuantity Zij,SIn the formula
Figure BDA0002483466080000061
Obtaining a vector Zij,SCorresponding vector RBFij,s
In the preferred embodiment of the invention, a radial basis function neural network is used to approximate ZiAnd ZjFor input, take XiAnd XjThe product of (a) is the transfer relationship of the output, formulated as
Figure BDA0002483466080000062
Sampling random variables i and j in a standard Gaussian distribution domain in a two-dimensional space formed by the random variables i and j to obtain NsSamples by a vector Zij,SIs represented by the formulaij,SRespectively substituting the medium elements into the above formula to obtain a vector RBFij,s
Step 102-2, vector Zij,SSum vector RBFij,sRespectively as radial basis function neural networks WijInput data and output data of
Figure BDA0002483466080000063
As a radial basis neural network WijSetting function of the kth neuron of the hidden layer
Figure BDA0002483466080000064
As a radial basis neural network WijObtaining a setting function of an output layer to obtain a radial basis function (Wn)ijN of the hidden layersAn output weight matrix ω of each neuron;
in the preferred embodiment of the present invention, RBF (Z)ij) Is the transfer function of the radial basis function neural network, which represents when a given input quantity is (Z)i,Zj) The approximate output value of the radial basis function neural network after the specific sample value is RBF (Z)ij) The radial basis function neural network structure is a three-layer network structure including an input layer, a hidden layer (radial basis layer), and an output layer, the input layer including two neurons respectively corresponding to an input quantity ZiAnd Zj(ii) a The hidden layer (radial base layer) comprises NsEach neuron is a radial basis function, and the radial basis function of the kth neuron is phiRBF,k(Zij) With a function width parameter of 1, the radial basis function of the kth neuron is expanded to
Figure BDA0002483466080000065
(Zi,k,Zj,k) The number of neurons in the output layer is 1, which is the center of the kth neuron. The whole radial basis function neural network is fully connected between direct layers, and the weight omega is set in the process of transmitting the kth neuron of the hidden layer to the output layerkThus, the transmission expression of the whole radial basis function neural network is
Figure BDA0002483466080000071
Or RBF (Z)ij)=ΦRBF(Zij) ω; with Zij,SInputting data for radial basis function neural network, using RBFij,sAnd as the output data of the radial basis function neural network, the weight between each neuron in the hidden layer of the radial basis function neural network and the output layer of the radial basis function neural network can be obtained by utilizing the transfer expression of the radial basis function neural network.
102-3, based on the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between them and the radial basis function neural network WijN of the hidden layersDetermining correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domain by output weight matrix omega of each neuronZij
Wherein the content of the first and second substances,
Figure BDA0002483466080000072
Zi,hthe value Z of the random variable i is the value of the random variable i when the nth sampling of the N random variables of the power system is carried out in the standard Gaussian distribution domainj,hFor N random numbers of power systems in a standard Gaussian distribution domainThe value of the random variable j at the h-th sampling of the machine variable,
Figure BDA0002483466080000077
ωkis a radial basis network WijThe output weight of the kth neuron of the hidden layer,
Figure BDA0002483466080000073
is represented by (Z)i,h,Zj,h) As a radial basis network WijThe output data of the network, RBF (Z)ij) Is represented by (Z)i,Zj) As a radial basis network WijThe input data of (c) is the output data of the network output layer, (Z)i,k,Zj,k) Is a radial basis network WijCenter of the kth neuron of the hidden layer, ΦRBF,k(Zij) Is a radial basis network WijHidden layer output matrix of (1) h, k ∈ (1-N)s),
Figure BDA0002483466080000074
NsIs the total number of sample points that are,
Figure BDA0002483466080000075
is represented by (Z)i,Zj) As a radial basis network WijWhen the data is input, the output data of the kth neuron of the network hidden layer, i, j ∈ (1-N), N is the number of random variables in the power system, z isiIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
Further, the step 102-3 includes:
step A: based on formula
Figure BDA0002483466080000076
And RBF (Z)ij)=ΦRBF(Zij) ω the correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domainXijRandom variable in matrix of correlation coefficient with random variable in standard Gaussian distribution domaini and the correlation coefficient ρ of the random variable jZijThe conversion relationship between them is transformed into:
Figure BDA0002483466080000081
and B: order to
Figure BDA0002483466080000082
And the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship of (a) is transformed into:
Figure BDA0002483466080000083
and C: the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure BDA0002483466080000084
in the preferred embodiment of the invention, p is calculated considering that the radial basis functions are all gaussian functions with absolute multiplicationsXijAnd rhoZijOf the conversion type
Figure BDA0002483466080000085
The double integral part of (a) is denoted by I,
Figure BDA0002483466080000086
in I to
Figure BDA0002483466080000087
Transformation into formula
Figure BDA0002483466080000088
And the two norms thereof are expanded, I can be transformed into:
Figure BDA0002483466080000089
and then to
Figure BDA00024834660800000810
Wherein
Figure BDA00024834660800000811
Figure BDA00024834660800000812
Step D: order to
Figure BDA00024834660800000813
And initializing correlation coefficients rho of random variable i and random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domainZij,0=ρXijThe current iteration number L is 1;
in the preferred embodiment of the invention, simplified Newton's method is used, with pXijAs ρZijThe initial value of (c), the fixed Jacobian matrix element (i.e., the partial derivative function G' (ρ) in each iterationZij) Constant 1) and using ρZij,L=ρZij,L-1+G(ρZij,L-1) Iterate when G (ρ)Zij,L-1) Satisfies the iteration termination condition, | G (ρ)Zij,L-1) If is | <, the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain at the L-th iteration is determinedZij,LAnd the correlation coefficients of the random variable i and the random variable j in the correlation coefficient matrix serving as the random variable in the standard Gaussian distribution domain.
General formula
Figure BDA0002483466080000091
The medium independent variable is rhoZijSetting ρ at initial iterationZij,0=ρXij
Step (ii) ofE: will rhoZij,L-1Substitution into
Figure BDA0002483466080000092
In (1), calculating G (ρ)Zij,L-1) And G (ρ)Zij,L-1) Substitution into the iterative equation ρZij,L=ρZij,L-1+G(ρZij,L-1) In the method, the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain during the L-th iteration is obtainedZij,L
Step F: when | G (ρ)Zij,L-1) When | is less than | then let ρZij,LThe correlation coefficients of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain are obtained, otherwise, the correlation coefficient is made L +1, and the step E is returned;
wherein, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjTo synchronize the variance of the random variable j after sampling the N random variables of the power system in the original distribution domain,
Figure BDA0002483466080000093
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure BDA0002483466080000094
to be phi (z)j) Substituting into the inverse of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, UiIndependent variables, U, for the standard Gaussian distribution-compliant and independent random variable i in the Gaussian distribution domainjIs independent variable of independent random variable j in Gaussian distribution domain, which obeys standard Gaussian distribution, A is a first parameter, BkIs a second parameter, EkIs the third parameter, D is the fourth parameter, G (ρ)Zij) Is the fifth parameter, is the end of iterationThe threshold value of the threshold value is set,
Figure BDA0002483466080000095
Figure BDA0002483466080000101
Zi,kthe method is the value of a random variable i, Z in the k-th sampling of N random variables of the power system in a standard Gaussian distribution domainj,kThe method is the value of the random variable j in the standard Gaussian distribution domain when the kth sampling is carried out on the N random variables of the power system.
Specifically, the step 103 includes:
step 103-1, generating NPOPFSet independent and standard Gaussian distribution compliant sample data, and using NPOPFThe group of independent sample data which obeys the standard Gaussian distribution forms N rows NPOPFSample matrix U of columnsPOPF
103-2, according to the correlation coefficient matrix C of the random variables in the standard Gaussian distribution domainZSum matrix UPOPFDetermining a sample matrix Z of random variables in a standard Gaussian distribution domainPOPF
Step 103-3, a sample matrix Z of random variables in the standard Gaussian distribution domainPOPFSample matrix X converted to random variables in the original distribution domainPOPF
Wherein each group of sample data comprises N elements,
Figure BDA0002483466080000102
XPOPF,ifis a matrix XPOPFRow i and column f ofPOPF,ifIs a matrix ZPOPFRow i and column f ofPOPF=LNPOPFL is a correlation coefficient matrix C for random variables in a standard Gaussian distribution domainZThe resulting lower triangular matrix, i.e. C, using Cholesky decompositionZ=L·LTT is a transposed symbol, i ∈ (1-N), f ∈ (1-N)POPF) N is the total number of random variables of the power system, NPOPFThe number of generated sample sets.
In the preferred embodiment of the present inventionIn the method, a sample matrix X of random variables in an original distribution domain can be obtained by using a Monte Carlo methodPOPFAnd performing probability optimal power flow calculation or power flow calculation, and counting output results of the probability optimal power flow calculation or power flow calculation so as to determine the probability distribution condition of random variables in the power system.
In the best embodiment of the invention, an example is constructed to verify the validity of the algorithm. The method is based on standard IEEE118 node calculation and adopts the following expansion mode: and fans are additionally arranged at part of nodes, and the specific expansion condition is shown in table 1.
TABLE 1
Figure BDA0002483466080000103
For the wind speeds of all the wind turbines, a Weibull distribution (Weibull distribution) with a scale parameter of 10.7 and a shape parameter of 3.97 is adopted, and a correlation coefficient matrix among the wind speeds is shown in Table 2.
TABLE 2
Figure BDA0002483466080000111
In addition, all active loads (99 in total) are subjected to normal distribution with the average value of the original value of the standard calculation example and the standard deviation of 0.015 times of the average value. In each calculation, the power factor is kept the same as the standard calculation example, so that the value of the reactive load is determined. In the node order, the first 50 loads were set as the first group loads, the remaining 49 loads were the second group loads, the correlation coefficient between the internal loads of each group was 0.8, the correlation coefficient between the inter-group loads was 0, and the correlation coefficient between the loads and the wind speed was 0. The conversion formula for converting the wind speed into the active power output of the corresponding fan is as follows:
Figure BDA0002483466080000112
wherein v iswindRepresenting wind speed in m/s; pTThe active output of a single fan is shown,unit MW; the reactive power consumed by each fan is a constant value of-0.0002 MVar.
According to the setting, the correlation coefficient conversion method selects a Simpson method to obtain integral, and a dichotomy method is used for solving a result obtained by a nonlinear equation to serve as reference.
The correlation conversion accuracy and speed results are shown in table 3. The Simpson method and the Newton method are used as reference methods, the Gauss-Hermite method and the interpolation method are more advanced solving methods at present, and the radial basis function neural network and the simplified Newton method are used as methods provided by the invention.
TABLE 3
Figure BDA0002483466080000113
Based on a reference correlation coefficient conversion method and a method based on a radial basis function neural network provided by the invention, a Monte Carlo method (10000 simple random samples) is combined to calculate a probability optimal power flow, the probability distribution condition of an objective function is considered, the obtained result is shown in FIG. 2, FIG. 2 shows that the similarity of the probability densities of the power generation cost calculated by the reference method and the technical scheme provided by the invention is 0.9686, namely representing the proportion of two probability density coincidence parts in the whole distribution condition, and the larger the similarity (close to 1) is, the higher the similarity of the two probability densities is.
The invention provides a random variable sample generation system based on Nataf transformation, as shown in FIG. 3, the system comprises:
the establishing module is used for establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation;
the determining module is used for determining the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and the generating module is used for generating sample data of the random variable in the original distribution domain by using the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
Specifically, the establishing module includes:
the first determining unit is used for forming an N-dimensional vector Z by using N random variables which obey standard Gaussian distribution, and determining a joint density function between a random variable i and a random variable j in the vector Z;
a second determining unit, configured to determine a correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domain according to a principle that a cumulative distribution function value of the random variables in the original distribution domain is equal to a cumulative distribution function value of the random variables in the gaussian distribution domain and a joint density function between the random variable i and the random variable j in the vector ZXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them;
wherein i, j ∈ (1-N), N is the number of random variables in the power system, and the joint density function between the random variable i and the random variable j in the vector Z
Figure BDA0002483466080000121
ziIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
The second determining unit is specifically configured to: further, the third determining unit is specifically configured to:
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domain according to the following formulaXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between:
Figure BDA0002483466080000131
in the formula (I), the compound is shown in the specification,
Figure BDA0002483466080000132
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure BDA0002483466080000133
to be phi (z)j) Substituting into the inverse function of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, Φ (z)i) The argument for the random variable i in the normalized Gaussian distribution domain is ziThe value of the cumulative distribution function of the random variable i in the standard Gaussian distribution domain, phi (z), corresponding to the timej) The argument for the random variable j in the normalized Gaussian distribution domain is zjThe value of the cumulative distribution function of the random variable j in the standard Gaussian distribution domain, muiIs the mean value, μ, of the random variable i after the synchronous sampling of the N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjIs the variance of a random variable j after synchronously sampling N random variables of the power system in the original distribution domain.
Further, the third determining unit is specifically configured to:
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domain according to the following formulaXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relationship between:
in the formula (I), the compound is shown in the specification,
Figure BDA0002483466080000134
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure BDA0002483466080000135
to be phi (z)j) Substitution into original distribution Domain randomnessThe value obtained in the inverse of the cumulative distribution function of the variable j.
Specifically, the determining module includes:
a first acquisition unit for synchronously sampling N random variables of the power system in a standard Gaussian distribution domainsThen, obtain NsVector Z formed by sampling data of random variable i and random variable j in sub-samplingij,SAnd will vector Zij,SIn the formula
Figure BDA0002483466080000136
Obtaining a vector Zij,SCorresponding vector RBFij,s
A second acquisition unit for acquiring the vector Zij,SSum vector RBFij,sRespectively as radial basis function neural networks WijInput data and output data of
Figure BDA0002483466080000141
As a radial basis neural network WijSetting function of the kth neuron of the hidden layer
Figure BDA0002483466080000142
As a radial basis neural network WijObtaining a setting function of an output layer to obtain a radial basis function (Wn)ijN of the hidden layersAn output weight matrix ω of each neuron;
a fourth determining unit, configured to determine a correlation coefficient ρ of the random variable i and the random variable j based on the correlation coefficient matrix of the random variables in the original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between them and the radial basis function neural network WijN of the hidden layersDetermining correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domain by output weight matrix omega of each neuronZij
Wherein the content of the first and second substances,
Figure BDA0002483466080000149
Zi,hthe value Z of the random variable i is the value of the random variable i when the nth sampling of the N random variables of the power system is carried out in the standard Gaussian distribution domainj,hIs the value of a random variable j when the nth sampling of the N random variables of the power system is carried out in a standard Gaussian distribution domain,
Figure BDA0002483466080000143
ωkis a radial basis network WijThe output weight of the kth neuron of the hidden layer,
Figure BDA0002483466080000144
Figure BDA0002483466080000145
is represented by (Z)i,h,Zj,h) As a radial basis network WijThe output data of the network, RBF (Z)ij) Is represented by (Z)i,Zj) As a radial basis network WijThe input data of (c) is the output data of the network output layer, (Z)i,k,Zj,k) Is a radial basis network WijCenter of the kth neuron of the hidden layer, ΦRBF,k(Zij) Is a radial basis network WijHidden layer output matrix of (1) h, k ∈ (1-N)s),
Figure BDA0002483466080000146
NsIs the total number of sample points that are,
Figure BDA0002483466080000147
is represented by (Z)i,Zj) As a radial basis network WijWhen the data is input, the output data of the kth neuron of the network hidden layer, i, j ∈ (1-N), N is the number of random variables in the power system, z isiIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
Further, the fourth determining unit includes:
first transformation sub-sheetElement for based on
Figure BDA0002483466080000148
And RBF (Z)ij)=ΦRBF(Zij) ω the correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure BDA0002483466080000151
a second conversion subunit for converting
Figure BDA0002483466080000152
And the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship of (a) is transformed into:
Figure BDA0002483466080000156
a third transformation subunit, configured to transform correlation coefficients ρ of the random variable i and the random variable j in a correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure BDA0002483466080000153
an initialization subunit for ordering
Figure BDA0002483466080000154
And initializing the phases of random variables in a standard Gaussian distribution domainCorrelation coefficient rho of random variable i and random variable j in relational number matrixZij,0=ρXijThe current iteration number L is 1;
an iteration subunit for dividing pZij,L-1Substitution into
Figure BDA0002483466080000155
In (1), calculating G (ρ)Zij,L-1) And G (ρ)Zij,L-1) Substitution into the iterative equation ρZij,L=ρZij,L-1+G(ρZij,L-1) In the method, the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain during the L-th iteration is obtainedZij,L
A judgment subunit for judging when | G (ρ)Zij,L-1) When | is less than | then let ρZij,LThe correlation coefficients of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain are obtained, otherwise, the correlation coefficient is made L +1, and the step E is returned;
wherein, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjTo synchronize the variance of the random variable j after sampling the N random variables of the power system in the original distribution domain,
Figure BDA0002483466080000161
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure BDA0002483466080000162
to be phi (z)j) Substituting into the inverse of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, UiIndependent variables, U, for the standard Gaussian distribution-compliant and independent random variable i in the Gaussian distribution domainjFor complying with standard height in Gaussian distribution domainIndependent variables of a random variable j which is distributed in a gaussian manner, A is a first parameter, BkIs a second parameter, EkIs the third parameter, D is the fourth parameter, G (ρ)Zij) For the fifth parameter, for the iteration termination threshold,
Figure BDA0002483466080000163
Figure BDA0002483466080000164
Zi,kthe method is the value of a random variable i, Z in the k-th sampling of N random variables of the power system in a standard Gaussian distribution domainj,kThe method is the value of the random variable j in the standard Gaussian distribution domain when the kth sampling is carried out on the N random variables of the power system.
Specifically, the generating module includes:
a sample construction unit for generating NPOPFSet independent and standard Gaussian distribution compliant sample data, and using NPOPFThe group of independent sample data which obeys the standard Gaussian distribution forms N rows NPOPFSample matrix U of columnsPOPF
A fifth determining unit for determining a correlation coefficient matrix C based on the random variables in the standard Gaussian distribution domainZSum matrix UPOPFDetermining a sample matrix Z of random variables in a standard Gaussian distribution domainPOPF
A conversion unit for converting a sample matrix Z of random variables in a standard Gaussian distribution domainPOPFSample matrix X converted to random variables in the original distribution domainPOPF
Wherein each group of sample data comprises N elements,
Figure BDA0002483466080000165
XPOPF,ifis a matrix XPOPFRow i and column f ofPOPF,ifIs a matrix ZPOPFRow i and column f ofPOPF=LNPOPFL is a correlation coefficient matrix C for random variables in a standard Gaussian distribution domainZThe resulting lower triangular matrix, i.e. C, using Cholesky decompositionZ=L·LTT is a transposed symbol, i ∈ (1-N), f ∈ (1-N)POPF) N is the total number of random variables of the power system, NPOPFThe number of generated sample sets.
In the best embodiment of the invention, the transfer process of the product of the random variable in the standard Gaussian distribution domain and the random variable in the original distribution domain is approximated through the radial basis function neural network, and an analytical expression of the transfer process is given; analyzing the analytical expression of the transmission process, and changing the correlation coefficient conversion relation of the random variables in the original distribution domain and the Gaussian distribution domain in the integral form into a summation form, thereby simplifying the conversion relation of the correlation coefficients of the random variables in the original distribution domain and the standard Gaussian distribution domain, improving the calculation speed of the correlation coefficient of the random variables in the standard Gaussian distribution domain, and improving the sample generation speed of the random variables in the original distribution domain.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
Finally, it should be noted that: the above embodiments are only for illustrating the technical solutions of the present invention and not for limiting the same, and although the present invention is described in detail with reference to the above embodiments, those of ordinary skill in the art should understand that: modifications and equivalents may be made to the embodiments of the invention without departing from the spirit and scope of the invention, which is to be covered by the claims.

Claims (12)

1. A method for generating random variable samples based on a Nataf transform, the method comprising:
establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation;
determining a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and generating sample data of the random variable in the original distribution domain by using a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
2. The method according to claim 1, wherein the establishing a conversion relationship between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard gaussian distribution domain according to the Nataf transform comprises:
forming an N-dimensional vector Z by using N random variables which obey standard Gaussian distribution, and determining a joint density function between a random variable i and a random variable j in the vector Z;
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domain according to the principle that the cumulative distribution function value of the random variables in the original distribution domain is equal to the cumulative distribution function value of the random variables in the Gaussian distribution domain and the joint density function between the random variable i and the random variable j in the vector ZXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them;
wherein i, j ∈ (1-N), N is the number of random variables in the power system, and the joint density function between the random variable i and the random variable j in the vector Z
Figure FDA0002483466070000011
ziIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
3. The method according to claim 2, wherein the correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domain is determined according to a rule that the cumulative distribution function value of the random variables in the original distribution domain is equal to the cumulative distribution function value of the random variables in the gaussian distribution domain and a joint density function between the random variable i and the random variable j in the vector ZXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between the two, including:
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domain according to the following formulaXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between:
Figure FDA0002483466070000021
in the formula (I), the compound is shown in the specification,
Figure FDA0002483466070000022
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure FDA0002483466070000023
to be phi (z)j) Substituting into the inverse function of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, Φ (z)i) The argument for the random variable i in the normalized Gaussian distribution domain is ziThe value of the cumulative distribution function of the random variable i in the standard Gaussian distribution domain, phi (z), corresponding to the timej) The argument for the random variable j in the normalized Gaussian distribution domain is zjThe value of the cumulative distribution function of the random variable j in the standard Gaussian distribution domain, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjIs the variance of a random variable j after synchronously sampling N random variables of the power system in the original distribution domain.
4. The method of claim 1, wherein determining the correlation coefficient matrix of the random variable in the standard gaussian distribution domain using the radial basis network based on a transformation relationship between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard gaussian distribution domain comprises:
synchronous sampling N for N random variables of power system in standard Gaussian distribution domainsThen, obtain NsVector Z formed by sampling data of random variable i and random variable j in sub-samplingij,SAnd will vector Zij,SIn the formula
Figure FDA0002483466070000024
Obtaining a vector Zij,SCorresponding vector RBFij,s
Will vector Zij,SSum vector RBFij,sRespectively as radial basis function neural networks WijInput data and output data of
Figure FDA0002483466070000025
As a radial basis neural network WijSetting function of the kth neuron of the hidden layer
Figure FDA0002483466070000026
As a radial basis neural network WijObtaining a setting function of an output layer to obtain a radial basis function (Wn)ijN of the hidden layersAn output weight matrix ω of each neuron;
correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix based on random variable in original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between them and the radial basis function neural network WijN of the hidden layersDetermining the output weight matrix omega of each neuron, and determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the standard Gaussian distribution domainZij
Wherein the content of the first and second substances,
Figure FDA0002483466070000031
Zi,hthe value Z of the random variable i is the value of the random variable i when the nth sampling of the N random variables of the power system is carried out in the standard Gaussian distribution domainj,hIs the value of a random variable j when the nth sampling of the N random variables of the power system is carried out in a standard Gaussian distribution domain,
Figure FDA0002483466070000032
ωkis a radial basis network WijThe output weight of the kth neuron of the hidden layer,
Figure FDA0002483466070000033
Figure FDA0002483466070000034
is represented by (Z)i,h,Zj,h) As a radial basis network WijThe output data of the network, RBF (Z)ij) Is represented by (Z)i,Zj) As a radial basis network WijThe input data of (c) is the output data of the network output layer, (Z)i,k,Zj,k) Is a radial basis network WijCenter of the kth neuron of the hidden layer, ΦRBF,k(Zij) Is a radial basis network WijHidden layer output matrix of (1) h, k ∈ (1-N)s),
Figure FDA0002483466070000035
NsIs the total number of sample points that are,
Figure FDA0002483466070000036
is represented by (Z)i,Zj) As a radial basis network WijWhen the data is input, the output data of the kth neuron of the network hidden layer, i, j ∈ (1-N), N is the number of random variables in the power system, z isiIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
5. The method according to claim 4, wherein the correlation coefficient p of the random variable i and the random variable j in the correlation coefficient matrix based on the random variables in the original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between them and the radial basis function neural network WijN of the hidden layersDetermining correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domain by output weight matrix omega of each neuronZijThe method comprises the following steps:
step A: based on formula
Figure FDA0002483466070000037
And RBF (Z)ij)=ΦRBF(Zij) ω the correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure FDA0002483466070000038
and B: order to
Figure FDA0002483466070000041
And the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship of (a) is transformed into:
Figure FDA0002483466070000042
and C: correlating coefficient moment of random variable in original distribution domainCorrelation coefficient rho of random variable i and random variable j in arrayXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure FDA0002483466070000043
step D: order to
Figure FDA0002483466070000044
And initializing correlation coefficients rho of random variable i and random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domainZij,0=ρXijThe current iteration number L is 1;
step E: will rhoZij,L-1Substitution into
Figure FDA0002483466070000045
In (1), calculating G (ρ)Zij,L-1) And G (ρ)Zij,L-1) Substitution into the iterative equation ρZij,L=ρZij,L-1+G(ρZij,L-1) In the method, the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain during the L-th iteration is obtainedZij,L
Step F: when | G (ρ)Zij,L-1) When | is less than | then let ρZij,LThe correlation coefficients of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain are obtained, otherwise, the correlation coefficient is made L +1, and the step E is returned;
wherein, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjTo synchronize the variance of the random variable j after sampling the N random variables of the power system in the original distribution domain,
Figure FDA0002483466070000046
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure FDA0002483466070000047
to be phi (z)j) Substituting into the inverse of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, UiIndependent variables, U, for the standard Gaussian distribution-compliant and independent random variable i in the Gaussian distribution domainjIs independent variable of independent random variable j in Gaussian distribution domain, which obeys standard Gaussian distribution, A is a first parameter, BkIs a second parameter, EkIs the third parameter, D is the fourth parameter, G (ρ)Zij) For the fifth parameter, for the iteration termination threshold,
Figure FDA0002483466070000051
Figure FDA0002483466070000052
Zi,kthe method is the value of a random variable i, Z in the k-th sampling of N random variables of the power system in a standard Gaussian distribution domainj,kThe method is the value of the random variable j in the standard Gaussian distribution domain when the kth sampling is carried out on the N random variables of the power system.
6. The method of claim 1, wherein generating sample data of random variables in an original distribution domain using a matrix of correlation coefficients of the random variables in a standard gaussian distribution domain comprises:
generating NPOPFSet independent and standard Gaussian distribution compliant sample data, and using NPOPFThe group of independent sample data which obeys the standard Gaussian distribution forms N rows NPOPFSample matrix U of columnsPOPF
Correlation coefficient matrix C based on random variables in standard Gaussian distribution domainZSum matrix UPOPFDetermining a sample matrix Z of random variables in a standard Gaussian distribution domainPOPF
Sample matrix Z of random variables in standard Gaussian distribution domainPOPFSample matrix X converted to random variables in the original distribution domainPOPF
Wherein each group of sample data comprises N elements,
Figure FDA0002483466070000053
XPOPF,ifis a matrix XPOPFRow i and column f ofPOPF,ifIs a matrix ZPOPFRow i and column f ofPOPF=LNPOPFL is a correlation coefficient matrix C for random variables in a standard Gaussian distribution domainZThe resulting lower triangular matrix, i.e. C, using Cholesky decompositionZ=L·LTT is a transposed symbol, i ∈ (1-N), f ∈ (1-N)POPF) N is the total number of random variables of the power system, NPOPFThe number of generated sample sets.
7. A Nataf transform-based random variable sample generation system, the system comprising:
the establishing module is used for establishing a conversion relation between a correlation coefficient matrix of the random variable in the original distribution domain and a correlation coefficient matrix of the random variable in the standard Gaussian distribution domain according to Nataf transformation;
the determining module is used for determining the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain by adopting a radial basis network based on the conversion relation between the correlation coefficient matrix of the random variable in the original distribution domain and the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain;
and the generating module is used for generating sample data of the random variable in the original distribution domain by using the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain.
8. The system of claim 7, wherein the setup module comprises:
the first determining unit is used for forming an N-dimensional vector Z by using N random variables which obey standard Gaussian distribution, and determining a joint density function between a random variable i and a random variable j in the vector Z;
a second determining unit, configured to determine a correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domain according to a principle that a cumulative distribution function value of the random variables in the original distribution domain is equal to a cumulative distribution function value of the random variables in the gaussian distribution domain and a joint density function between the random variable i and the random variable j in the vector ZXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them;
wherein i, j ∈ (1-N), N is the number of random variables in the power system, and the joint density function between the random variable i and the random variable j in the vector Z
Figure FDA0002483466070000061
ziIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
9. The system of claim 8, wherein the second determining unit is specifically configured to:
determining the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domain according to the following formulaXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between:
Figure FDA0002483466070000062
in the formula (I), the compound is shown in the specification,
Figure FDA0002483466070000063
to be phi (z)i) Accumulation of random variable i substituted into original distribution domainThe values obtained in the inverse of the distribution function,
Figure FDA0002483466070000064
to be phi (z)j) Substituting into the inverse function of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, Φ (z)i) The argument for the random variable i in the normalized Gaussian distribution domain is ziThe value of the cumulative distribution function of the random variable i in the standard Gaussian distribution domain, phi (z), corresponding to the timej) The argument for the random variable j in the normalized Gaussian distribution domain is zjThe value of the cumulative distribution function of the random variable j in the standard Gaussian distribution domain, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjIs the variance of a random variable j after synchronously sampling N random variables of the power system in the original distribution domain.
10. The system of claim 7, wherein the determination module comprises:
a first acquisition unit for synchronously sampling N random variables of the power system in a standard Gaussian distribution domainsThen, obtain NsVector Z formed by sampling data of random variable i and random variable j in sub-samplingij,SAnd will vector Zij,SIn the formula
Figure FDA0002483466070000071
Obtaining a vector Zij,SCorresponding vector RBFij,s
A second acquisition unit for acquiring the vector Zij,SSum vector RBFij,sRespectively as radial basis function neural networks WijInput data and output data of
Figure FDA0002483466070000072
As a radial basis neural network WijSetting function of the kth neuron of the hidden layer
Figure FDA0002483466070000073
As a radial basis neural network WijObtaining a setting function of an output layer to obtain a radial basis function (Wn)ijN of the hidden layersAn output weight matrix ω of each neuron;
a fourth determining unit, configured to determine a correlation coefficient ρ of the random variable i and the random variable j based on the correlation coefficient matrix of the random variables in the original distribution domainXijAnd correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domainZijThe conversion relation between them and the radial basis function neural network WijN of the hidden layersDetermining correlation coefficient rho of random variable i and random variable j in correlation coefficient matrix of random variable in standard Gaussian distribution domain by output weight matrix omega of each neuronZij
Wherein the content of the first and second substances,
Figure FDA0002483466070000074
Zi,hthe value Z of the random variable i is the value of the random variable i when the nth sampling of the N random variables of the power system is carried out in the standard Gaussian distribution domainj,hIs the value of a random variable j when the nth sampling of the N random variables of the power system is carried out in a standard Gaussian distribution domain,
Figure FDA0002483466070000075
ωkis a radial basis network WijThe output weight of the kth neuron of the hidden layer,
Figure FDA0002483466070000076
Figure FDA0002483466070000077
is represented by (Z)i,h,Zj,h) As a radial basis network WijThe output data of the network, RBF (Z)ij) Is represented by (Z)i,Zj) As a radial basis network WijThe input data of (c) is the output data of the network output layer, (Z)i,k,Zj,k) Is a radial basis network WijCenter of the kth neuron of the hidden layer, ΦRBF,k(Zij) Is a radial basis network WijHidden layer output matrix of (1) h, k ∈ (1-N)s),
Figure FDA0002483466070000081
NsIs the total number of sample points that are,
Figure FDA0002483466070000082
is represented by (Z)i,Zj) As a radial basis network WijWhen the data is input, the output data of the kth neuron of the network hidden layer, i, j ∈ (1-N), N is the number of random variables in the power system, z isiIs an independent variable, z, of a random variable i in a standard Gaussian distribution domainjIs an argument of a random variable j in the domain of a standard gaussian distribution.
11. The system of claim 10, wherein the fourth determination unit comprises:
a first transformation subunit for being based on
Figure FDA0002483466070000083
And RBF (Z)ij)=ΦRBF(Zij) ω the correlation coefficient ρ of the random variable i and the random variable j in the correlation coefficient matrix of the random variables in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure FDA0002483466070000084
a second transformation sub-unit for performing a second transformation,for making
Figure FDA0002483466070000085
And the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship of (a) is transformed into:
Figure FDA0002483466070000086
a third transformation subunit, configured to transform correlation coefficients ρ of the random variable i and the random variable j in a correlation coefficient matrix of the random variable in the original distribution domainXijCorrelation coefficient rho of random variable i and random variable j in correlation coefficient matrix with random variable in standard Gaussian distribution domainZijThe conversion relationship between them is transformed into:
Figure FDA0002483466070000087
an initialization subunit for ordering
Figure FDA0002483466070000088
And initializing correlation coefficients rho of random variable i and random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domainZij,0=ρXijThe current iteration number L is 1;
an iteration subunit for dividing pZij,L-1Substitution into
Figure FDA0002483466070000091
In (1), calculating G (ρ)Zij,L-1) And G (ρ)Zij,L-1) Substitution into the iterative equation ρZij,L=ρZij,L-1+G(ρZij,L-1) In the method, the correlation coefficient rho of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain during the L-th iteration is obtainedZij,L
A judgment subunit for judging when | G (ρ)Zij,L-1) When | is less than | then let ρZij,LThe correlation coefficients of the random variable i and the random variable j in the correlation coefficient matrix of the random variable in the standard Gaussian distribution domain are obtained, otherwise, the correlation coefficient is made L +1, and the step E is returned;
wherein, muIs the mean value of the random variable i after the synchronous sampling of N random variables of the power system in the original distribution domainjIs the mean value, sigma, of a random variable j after synchronously sampling N random variables of the power system in an original distribution domainiFor the variance, σ, of the random variable i after the N random variables of the power system are synchronously sampled in the original distribution domainjTo synchronize the variance of the random variable j after sampling the N random variables of the power system in the original distribution domain,
Figure FDA0002483466070000092
to be phi (z)i) Substituting into the value obtained in the inverse function of the cumulative distribution function of the random variable i in the original distribution domain,
Figure FDA0002483466070000093
to be phi (z)j) Substituting into the inverse of the cumulative distribution function of the random variable j in the original distribution domain to obtain a value, UiIndependent variables, U, for the standard Gaussian distribution-compliant and independent random variable i in the Gaussian distribution domainjIs independent variable of independent random variable j in Gaussian distribution domain, which obeys standard Gaussian distribution, A is a first parameter, BkIs a second parameter, EkIs the third parameter, D is the fourth parameter, G (ρ)Zij) For the fifth parameter, for the iteration termination threshold,
Figure FDA0002483466070000094
Figure FDA0002483466070000095
Zi,kthe method is the value of a random variable i, Z in the k-th sampling of N random variables of the power system in a standard Gaussian distribution domainj,kThe method is the value of the random variable j in the standard Gaussian distribution domain when the kth sampling is carried out on the N random variables of the power system.
12. The system of claim 7, wherein the generation module comprises:
a sample construction unit for generating NPOPFSet independent and standard Gaussian distribution compliant sample data, and using NPOPFThe group of independent sample data which obeys the standard Gaussian distribution forms N rows NPOPFSample matrix U of columnsPOPF
A fifth determining unit for determining a correlation coefficient matrix C based on the random variables in the standard Gaussian distribution domainZSum matrix UPOPFDetermining a sample matrix Z of random variables in a standard Gaussian distribution domainPOPF
A conversion unit for converting a sample matrix Z of random variables in a standard Gaussian distribution domainPOPFSample matrix X converted to random variables in the original distribution domainPOPF
Wherein each group of sample data comprises N elements,
Figure FDA0002483466070000101
XPOPF,ifis a matrix XPOPFRow i and column f ofPOPF,ifIs a matrix ZPOPFRow i and column f ofPOPF=LNPOPFL is a correlation coefficient matrix C for random variables in a standard Gaussian distribution domainZThe resulting lower triangular matrix, i.e. C, using Cholesky decompositionZ=L·LTT is a transposed symbol, i ∈ (1-N), f ∈ (1-N)POPF) N is the total number of random variables of the power system, NPOPFThe number of generated sample sets.
CN202010384994.7A 2020-05-09 2020-05-09 Nataf transformation-based random variable sample generation method and system Pending CN111753252A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010384994.7A CN111753252A (en) 2020-05-09 2020-05-09 Nataf transformation-based random variable sample generation method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010384994.7A CN111753252A (en) 2020-05-09 2020-05-09 Nataf transformation-based random variable sample generation method and system

Publications (1)

Publication Number Publication Date
CN111753252A true CN111753252A (en) 2020-10-09

Family

ID=72673378

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010384994.7A Pending CN111753252A (en) 2020-05-09 2020-05-09 Nataf transformation-based random variable sample generation method and system

Country Status (1)

Country Link
CN (1) CN111753252A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115455668A (en) * 2022-08-26 2022-12-09 哈尔滨工业大学(深圳) Simulation data generation method and device for table data and electronic equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115455668A (en) * 2022-08-26 2022-12-09 哈尔滨工业大学(深圳) Simulation data generation method and device for table data and electronic equipment
CN115455668B (en) * 2022-08-26 2023-05-30 哈尔滨工业大学(深圳) Simulation data generation method and device for table data and electronic equipment

Similar Documents

Publication Publication Date Title
Snijders Enumeration and simulation methods for 0–1 matrices with given marginals
Sallam et al. An improved gaining-sharing knowledge algorithm for parameter extraction of photovoltaic models
WO2019091356A1 (en) Method and system for probabilistic load flow computation
CN111079351B (en) Power distribution network probability power flow acquisition method and device considering wind power uncertainty
CN111178616B (en) Wind speed prediction method based on negative correlation learning and regularization extreme learning machine integration
CN111797132B (en) Multi-renewable energy power station power scene generation method considering space-time correlation
CN113688869B (en) Photovoltaic data missing reconstruction method based on generation countermeasure network
CN111682530B (en) Method, device, equipment and medium for determining out-of-limit probability of voltage of power distribution network
CN110456188A (en) The stability of power system detection system and method for sparse polynomial chaos expansion
CN109066651B (en) Method for calculating limit transmission power of wind power-load scene
CN114548532A (en) VMD-based TGCN-GRU ultra-short-term load prediction method and device and electronic equipment
CN111859283B (en) Scene generation method considering multi-energy charge time sequence and correlation
CN114283320A (en) Target detection method based on full convolution and without branch structure
Sun et al. A powerful Bio-Inspired optimization algorithm based PV cells diode models parameter estimation
CN111192158A (en) Transformer substation daily load curve similarity matching method based on deep learning
CN111753252A (en) Nataf transformation-based random variable sample generation method and system
CN113435595A (en) Two-stage optimization method for extreme learning machine network parameters based on natural evolution strategy
CN110707703B (en) Improved Nataf transformation-based efficient probabilistic power flow calculation method containing high-dimensional related uncertain sources
CN113627655B (en) Method and device for simulating and predicting pre-disaster fault scene of power distribution network
CN109494747B (en) Power grid probability load flow calculation method based on alternating gradient algorithm
CN111797564A (en) Method and system for obtaining correlation sample of high-dimensional distributed photovoltaic output
Chen et al. Variation-cognizant probabilistic power flow analysis via multi-task learning
Bae et al. Optimization of silicon solar cell fabrication based on neural network and genetic programming modeling
CN111179110A (en) Virtual power plant variable order aggregation equivalent robust dynamic model modeling method and device
CN112952843B (en) Distributed photovoltaic probability power flow calculation method for power distribution network based on Copula theory

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination