CN111680820A - Distributed photovoltaic power station fault diagnosis method and device - Google Patents

Distributed photovoltaic power station fault diagnosis method and device Download PDF

Info

Publication number
CN111680820A
CN111680820A CN202010381231.7A CN202010381231A CN111680820A CN 111680820 A CN111680820 A CN 111680820A CN 202010381231 A CN202010381231 A CN 202010381231A CN 111680820 A CN111680820 A CN 111680820A
Authority
CN
China
Prior art keywords
photovoltaic power
power station
theoretical
fault diagnosis
power generation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010381231.7A
Other languages
Chinese (zh)
Other versions
CN111680820B (en
Inventor
赵健
周宁
刘昊
孙芊
王鹏
马建伟
王磊
张建宾
朱红路
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sinokey Technology Co ltd
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Henan Electric Power Co Ltd
Original Assignee
Beijing Sinokey Technology Co ltd
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Henan Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sinokey Technology Co ltd, State Grid Corp of China SGCC, Electric Power Research Institute of State Grid Henan Electric Power Co Ltd filed Critical Beijing Sinokey Technology Co ltd
Priority to CN202010381231.7A priority Critical patent/CN111680820B/en
Publication of CN111680820A publication Critical patent/CN111680820A/en
Application granted granted Critical
Publication of CN111680820B publication Critical patent/CN111680820B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Educational Administration (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Primary Health Care (AREA)
  • Photovoltaic Devices (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The application relates to a distributed photovoltaic power station fault diagnosis method which comprises the steps of firstly collecting historical operation data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station; training a BP neural network according to the collected historical operation data of each photovoltaic power station to obtain a prediction model of theoretical power generation, and establishing a distributed photovoltaic fault diagnosis model by using a decision tree method; and then, output data of each photovoltaic power station of the photovoltaic power stations are monitored, the output data are input into a prediction model of theoretical generated energy to obtain theoretical generated energy, correlation coefficients are calculated according to the output data and the theoretical generated energy, and the correlation coefficients are input into a distributed photovoltaic fault diagnosis model to judge the state of the power stations. The fault diagnosis method can be used for judging the state of the power station.

Description

Distributed photovoltaic power station fault diagnosis method and device
Technical Field
The application belongs to the technical field of photovoltaic power generation fault diagnosis, and particularly relates to a distributed photovoltaic power station fault diagnosis method and device based on a BP neural network and a decision tree.
Background
The photovoltaic price is low, is not limited by geographical position, can satisfy off-grid system energy demand, and market is wide. The renewable energy generation (RDEG) technology market, released by international market research institute Technavio, will grow 295.15GW scale during 2019-. In recent years, the development speed of the Chinese photovoltaic power generation is remarkable, 44.06GW is newly added to the national photovoltaic power generation in 2018, and the national photovoltaic power generation accumulation loading reaches 174.63 GW. With the rapid increase of the installed photovoltaic capacity, the intelligent operation function of the photovoltaic power generation system). The implementation of the above functions depends on the quality and reliability of the data.
The current distributed photovoltaic power station is different from a large grid-connected photovoltaic power station, and data collected during operation of the distributed photovoltaic power station often lack meteorological data of a power station field. The lack of meteorological information has led to the failure of many past photovoltaic power plant analysis methods, and has been constrained in practical engineering applications. Therefore, the photovoltaic power station needs to be analyzed and evaluated from a new perspective.
In addition, the output situation of the actual photovoltaic is complex, the difference between the output data and the theoretical data is large, and in order to analyze the power station data, the indexes capable of effectively describing the operating state of the photovoltaic power station are needed, and a method for diagnosing the weak point on the direct current side of the photovoltaic power station based on the time and space functions is provided.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: in order to solve the defects in the prior art, the method and the device for diagnosing the faults of the distributed photovoltaic power station are provided.
The technical scheme adopted by the invention for solving the technical problems is as follows:
a distributed photovoltaic power station fault diagnosis method comprises the following steps:
s1: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
s2: training a BP neural network according to the collected historical operation data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operation data of other photovoltaic power stations and establish a time sequence of the theoretical power generation of each photovoltaic power station;
s3: intercepting the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical generated energy according to the same occurrence time in a certain time period, calibrating the fault type of each intercepted section at the corresponding occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the intercepted section of the time sequence of the output data and the intercepted section of the time sequence of the theoretical generated energy
S4: taking the correlation coefficient as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and establishing a distributed photovoltaic fault diagnosis model by using a decision tree method;
s5: the method comprises the steps of monitoring output data of each photovoltaic power station of the photovoltaic power stations, inputting the output data into a theoretical power generation prediction model to obtain theoretical power generation, calculating correlation coefficients of the output data and the theoretical power generation, and inputting the correlation coefficients into a distributed photovoltaic fault diagnosis model to judge the state of the power stations.
Preferably, in the distributed photovoltaic power station fault diagnosis method of the present invention, in the step S1, similarity is further performed on the time series of the output data of each photovoltaic power station through the pearson correlation coefficient, so as to obtain a plurality of photovoltaic power stations with higher similarity to the photovoltaic power station;
the theoretical power generation of a particular photovoltaic plant is calculated in step S2 from the selected operating data of a similar higher photovoltaic plant.
Preferably, according to the fault diagnosis method for the distributed photovoltaic power station, the correlation coefficients are relative Euclidean distance and Pearson correlation coefficient.
Preferably, in the distributed photovoltaic power station fault diagnosis method, the calculation formula of the relative Euclidean distance is
Figure BDA0002482057770000031
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure BDA0002482057770000032
Wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure BDA0002482057770000033
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
Preferably, in the distributed photovoltaic power plant fault diagnosis method, the time period intercepted in the step S3 is 2-5 h.
6. A distributed photovoltaic power station fault diagnosis device comprises:
a data acquisition module: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
a prediction model of theoretical power generation: training a BP neural network according to the collected historical operating data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operating data of other photovoltaic power stations;
theoretical generated energy calculation module: the system comprises a power generation system, a power generation system and a power generation system, wherein the power generation system is used for collecting operation data of each photovoltaic power station, calculating theoretical power generation of each photovoltaic power station according to the collected operation data of each photovoltaic power station, and forming a theoretical power generation time sequence according to the calculated theoretical power generation;
a correlation calculation module: the system comprises a time sequence acquisition module, a fault diagnosis module, a power generation module and a power generation module, wherein the time sequence acquisition module is used for acquiring the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical power generation according to the same occurrence time in a certain time period, calibrating the fault type of each acquisition section corresponding to the occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the acquisition section of the time sequence of the output data and the acquisition section of the time sequence of the;
a fault diagnosis model: taking the correlation coefficient obtained by the correlation calculation module as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and training and establishing by using a decision tree method;
a fault diagnosis module: and calculating a correlation coefficient according to the output data correlation calculation module of each photovoltaic power station of the monitored photovoltaic power stations, and inputting the correlation coefficient serving as an input vector into the distributed photovoltaic fault diagnosis model to judge the state of the power station.
Preferably, in the distributed photovoltaic power station fault diagnosis method of the invention, the data acquisition module further performs similarity on the time sequence of the output data of each photovoltaic power station through a pearson correlation coefficient to obtain a plurality of photovoltaic power stations with higher similarity to the photovoltaic power station;
and calculating the theoretical power generation of the specific photovoltaic power station by selecting similar higher operation data of the photovoltaic power station in the theoretical power generation prediction model.
Preferably, according to the fault diagnosis method for the distributed photovoltaic power station, the correlation coefficients are relative Euclidean distance and Pearson correlation coefficient.
Preferably, in the distributed photovoltaic power station fault diagnosis method, the calculation formula of the relative Euclidean distance is
Figure BDA0002482057770000051
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure BDA0002482057770000052
Wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure BDA0002482057770000053
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
Preferably, in the distributed photovoltaic power station fault diagnosis method, the time period intercepted by the correlation calculation module is 2-5 h.
The invention has the beneficial effects that:
drawings
The technical solution of the present application is further explained below with reference to the drawings and the embodiments.
FIG. 1 is a flow chart of a method of fault diagnosis for a photovoltaic power plant;
FIG. 2 is a schematic diagram of a BP neural network;
FIG. 3 is a graph of similarity of time series under different faults;
FIG. 4 is a distance graph of time series under different faults;
FIG. 5 is a schematic diagram of decision tree operation.
Detailed Description
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
The technical solutions of the present application will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
Example 1
The embodiment provides a photovoltaic fault diagnosis method based on a BP neural network, as shown in fig. 1, the specific steps are as follows:
s1: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
the historical operating data of the photovoltaic power station can also be subjected to data preprocessing, and a large blank part or an obvious error part is removed;
taking the data of 10 photovoltaic power stations 2018 in the whole year in the Xin county of China as an example. The ten photovoltaic power stations are composed of photovoltaic power generation systems and photovoltaic power station monitoring systems with the capacities of 40MW respectively, the data sampling interval is 10 minutes, and the output data of the 10 power stations form a time sequence of the output data according to time.
S2: training a BP neural network according to the collected historical operation data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operation data of other photovoltaic power stations and establish a time sequence of the theoretical power generation of each photovoltaic power station;
the function of the theoretical power generation prediction model is to calculate the theoretical power generation of each photovoltaic power station, and when the theoretical power generation of a specific power station (such as the power station 1) is calculated, the theoretical power generation is calculated by using the output data of other photovoltaic power stations (power stations 2-9) which do not comprise the specific power station.
The BP neural network is a multi-layer forward feedback neural network, and is mainly characterized by forward propagation of input signals and backward propagation of errors. In forward transmission, the input signal is processed layer by layer from the input layer through the hidden layer to the output layer. The neuron state of each layer only affects the state of the neurons of the next layer. If the output layer does not get the desired output, it performs back propagation and adjusts the network weights and thresholds based on the prediction error so that the decision of the BP neural network is constantly approaching the desired output.
X1,X2,X3,…,XnForming an input series X, Y for the input variables of the BP neural network (here the output data of other photovoltaic plants not including this particular plant)1,Y2,…,YmForming an output series Y, omega for the output variable of the BP neural network (i.e. the theoretical power generation of a particular photovoltaic plant)ijAnd ωjkIs the weight value of the BP neural network. Considering the BP neural network as a non-linear function, if the input variable is a and the output variable is b, the BP neural network reflects a non-linear mapping relationship from a independent variables to b dependent variables.
The network must be trained prior to the neural network prediction that predicts BP. The trained neural network has the capability of prediction and judgment. Generally, the training process of the BP neural network is as follows.
Step 1, initializing the network. Determining the number n of network input variables, the number l of hidden layer nodes and the number m of output variables according to the input and output sequence (X, Y) of the system, and initializing the network weight omega among the neurons of the input layer, the hidden layer and the output layerijjkInitializing hidden layer threshold a, and outputtingLayer threshold b, the stimulus function of the given learning step size and the hidden layer.
And 2, outputting and calculating the hidden layer. According to the input sequence X, the connection weight omega between the input layer and the hidden layerijAnd a hidden layer threshold a, calculating a hidden layer output G.
Figure BDA0002482057770000081
In the formula, l is the number of hidden layer nodes; f is a hidden layer excitation function, the function has various expression forms, and the function selected in this chapter is:
Figure BDA0002482057770000082
and 3, outputting layer output calculation. And calculating the prediction output P of the BP neural network according to the hidden layer output H and the connection weight omega jk and the threshold b.
Figure BDA0002482057770000083
And 4, calculating errors. And calculating a network prediction error e according to the network prediction output P and the expected output sequence Y.
ek=Yk-Pk(4)
And 5, updating the weight value. Updating the network weight omega of the network according to the network prediction error e calculated in the previous stepij,ωjk
Figure BDA0002482057770000084
ωij=ωjk+ηekGj(6)
In the formula, η is a learning step length.
And 6, updating the threshold. New network node thresholds a, b are generated based on the calculated network prediction error e.
Figure BDA0002482057770000091
bk=bk+ek(8)
And 7, judging whether the algorithm iteration is finished or not, and returning to the step 2 if the algorithm iteration is not finished.
Preferably, the output data of the partial power stations with higher correlation can be selected for training, so that the calculated amount is reduced, and the training process is accelerated
Selecting the power station 1 as a target power station, randomly selecting n (n is 2,3, … …,9) power stations from the rest power stations as reference power stations to perform fitting, and calculating fitting errors. And respectively selecting power stations 2-9 to repeat the process. And finally, averaging the errors with the number of the fitting power stations being n each time. Generally, 3 power stations are selected or the number of input sources is determined according to the calculation result.
For example, according to the historical operating data of 10 photovoltaic power stations in 2018 from Ci county, China, the Pearson correlation coefficient is shown in the following table 1. Therefore, the power stations 1, 4, 3, and 7 can be selected as the power stations for calculation, and when the theoretical power generation amount of the photovoltaic power station of the power station 1 is calculated, the theoretical power generation amount of the power stations 4, 3, and 7 can be selected for calculation.
TABLE 1 plant similarity analysis
Figure BDA0002482057770000092
Figure BDA0002482057770000101
S3: intercepting the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical generated energy according to the same occurrence time in a certain time period (the intercepted time period is 2-5h, for example, 3h), calibrating the fault type of each intercepted section corresponding to the occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the intercepted section of the time sequence of the output data and the intercepted section of the time sequence of the theoretical generated energy;
the correlation coefficients are relative Euclidean distance and Pearson correlation coefficient.
The formula for calculating the relative Euclidean distance is
Figure BDA0002482057770000102
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure BDA0002482057770000103
Wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure BDA0002482057770000104
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
S4: taking the correlation coefficient as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and establishing a distributed photovoltaic fault diagnosis model by using a decision tree method;
table 2 output vector of fault type
Type of failure Output vector
Is normal η1=[1 0 0 0]
Abnormal aging η2=[0 1 0 0]
Shadow masking η3=[0 0 1 0]
Open circuit fault η4=[0 0 0 1]
The decision tree is a tree structure with sample attributes as leaf nodes and values of the attributes as branches. The basic principle of decision tree building is to recursively split the training data set into subsets such that each contains states where the target variables are similar, these targets being predictable attributes. And in the splitting process, splitting attribute selection is carried out by using the principle of an information theory. Let the set of fault features of a certain equipment be d ═ d1, d2, …, dn }, the set of fault points e ═ e1, e2, …, em }, d is the set of test attributes (relative euclidean distance and pearson correlation coefficient in this embodiment), and e is the set of class labels (output vector representing fault type in this embodiment). The formed root node of the fault characteristic decision tree is a training set, one internal node represents a test of a fault characteristic, one edge represents a test result, and the leaf represents a certain fault point or a certain fault processing mode. The attribute values for any internal node are discrete, and for each fault the fault signature either exists a 1 (indicating that the fault signature exists) or does not exist a 0 (indicating that the fault signature does not exist).
Taking historical fault diagnosis records as training samples, generating 10 different types of faults at different fault moments and different fault positions by using the established model, extracting corresponding fault characteristics to form corresponding training samples, wherein the number of effective training samples of the 10 different types of faults is 100, and the total number of the samples is 1000: and (3) adopting an ID3 algorithm to take the observable fault characteristics as the test splitting attributes, taking fault points as class labels, correspondingly dividing records, and adopting a pre-pruning mode to control the growth of the tree to form a decision.
1) The tree growth may be stopped when the number of instances to reach this node is less than a certain threshold.
2) Introducing a substitution error rate. When a set is continuously divided on a certain branch of the sub-tree in the calculation process, although all samples do not belong to the same class, if the number of records in different classes is greatly different, an error substitution rate formula is introduced:
Figure BDA0002482057770000121
in the formula: n represents the number of records of the branch; n' represents the number of records in the majority category in the branch; m represents the total number of records in the training set. If the value calculated by the formula is less than a certain threshold value, converting the subtree into a leaf node, otherwise, continuing to call the 1 st step for further decomposition. The algorithm recurses the above operations until the fault-free feature attributes are available to partition the current sample subset or satisfy the condition that the tree stops growing.
The data used in this example is derived from a device fault record, and is used to establish a decision tree to find the association between the fault signature and the fault point. Step 1: and extracting 320 pieces of data, wherein 70% of the data are taken as training tuples, and 30% of the data are taken as test data. And (5) counting a test attribute fault feature set and a fault point set. Step 2: data were preprocessed as above to count statistics of failure points in these 224 records. And 3, step 3: and selecting the attribute which can divide the training set into the most instances by calculating the gain of the fault characteristic information one by one. As can be seen from Table 3, the accuracy of the method for diagnosing various faults is over 97%, so that the fault diagnosis method has high accuracy in actual fault diagnosis of the photovoltaic power station and has practical application value.
TABLE 3 Fault diagnosis accuracy statistics
Figure BDA0002482057770000131
S5: the method comprises the steps of monitoring output data of each photovoltaic power station of the photovoltaic power stations, inputting the output data into a theoretical power generation prediction model to obtain theoretical power generation, calculating correlation coefficients of the output data and the theoretical power generation, and inputting the correlation coefficients into a distributed photovoltaic fault diagnosis model to judge the state of the power stations.
And (4) theoretical power generation calculation is carried out by inputting the monitored output data of each photovoltaic power station into a theoretical power generation prediction model, and if the theoretical power generation prediction model is trained by using a part of photovoltaic power stations with high similarity, the corresponding photovoltaic power stations are also used for calculation at the moment.
During judgment, the time sequence of the output data of the photovoltaic power station and the time sequence of the calculated theoretical power generation amount are also required to be intercepted according to the same time period length of the step S3, and the time sequence is 2-5 h.
Example 2
This implementation provides a distributed photovoltaic power plant fault diagnosis device, corresponding to the method of embodiment 1, including:
a data acquisition module: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
a prediction model of theoretical power generation: training a BP neural network according to the collected historical operating data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operating data of other photovoltaic power stations;
theoretical generated energy calculation module: the system comprises a power generation system, a power generation system and a power generation system, wherein the power generation system is used for collecting operation data of each photovoltaic power station, calculating theoretical power generation of each photovoltaic power station according to the collected operation data of each photovoltaic power station, and forming a theoretical power generation time sequence according to the calculated theoretical power generation;
a correlation calculation module: the system comprises a time sequence acquisition module, a fault diagnosis module, a power generation module and a power generation module, wherein the time sequence acquisition module is used for acquiring the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical power generation according to the same occurrence time in a certain time period, calibrating the fault type of each acquisition section corresponding to the occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the acquisition section of the time sequence of the output data and the acquisition section of the time sequence of the;
a fault diagnosis model: taking the correlation coefficient obtained by the correlation calculation module as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and training and establishing by using a decision tree method;
a fault diagnosis module: and calculating a correlation coefficient according to the output data correlation calculation module of each photovoltaic power station of the monitored photovoltaic power stations, and inputting the correlation coefficient serving as an input vector into the distributed photovoltaic fault diagnosis model to judge the state of the power station.
The data acquisition module is also used for carrying out similarity on the time sequence of the output data of each photovoltaic power station through Pearson correlation coefficients to obtain a plurality of photovoltaic power stations with higher similarity with the photovoltaic power station;
and calculating the theoretical power generation of the specific photovoltaic power station by selecting similar higher operation data of the photovoltaic power station in the theoretical power generation prediction model.
The correlation coefficients are relative Euclidean distance and Pearson correlation coefficient.
The formula for calculating the relative Euclidean distance is
Figure BDA0002482057770000151
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure BDA0002482057770000152
Wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure BDA0002482057770000153
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
The time period intercepted in the correlation calculation module is 2-5h, and preferably 3 h.
In light of the foregoing description of the preferred embodiments according to the present application, it is to be understood that various changes and modifications may be made without departing from the spirit and scope of the invention. The technical scope of the present application is not limited to the contents of the specification, and must be determined according to the scope of the claims.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

Claims (10)

1. A distributed photovoltaic power station fault diagnosis method is characterized by comprising the following steps:
s1: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
s2: training a BP neural network according to the collected historical operation data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operation data of other photovoltaic power stations and establish a time sequence of the theoretical power generation of each photovoltaic power station;
s3: intercepting the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical generated energy according to the same occurrence time in a certain time period, calibrating the fault type of each intercepted section at the corresponding occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the intercepted section of the time sequence of the output data and the intercepted section of the time sequence of the theoretical generated energy;
s4: taking the correlation coefficient as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and establishing a distributed photovoltaic fault diagnosis model by using a decision tree method;
s5: the method comprises the steps of monitoring output data of each photovoltaic power station of the photovoltaic power stations, inputting the output data into a theoretical power generation prediction model to obtain theoretical power generation, calculating correlation coefficients of the output data and the theoretical power generation, and inputting the correlation coefficients into a distributed photovoltaic fault diagnosis model to judge the state of the power stations.
2. The distributed photovoltaic power station fault diagnosis method according to claim 1, wherein in the step S1, similarity is further performed on the time series of the output data of each photovoltaic power station through pearson correlation coefficients, so as to obtain a plurality of photovoltaic power stations with higher similarity to the photovoltaic power station;
the theoretical power generation of a particular photovoltaic plant is calculated in step S2 from the selected operating data of a similar higher photovoltaic plant.
3. The distributed photovoltaic power plant fault diagnosis method as claimed in claim 1 or 2, characterized in that the correlation coefficients are relative euclidean distance and pearson correlation coefficients.
4. The distributed photovoltaic power plant fault diagnosis method of claim 3 wherein the calculation formula of the relative Euclidean distance is
Figure FDA0002482057760000021
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure FDA0002482057760000022
Wherein XTarIs the number of forces exertedAccording to a time series of truncated segments, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure FDA0002482057760000023
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
5. The distributed photovoltaic power plant fault diagnosis method as claimed in any one of claims 1 to 4, wherein the time period intercepted in step S3 is 2 to 5 hours.
6. A distributed photovoltaic power station fault diagnosis device is characterized by comprising:
a data acquisition module: collecting historical operating data of each photovoltaic power station, establishing a time sequence of output data of each photovoltaic power station, and collecting historical fault diagnosis records of each photovoltaic power station;
a prediction model of theoretical power generation: training a BP neural network according to the collected historical operating data of each photovoltaic power station to obtain a prediction model of theoretical power generation, wherein the prediction model of the theoretical power generation can calculate the theoretical power generation of a specific photovoltaic power station through the operating data of other photovoltaic power stations;
theoretical generated energy calculation module: the system comprises a power generation system, a power generation system and a power generation system, wherein the power generation system is used for collecting operation data of each photovoltaic power station, calculating theoretical power generation of each photovoltaic power station according to the collected operation data of each photovoltaic power station, and forming a theoretical power generation time sequence according to the calculated theoretical power generation;
a correlation calculation module: the system comprises a time sequence acquisition module, a fault diagnosis module, a power generation module and a power generation module, wherein the time sequence acquisition module is used for acquiring the time sequence of the output data of each photovoltaic power station and the time sequence of the theoretical power generation according to the same occurrence time in a certain time period, calibrating the fault type of each acquisition section corresponding to the occurrence time according to historical fault diagnosis records, and respectively calculating the correlation coefficient of the acquisition section of the time sequence of the output data and the acquisition section of the time sequence of the;
a fault diagnosis model: taking the correlation coefficient obtained by the correlation calculation module as an input vector, marking the fault type as an output vector, taking the input vector and the output vector as training data, and training and establishing by using a decision tree method;
a fault diagnosis module: and calculating a correlation coefficient according to the output data correlation calculation module of each photovoltaic power station of the monitored photovoltaic power stations, and inputting the correlation coefficient serving as an input vector into the distributed photovoltaic fault diagnosis model to judge the state of the power station.
7. The distributed photovoltaic power station fault diagnosis method of claim 6, wherein the data acquisition module further performs similarity on the time series of the output data of each photovoltaic power station through Pearson correlation coefficients to obtain a plurality of photovoltaic power stations with higher similarity to the photovoltaic power station;
and calculating the theoretical power generation of the specific photovoltaic power station by selecting similar higher operation data of the photovoltaic power station in the theoretical power generation prediction model.
8. The distributed photovoltaic power plant fault diagnosis method as claimed in claim 6 or 7, characterized in that the correlation coefficients are relative Euclidean distance and Pearson correlation coefficients.
9. The distributed photovoltaic power plant fault diagnosis method of claim 8 wherein the calculation formula of the relative Euclidean distance is
Figure FDA0002482057760000041
In the formula: (X)Tar,XRef) Is a relative Euclidean distance; wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy;
the calculation formula of the Pearson correlation coefficient is
Figure FDA0002482057760000042
Wherein XTarIs an intercepted segment of the time series of the output data, XRefIs an intercepted segment of a time sequence of theoretical generated energy; r is the Pearson correlation coefficient;
Figure FDA0002482057760000043
the average value of the intercepted segment of the time sequence of the output data and the intercepted segment of the time sequence of the theoretical generating capacity.
10. The distributed photovoltaic power plant fault diagnosis method as claimed in any of the claims 6-9, characterized in that the time period intercepted in the correlation calculation module is 2-5 h.
CN202010381231.7A 2020-05-08 2020-05-08 Distributed photovoltaic power station fault diagnosis method and device Active CN111680820B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010381231.7A CN111680820B (en) 2020-05-08 2020-05-08 Distributed photovoltaic power station fault diagnosis method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010381231.7A CN111680820B (en) 2020-05-08 2020-05-08 Distributed photovoltaic power station fault diagnosis method and device

Publications (2)

Publication Number Publication Date
CN111680820A true CN111680820A (en) 2020-09-18
CN111680820B CN111680820B (en) 2022-08-19

Family

ID=72433396

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010381231.7A Active CN111680820B (en) 2020-05-08 2020-05-08 Distributed photovoltaic power station fault diagnosis method and device

Country Status (1)

Country Link
CN (1) CN111680820B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112200464A (en) * 2020-10-14 2021-01-08 国网山东省电力公司聊城供电公司 Photovoltaic power station output data correction method and system considering spatial correlation
CN112731022A (en) * 2020-12-18 2021-04-30 合肥阳光智维科技有限公司 Photovoltaic inverter fault detection method, device and medium
CN113992153A (en) * 2021-11-19 2022-01-28 珠海康晋电气股份有限公司 Visual real-time monitoring distributed management system of photovoltaic power plant
CN114070198A (en) * 2021-12-06 2022-02-18 北京中电普华信息技术有限公司 Fault diagnosis method and device for distributed photovoltaic power generation system and electronic equipment
CN114757097A (en) * 2022-04-07 2022-07-15 国网河北省电力有限公司邯郸供电分公司 Line fault diagnosis method and device
CN114841081A (en) * 2022-06-21 2022-08-02 国网河南省电力公司郑州供电公司 Method and system for controlling abnormal accidents of power equipment
CN114899949A (en) * 2022-06-01 2022-08-12 深圳博浩远科技有限公司 Data acquisition method and device suitable for commercial photovoltaic inverter
CN115587642A (en) * 2022-06-22 2023-01-10 大唐海南能源开发有限公司 Photovoltaic system fault warning method based on BP neural network
CN116628608A (en) * 2023-04-23 2023-08-22 华能国际电力江苏能源开发有限公司 Photovoltaic power generation fault diagnosis method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8682585B1 (en) * 2011-07-25 2014-03-25 Clean Power Research, L.L.C. Computer-implemented system and method for inferring operational specifications of a photovoltaic power generation system
CN105846780A (en) * 2016-03-19 2016-08-10 上海大学 Decision tree model-based photovoltaic assembly fault diagnosis method
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
CN107516145A (en) * 2017-07-27 2017-12-26 浙江工业大学 A kind of multichannel photovoltaic power generation output forecasting method based on weighted euclidean distance pattern classification
CN109842373A (en) * 2019-04-15 2019-06-04 国网河南省电力公司电力科学研究院 Diagnosing failure of photovoltaic array method and device based on spatial and temporal distributions characteristic
CN110336534A (en) * 2019-07-15 2019-10-15 龙源(北京)太阳能技术有限公司 A kind of method for diagnosing faults based on photovoltaic array electric parameter time series feature extraction

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8682585B1 (en) * 2011-07-25 2014-03-25 Clean Power Research, L.L.C. Computer-implemented system and method for inferring operational specifications of a photovoltaic power generation system
CN105846780A (en) * 2016-03-19 2016-08-10 上海大学 Decision tree model-based photovoltaic assembly fault diagnosis method
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
CN107516145A (en) * 2017-07-27 2017-12-26 浙江工业大学 A kind of multichannel photovoltaic power generation output forecasting method based on weighted euclidean distance pattern classification
CN109842373A (en) * 2019-04-15 2019-06-04 国网河南省电力公司电力科学研究院 Diagnosing failure of photovoltaic array method and device based on spatial and temporal distributions characteristic
CN110336534A (en) * 2019-07-15 2019-10-15 龙源(北京)太阳能技术有限公司 A kind of method for diagnosing faults based on photovoltaic array electric parameter time series feature extraction

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112200464A (en) * 2020-10-14 2021-01-08 国网山东省电力公司聊城供电公司 Photovoltaic power station output data correction method and system considering spatial correlation
CN112200464B (en) * 2020-10-14 2023-04-28 国网山东省电力公司聊城供电公司 Correction method and system for photovoltaic power station output data considering spatial correlation
CN112731022A (en) * 2020-12-18 2021-04-30 合肥阳光智维科技有限公司 Photovoltaic inverter fault detection method, device and medium
CN113992153B (en) * 2021-11-19 2023-03-14 珠海康晋电气股份有限公司 Visual real-time monitoring distributed management system of photovoltaic power station
CN113992153A (en) * 2021-11-19 2022-01-28 珠海康晋电气股份有限公司 Visual real-time monitoring distributed management system of photovoltaic power plant
CN114070198A (en) * 2021-12-06 2022-02-18 北京中电普华信息技术有限公司 Fault diagnosis method and device for distributed photovoltaic power generation system and electronic equipment
CN114070198B (en) * 2021-12-06 2023-11-07 北京中电普华信息技术有限公司 Fault diagnosis method and device for distributed photovoltaic power generation system and electronic equipment
CN114757097A (en) * 2022-04-07 2022-07-15 国网河北省电力有限公司邯郸供电分公司 Line fault diagnosis method and device
CN114757097B (en) * 2022-04-07 2023-09-26 国网河北省电力有限公司邯郸供电分公司 Line fault diagnosis method and device
CN114899949A (en) * 2022-06-01 2022-08-12 深圳博浩远科技有限公司 Data acquisition method and device suitable for commercial photovoltaic inverter
CN114899949B (en) * 2022-06-01 2022-12-23 深圳博浩远科技有限公司 Data acquisition method and device suitable for commercial photovoltaic inverter
CN114841081A (en) * 2022-06-21 2022-08-02 国网河南省电力公司郑州供电公司 Method and system for controlling abnormal accidents of power equipment
CN115587642A (en) * 2022-06-22 2023-01-10 大唐海南能源开发有限公司 Photovoltaic system fault warning method based on BP neural network
CN115587642B (en) * 2022-06-22 2023-07-11 大唐海南能源开发有限公司 BP neural network-based photovoltaic system fault alarm method
CN116628608A (en) * 2023-04-23 2023-08-22 华能国际电力江苏能源开发有限公司 Photovoltaic power generation fault diagnosis method and system

Also Published As

Publication number Publication date
CN111680820B (en) 2022-08-19

Similar Documents

Publication Publication Date Title
CN111680820B (en) Distributed photovoltaic power station fault diagnosis method and device
CN109842373B (en) Photovoltaic array fault diagnosis method and device based on space-time distribution characteristics
CN105335752A (en) Principal component analysis multivariable decision-making tree-based connection manner identification method
CN112818604A (en) Wind turbine generator risk degree assessment method based on wind power prediction
CN110570122A (en) Offshore wind power plant reliability assessment method considering wind speed seasonal characteristics and current collection system element faults
CN111738520A (en) System load prediction method fusing isolated forest and long-short term memory network
CN110070228B (en) BP neural network wind speed prediction method for neuron branch evolution
CN106649479A (en) Probability graph-based transformer state association rule mining method
CN114006369B (en) Regional wind and light station power joint prediction method and device, electronic equipment and storage medium
CN113822418A (en) Wind power plant power prediction method, system, device and storage medium
CN114021483A (en) Ultra-short-term wind power prediction method based on time domain characteristics and XGboost
CN105471647A (en) Power communication network fault positioning method
CN115828466A (en) Fan main shaft component fault prediction method based on wide kernel convolution
CN110942098A (en) Power supply service quality analysis method based on Bayesian pruning decision tree
CN115034485A (en) Wind power interval prediction method and device based on data space
CN115021679A (en) Photovoltaic equipment fault detection method based on multi-dimensional outlier detection
CN112651576A (en) Long-term wind power prediction method and device
CN114595762A (en) Photovoltaic power station abnormal data sequence extraction method
CN113689072A (en) Offshore wind turbine generator running state evaluation method based on Markov chain
CN111428821A (en) Asset classification method based on decision tree
CN111061708A (en) Electric energy prediction and restoration method based on LSTM neural network
CN116317937A (en) Distributed photovoltaic power station operation fault diagnosis method
CN113496255B (en) Power distribution network mixed observation point distribution method based on deep learning and decision tree driving
CN114676931B (en) Electric quantity prediction system based on data center technology
CN110489852A (en) Improve the method and device of the wind power system quality of data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant