WO2020029327A1 - Photovoltaic array fault diagnosis method based on improved random forest algorithm - Google Patents

Photovoltaic array fault diagnosis method based on improved random forest algorithm Download PDF

Info

Publication number
WO2020029327A1
WO2020029327A1 PCT/CN2018/101743 CN2018101743W WO2020029327A1 WO 2020029327 A1 WO2020029327 A1 WO 2020029327A1 CN 2018101743 W CN2018101743 W CN 2018101743W WO 2020029327 A1 WO2020029327 A1 WO 2020029327A1
Authority
WO
WIPO (PCT)
Prior art keywords
photovoltaic array
fault diagnosis
fault
array
photovoltaic
Prior art date
Application number
PCT/CN2018/101743
Other languages
French (fr)
Chinese (zh)
Inventor
陶洪峰
周超超
魏强
刘巍
周龙辉
王鹏
Original Assignee
江南大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 江南大学 filed Critical 江南大学
Priority to RU2019127993A priority Critical patent/RU2734017C1/en
Publication of WO2020029327A1 publication Critical patent/WO2020029327A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02SGENERATION OF ELECTRIC POWER BY CONVERSION OF INFRARED RADIATION, VISIBLE LIGHT OR ULTRAVIOLET LIGHT, e.g. USING PHOTOVOLTAIC [PV] MODULES
    • H02S50/00Monitoring or testing of PV systems, e.g. load balancing or fault identification
    • H02S50/10Testing of PV devices, e.g. of PV modules or single PV cells
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E10/00Energy generation through renewable energy sources
    • Y02E10/50Photovoltaic [PV] energy

Definitions

  • the invention relates to the field of photovoltaic technology, in particular to a photovoltaic array fault diagnosis method based on an improved random forest algorithm.
  • CSP solar thermal power
  • photovoltaic power is similar to thermal power generation, but the thermal energy mainly comes from large-scale mirror collection. After heating the water, it uses steam to promote the operation of traditional generators to achieve power generation. Although this type of power generation avoids the photoelectric conversion of silicon crystals, Light intensity requirements are high and power generation costs are high.
  • Photovoltaic power generation is based on the photovoltaic effect, which directly converts solar energy into electricity. The construction period is short, and no waste residue, waste water and other pollutants will be generated during operation, especially for mountainous areas, islands and remote areas with less developed transportation. More important value.
  • photovoltaic power generation uses photovoltaic cell modules in series and parallel to form a photovoltaic array to operate, and photovoltaic arrays use more photovoltaic cell modules, the probability of failure of the photovoltaic cell module itself is higher, and the photovoltaic array runs for a long time Outdoors, the operating environment is harsh, and it is prone to aging and damage. As a result, the photovoltaic cell module's power generation efficiency is reduced or even stopped. When a photovoltaic cell module in a photovoltaic array fails, it will lead to a decrease in system efficiency and adversely affect the operation and scheduling of the power system. In severe cases, it may even cause property damage and casualties. Therefore, the photovoltaic array is faulty.
  • Diagnosis is of great significance.
  • the currently used fault diagnosis methods mainly include two types: direct method and indirect method. Indirect method is more typical with infrared heat detection method and power generation comparison method. Direct method is more typical with ground capacitance method and time. Domain reflection method, intelligent diagnosis algorithm, and electrical characteristic detection method. Among them, fault diagnosis by combining intelligent diagnosis algorithm and electrical characteristic detection method is currently a method with great potential.
  • the present inventor proposes a photovoltaic array fault diagnosis method based on an improved random forest algorithm. This method determines the operating condition of a photovoltaic cell module by collecting the voltage and current of the photovoltaic cell modules between the arrays, thereby realizing a fault. Positioning.
  • a photovoltaic array fault diagnosis method based on an improved random forest algorithm includes:
  • the photovoltaic array includes n branches, and each branch includes m photovoltaic cell modules.
  • the typical operating state of the photovoltaic array is used to indicate the status of each photovoltaic cell module in the photovoltaic array.
  • Running status, m and n are positive integers;
  • a data sample set of the photovoltaic array is constructed, and the data sample set is divided into a training sample set and a test sample set;
  • the photovoltaic array fault diagnosis model is trained by using the training sample set, and the photovoltaic array fault diagnosis model is tested by using the test sample set.
  • the photovoltaic array fault diagnosis model includes N decision trees, where N is a positive integer and N ⁇ 2 ;
  • the PV array fault diagnosis model completed by the test is used to diagnose the PV array to be diagnosed, and the voting results of N decision trees for each typical operating state are obtained;
  • the fault diagnosis result of the photovoltaic array to be diagnosed is obtained according to the voting results corresponding to each typical operating state, and the fault diagnosis result is used to indicate the operating state of each photovoltaic cell module in the photovoltaic array.
  • the array trunk parameters and each array branch parameter include the maximum power point voltage, the maximum power point current, the open circuit voltage and the short circuit current of the circuit, respectively.
  • the photovoltaic array fault diagnosis model includes N decision trees and the corresponding weights of each decision tree, and the photovoltaic array fault diagnosis model completed by the test is used to diagnose the PV array to be diagnosed, including:
  • N decision trees vote for each typical running state, and each decision tree casts its own weight as the number of votes;
  • the sub-training set includes For the p samples selected in this round of sampling, the out-of-package sample set includes the samples that were not sampled in this round of sampling, and p is a positive integer;
  • a decision tree is trained according to the sub-training sample set corresponding to each round of sampling, and the decision tree is tested by using the out-of-package sample set corresponding to the round of sampling to obtain the out-of-package accuracy rate.
  • the training N decision trees and the weights corresponding to each decision tree are summarized to obtain a photovoltaic array fault diagnosis model.
  • w (i) is the weight corresponding to the i-th decision tree
  • Hoob (i) is the out-of-package accuracy rate corresponding to the i-th decision tree
  • i and j are parameters, 1 ⁇ i ⁇ N and i is positive Integer.
  • the method further includes:
  • the N out-of-package sample sets are aggregated to obtain a total out-of-package sample set.
  • the samples are tested using the trained photovoltaic array fault diagnosis model to obtain the initial out-of-package accuracy rate;
  • the sample includes a total of K fault features. Noise is added to the k-th fault feature, and the sample is tested again using the photovoltaic array fault diagnosis model to obtain a new out-of-package accuracy rate. The initial out-of-package accuracy rate and the new out-of-package accuracy rate are calculated. The difference between the accuracy rates determines the importance metric of the kth feature, where K is a positive integer, k is a parameter, and the starting value of k is 1;
  • the processed training sample set is used to construct a photovoltaic array fault diagnosis model.
  • the typical operating state corresponding to the highest number of votes includes only one, the typical operating state corresponding to the highest number of votes is output as a fault diagnosis result;
  • the typical running state corresponding to the highest number of votes includes at least two, the L decision trees with the highest weights are selected to vote for each typical running state. If there is a typical running state to obtain the selection of the most decision trees, then The typical running state is output as the fault diagnosis result; otherwise, the typical running state selected by the decision tree with the highest weight is output as the fault diagnosis result, where L is a positive integer and 2 ⁇ L ⁇ N.
  • the present application discloses a photovoltaic array fault diagnosis method based on an improved random forest algorithm, which collects parameters of the main trunk of the photovoltaic array and each branch, and the voltage difference between the arrays between different branches.
  • the parameters of each branch can reflect the operating status of each branch in the photovoltaic array, thereby diagnosing which branch in the photovoltaic array has what kind of failure, and the voltage difference between the arrays between different branches can reflect the branch
  • the operating status of each photovoltaic cell module in the system so as to diagnose which photovoltaic cell module is faulty.
  • the combination of the two types of information effectively achieves the fault location of the photovoltaic array.
  • the random forest algorithm is used, which is suitable for practical applications.
  • the characteristics of photovoltaic arrays overcome the problems of large amount of data and long training time required by traditional neural network algorithms, and can complete diagnostic tasks simply and quickly.
  • the method disclosed in this application is based on a data-driven idea, and combines random forest algorithm, decision tree weighting, voting tie processing, and variable importance measurement to realize fault diagnosis of photovoltaic arrays and effectively improve diagnostic accuracy.
  • the method disclosed in this application uses an improved random forest fault diagnosis model structure. Through the integration of multiple decision trees, a strong classifier is constructed, which uses decision tree weighting, voting tie processing, and fault feature importance measures. A series of steps were improved, and finally an improved random forest algorithm was constructed. The diagnosis results also show that although the improved algorithm has slightly longer training time, it has higher accuracy and more reliable diagnosis.
  • FIG. 1 is a flowchart of a photovoltaic array fault diagnosis method disclosed in the present application.
  • Figure 2 is a schematic diagram of the construction of a 3 * 3 photovoltaic array.
  • Figure 3-a is the U-I output characteristic curve of a single solar cell module under different light intensities.
  • Figure 3-b is the U-P output characteristic curve of a single solar cell module under different light intensities.
  • Figure 3-c is the U-I output characteristic curve of a single solar cell module under different temperature conditions.
  • Figure 3-d is the U-P output characteristic curve of a single solar cell module under different temperature conditions.
  • FIG. 4 is an output characteristic curve of a single solar cell module at a light intensity of 600 W / m2, 800 W / m2, and 1000 W / m2.
  • Figure 5-a is a comparison of the output characteristic curves of a photovoltaic array when there is no fault and when a single-mode fault occurs.
  • Figure 5-b is a comparison diagram of the output characteristic curves of a photovoltaic array when there is no fault and when a multi-mode fault occurs.
  • Figure 6-a is the output characteristic curve of a photovoltaic array when a photovoltaic cell module is shorted in a branch.
  • Figure 6-b is the output characteristic curve of a photovoltaic array when one branch is open.
  • Figure 6-c is the output characteristic curve of a photovoltaic array when two photovoltaic cell modules in a branch have shadows.
  • Figure 6-d shows the output characteristic curve of a photovoltaic array when one photovoltaic cell module is short-circuited in one branch, and one photovoltaic cell module is shadowed in the other branch.
  • This application discloses a photovoltaic array fault diagnosis method based on an improved random forest algorithm. Please refer to FIG. 1.
  • the entire fault diagnosis process includes the following steps:
  • the first step is to build a photovoltaic array in the form of m * n.
  • the photovoltaic array includes n branches in parallel, and each branch includes m photovoltaic cell modules in series, where m and n are positive integers.
  • the photovoltaic cell modules in a photovoltaic array are usually solar cell modules.
  • the specifications of each photovoltaic cell module are the same to ensure the potential balance of the photovoltaic array.
  • Each photovoltaic cell module is connected with a bypass diode in parallel and each branch in series with an isolation diode. Protection, detection devices such as fuses, voltage and current sensors are correctly installed and working properly.
  • FIG. 2 For a schematic diagram of a 3 * 3 photovoltaic array, please refer to FIG. 2. It includes branch A, branch B, and branch C. Each branch includes 3 branches. For photovoltaic cells connected in series, Figure 2 does not show devices such as bypass diodes and isolation diodes.
  • the photovoltaic cell module in the photovoltaic array of this application uses SunTech STP 270-24 / Vd solar cell module of Suntech Solar Co., Ltd.
  • the parameter specification table of this type of solar cell module is shown in the following table:
  • the UI (voltage-current) output characteristic curve when the light intensity rises from 200 W / m 2 to 1000 W / m 2 (200, 400, 600, 800, and 1000 W / m 2 respectively ) is as follows
  • Figure 3-a shows the UP (voltage-power) output characteristic curve shown in Figure 3-b.
  • the UI output characteristic curve when the temperature drops from 35 ° C to 15 ° C (respectively 35 ° C, 30 ° C, 25 ° C, 20 ° C, and 15 ° C) is shown in Fig.
  • the UP (voltage-power) output characteristic curve is shown in Figure 3-d.
  • the product manual gives the output characteristic curve of this type of solar cell module under the light intensity of 600W / m 2 , 800W / m 2 and 1000W / m 2 as shown in FIG. 4.
  • the second step is to determine the typical operating state of the photovoltaic array during operation.
  • the typical operating state of the photovoltaic cell module in this application is used to indicate the operating state of each photovoltaic cell module in the photovoltaic cell module.
  • Figure 5-b shows the faulty branches of photovoltaic arrays when there are no faults and multi-mode faults occur.
  • Fault array, normal branch and normal array output characteristic curve where the normal branch refers to the branch in which each photovoltaic cell module is operating normally, the normal array refers to the photovoltaic array in which each branch is operating normally, the fault A branch is a branch in which a photovoltaic cell module fails, and a fault array refers to a photovoltaic array in which a branch fails.
  • the output characteristic curve has two regions: high voltage region and high current region. In the high voltage region, the condition of the branch can be determined by detecting the output current of each branch. In the high current region, the output voltage of the photovoltaic cell module can be detected. To achieve fault location.
  • the above four types of faults may occur during the operation of photovoltaic arrays.
  • the operating states of photovoltaic arrays mainly include five categories: normal operating states, short circuit fault states, open circuit fault states, shadow fault states, and mixed fault states :
  • the shadow fault state that is, any branch or multiple branches have shadows. Unlike the short-circuit fault state, there can be shadows of all photovoltaic cell modules in a branch, so the number of this state is large. .
  • two-level state classification can be adopted.
  • rough classification is performed to locate the branches in the photovoltaic array. This step does not involve fault location. You can choose some of the most common operations based on actual conditions and experience. status.
  • the major faults of the system have evolved on the basis of minor faults. The probability of direct major faults is very small, and the types and combinations of major faults are too many, so this application mainly Consider light and moderate faults, that is, consider that at least one branch in the photovoltaic array is in a normal operating state. For example, in the photovoltaic array shown in FIG. 2, assuming that the branch C in the photovoltaic array is always in a normal operating state, the following 20 operating states can be selected as the result of the first-level state classification, as shown in the following table:
  • the second-level status classification can be performed to locate the photovoltaic cell module in the branch, thereby achieving fault location.
  • the status label F2 indicates that a photovoltaic cell module in a branch is short-circuited.
  • the 3 * 3 photovoltaic array shown in FIG. 2 assuming that the branch C in the photovoltaic array is in a normal operating state, the The condition indicates that there is a short-circuit of a photovoltaic cell module on branch A or branch B.
  • the results of the second-level state classification based on this are shown in the following table:
  • results of the second-level status classification based on the first-level status classification corresponding to the status label F3 can be shown in the following table:
  • the third step is to collect the circuit parameters of the photovoltaic array.
  • the array trunk parameters of the photovoltaic array are parameters on the main trunk of the photovoltaic array.
  • the array trunk parameters in this application include the maximum power point voltage V m , the maximum power point current I m , and the open circuit voltage V oc of the main trunk. And short-circuit current I sc , which is because these four parameters can well characterize the UI output characteristic curve of the photovoltaic cell module.
  • the maximum power point voltage V m the maximum power point current I m , the open circuit voltage V oc, and the short-circuit current I sc of the branch are collected as the array branch parameters.
  • the circuit parameters of a photovoltaic array can be measured by configuring a voltage sensor and a current sensor in the photovoltaic array. For an m ⁇ n scale photovoltaic array, it is necessary to configure [n ⁇ (m-1) / 2] voltage sensors and n + 1 current sensors, where the symbol [] means rounding up n ⁇ (m-1) / 2.
  • the current sensor is set in each branch and the main trunk to detect the current of the branch and the current of the main trunk.
  • the voltage sensor is set between different branches to detect the voltage difference between the arrays between different branches. As shown in FIG. 2, three voltage sensors U a , U b and U c are provided .
  • the voltage sensor U a is disposed between the branch A and the branch B, and the voltage sensor U b is disposed between the branch B and the branch C.
  • the voltage sensor U c is arranged between the branch C and the branch A.
  • the detection of the branch current and the detection of the voltage difference between the branches are essentially to detect the voltage and current of each photovoltaic cell module.
  • the changes in the parameter values such as voltage and current reflect the operation of the photovoltaic cell module, thereby achieving line faults.
  • the principle of positioning is as follows:
  • the voltage matrix U pv array of the photovoltaic array is:
  • U 11 represents the output voltage of the first photovoltaic cell module of the first branch in the photovoltaic array, that is, the output voltage of PV1 in FIG. 2
  • U mn represents the m-th branch of the n-th branch in the photovoltaic array
  • the output voltage of the photovoltaic cell module that is, the output voltage of PV9 in FIG. 2, and so on. Then the photovoltaic array equation holds as follows:
  • U array is the output voltage of the photovoltaic array
  • the output voltage of each photovoltaic cell module should be 1 / m of the output voltage of the entire photovoltaic array.
  • PV matrix weight matrix A pv array that is, the potential of the voltage sensor placement point between the branches is:
  • u 11 U 21 -U 11
  • u 21 U 31 -U 21
  • u (m-1) 1 U m1 -U (m-1) 1 , and so on.
  • U is the output voltage of the photovoltaic cell module in normal operation
  • m is the total number of photovoltaic cell modules included in the branch
  • x is the number of photovoltaic cell modules that have failed in the faulty branch
  • U bad is the faulty branch.
  • the elements of the rows of the non-faulty branch in the weight matrix A pv array are equal, and the elements of the rows of the faulty branch and the non-faulty branch
  • the elements of each row of the road are not equal; and the elements of each column of the branch with a fault are not equal difference sequences.
  • the following table shows the characteristic values of the voltage difference between the arrays collected by each voltage sensor in the case of a partial short-circuit fault:
  • status label F4-14 For the meaning of the above status label, please refer to the content in the second step above. Those skilled in the art understand that the situation indicated by status label F4-14 is that PV1 on branch A and PV4 on branch B are short-circuited. By analogy, this application does not repeat them one by one.
  • the voltage difference between the arrays collected by the voltage sensor is proportional to the output voltage U array of the photovoltaic array, so it can be further simplified.
  • the array The voltage difference is divided by the output voltage U array of the photovoltaic array to obtain a voltage coefficient, and the voltage difference is used to represent the voltage difference between the arrays.
  • the fourth step is the construction of fault feature vectors and data sample sets. Construct fault feature vectors based on the collected array trunk parameters, array branch parameters, and the voltage difference between the arrays. After the fault feature vectors are obtained, they can be summarized and constructed into a data sample set of photovoltaic arrays. Data samples are collected from the fault feature vectors.
  • the collection method is a method known to those skilled in the art, which is not described in this application.
  • the data sample set is divided to obtain a training sample set and a test sample set, which can usually be divided into a training sample set and a test sample set according to a ratio of 2: 1.
  • the fifth step is the construction of a photovoltaic array fault diagnosis model based on an improved random forest algorithm.
  • the training sample set is X
  • the dimension of the fault feature vector is K, that is, the number of fault features of the sample is K, K is a positive integer
  • the number of decision trees to be established in fault diagnosis is N, N ⁇ 2, and N is Positive integer
  • the steps of constructing a photovoltaic array fault diagnosis model based on an improved random forest algorithm using training sample sets can be briefly described as:
  • a decision tree is obtained by training the sub-training sample set corresponding to each round of sampling, that is, the sub-training sample set X i is used to train the decision tree Q i .
  • q fault features are randomly selected from K fault features, where q is a positive integer and q ⁇ K, and the Gini coefficient corresponding to each fault feature in the q fault features is calculated.
  • the fault feature corresponding to the coefficient is used as the best split feature for node splitting.
  • the calculation method of the Gini coefficient can use the existing calculation formula, which is not described in detail in this application.
  • the weights of decision trees are equal by default. Therefore, some decision trees with excellent performance may not show their advantages and may even be overwhelmed by a majority vote.
  • the tree is weighted, and the out-of-package accuracy rate of the decision tree is used to evaluate the performance of the decision tree. After all the decision tree training is completed, the out-of-package accuracy rate of each decision tree is calculated, and the decision is obtained for the i-th sampling training.
  • Tree Q i using its corresponding out-of-package accuracy rate H oob (i) to calculate the weight of the decision tree Q i is as follows:
  • w (i) is the weight corresponding to the i-th decision tree
  • j is a parameter
  • the photovoltaic array fault diagnosis model is further optimized. As the size of a photovoltaic array expands, the number of branches and voltage sensors running at the same time will increase, and the dimension K of the fault feature vector will increase, which will increase the difficulty of model training. The fault characteristics of the sample are measured for importance to delete some fault characteristics.
  • the specific steps are as follows:
  • the PV array fault diagnosis model includes a total of N decision trees. Using each decision tree that is not trained with the sample to test the sample, you can determine the correct test and the wrong decision The number of trees to obtain the initial out-of-package accuracy e.
  • the sample includes a total of K fault characteristics. Noise is added to the k-th fault characteristic, and the sample is tested again using the photovoltaic array fault diagnosis model to obtain a new out-of-package accuracy rate e1, and the initial out-of-package accuracy rate is calculated. And the new out-of-package accuracy rate, e-e1, to determine the importance metric of the k-th feature, where K is a positive integer, k is a parameter, and the starting value of k is 1.
  • the importance metrics of the K fault features calculated using the samples in the outsourced sample set are generally consistent, but there are errors in actual operation. Therefore, the After the importance metric values of the K fault features are calculated for each sample, they are summed to determine the t fault features with the smallest importance metric value among the K fault features. Then for each sample in the training sample set, t fault features with the smallest importance metric among the K fault features of the sample are deleted to obtain a processed training sample set, 1 ⁇ t ⁇ K.
  • the processed training sample set is used to construct a photovoltaic array fault diagnosis model.
  • the seventh step is to test the fault diagnosis model of the photovoltaic array.
  • indicators such as diagnostic accuracy, average training time, and test time can be obtained.
  • the F1-F20 listed above the first-level state type of the photovoltaic array is diagnosed without fault feature deletion.
  • the improved random forest algorithm and ordinary random forest algorithm are used for fault diagnosis.
  • the training sample set and the test sample set. Both algorithms use the same decision tree model.
  • the number of decision trees is 40.
  • the experimental results are compared as follows:
  • the second-level status classification can be regarded as a sub-class of the first-level status classification.
  • the short-circuit fault status can be obtained as the second-level status.
  • the eighth step is to use the completed photovoltaic array fault diagnosis model to diagnose the PV array to be diagnosed, that is, to use N decision trees to vote for each typical operating state.
  • the voting method is used to vote, that is, each decision tree
  • the weight corresponding to itself is cast as the number of votes.
  • the voting results corresponding to each typical operating state are statistically obtained, and then the fault diagnosis results of the photovoltaic array to be diagnosed are obtained according to the voting results corresponding to each typical operating state.
  • the obtained fault diagnosis results are used to indicate each photovoltaic in the photovoltaic array.
  • step 3 If the typical running state corresponding to the highest number of votes includes only one, the typical running state corresponding to the highest number of votes is directly output as the fault diagnosis result, otherwise, the following step 3) is performed.

Landscapes

  • Photovoltaic Devices (AREA)

Abstract

Disclosed is a photovoltaic array fault diagnosis method based on an improved random forest algorithm, relating to the field of photovoltaic technology. In the method, a running state of each branch in a photovoltaic array is reflected by means of parameters of a main trunk and each branch of the photovoltaic array, and a running state of each photovoltaic cell assembly in the branch is reflected by means of a voltage difference between arrays among different branches, so as to realize fault location of the photovoltaic array; and by optimizing and improving the three parts, i.e. decision tree weighting and voting, tie processing and the importance measurement of fault features, by means of out-of-package samples, the accuracy of the fault diagnosis can be higher and the reliability of the fault diagnosis can be stronger.

Description

一种基于改进随机森林算法的光伏阵列故障诊断方法Fault diagnosis method of photovoltaic array based on improved random forest algorithm 技术领域Technical field
本发明涉及光伏技术领域,尤其是一种基于改进随机森林算法的光伏阵列故障诊断方法。The invention relates to the field of photovoltaic technology, in particular to a photovoltaic array fault diagnosis method based on an improved random forest algorithm.
背景技术Background technique
随着能源需求的增长、化石能源的日益枯竭与成本上涨,以及全球气候变暖等诸多因素的影响,可再生能源技术得到了飞速发展,其中,太阳能具有易获取、无噪声、清洁、无穷无尽等优点,成为了可再生能源的重要部分。目前利用太阳能发电主要有两种形式:光热发电和光伏发电。光热发电与火力发电类似,但热能主要来自大规模的镜面收集,将水加热后,利用蒸汽推动传统发电机工作,从而实现发电,这种发电方式虽然规避了硅晶的光电转换,但对光照强度的要求高且发电成本偏高。而光伏发电基于光生伏特效应,直接将太阳能转化为电能,建设周期短,在运行时不会产生废渣、废水和其他污染物,尤其是对于交通不发达的山区、海岛和偏远地区,光伏发电具有更重要的价值。With the increase of energy demand, the increasing depletion and cost of fossil energy, and the impact of many factors such as global warming, renewable energy technologies have developed rapidly. Among them, solar energy has easy access, no noise, clean and endless. And other advantages have become an important part of renewable energy. At present, there are two main forms of solar power generation: solar thermal power and photovoltaic power. CSP is similar to thermal power generation, but the thermal energy mainly comes from large-scale mirror collection. After heating the water, it uses steam to promote the operation of traditional generators to achieve power generation. Although this type of power generation avoids the photoelectric conversion of silicon crystals, Light intensity requirements are high and power generation costs are high. Photovoltaic power generation is based on the photovoltaic effect, which directly converts solar energy into electricity. The construction period is short, and no waste residue, waste water and other pollutants will be generated during operation, especially for mountainous areas, islands and remote areas with less developed transportation. More important value.
一般情况下,光伏发电都是利用光伏电池组件串、并联组成光伏阵列进行运行的,而光伏阵列使用的光伏电池组件较多,光伏电池组件本身发生故障的概率就较高,并且光伏阵列长期运行在户外,运行环境恶劣,易出现老化和损坏等情况,从而造成光伏电池组件发电效率的降低甚至停止工作。当光伏阵列中某个光伏电池组件发生故障后,会导致系统效率的下降,并对电力系统的运行调度造成不利的影响,严重时,甚至造成财产损失和人员伤亡,因此,对光伏阵列进行故障诊断具有重要意义,目前所采用的故障诊断方法主要包括直接法和间接法两类,间接法比较典型的有红外热量检测法和发电功率对比法,直接法比较典型的有对地电容法、时域反射法、智能诊断算法和电特性检测法,其中,通过将智能诊断算法和电特性检测法进行结合进行故障诊断,是目前非常具有潜力的方法。Under normal circumstances, photovoltaic power generation uses photovoltaic cell modules in series and parallel to form a photovoltaic array to operate, and photovoltaic arrays use more photovoltaic cell modules, the probability of failure of the photovoltaic cell module itself is higher, and the photovoltaic array runs for a long time Outdoors, the operating environment is harsh, and it is prone to aging and damage. As a result, the photovoltaic cell module's power generation efficiency is reduced or even stopped. When a photovoltaic cell module in a photovoltaic array fails, it will lead to a decrease in system efficiency and adversely affect the operation and scheduling of the power system. In severe cases, it may even cause property damage and casualties. Therefore, the photovoltaic array is faulty. Diagnosis is of great significance. The currently used fault diagnosis methods mainly include two types: direct method and indirect method. Indirect method is more typical with infrared heat detection method and power generation comparison method. Direct method is more typical with ground capacitance method and time. Domain reflection method, intelligent diagnosis algorithm, and electrical characteristic detection method. Among them, fault diagnosis by combining intelligent diagnosis algorithm and electrical characteristic detection method is currently a method with great potential.
但目前的光伏阵列故障诊断方法都较少考虑故障定位的情况,主要原因是:目前的智能诊断算法多采用以BP(Back Propagation)为代表的神经网络,而光伏 阵列中的光伏电池组件的数目较多,当某一个或某多个光伏电池组件发生故障时,故障的排列组合形式较多,再加上不同的故障类型,有时甚至会出现上百种故障组合,则会需要较多的训练样本,并且训练时间也会随之增加,诊断精度也不能够得到保证,诊断难度较大。However, the current fault diagnosis methods of photovoltaic arrays rarely consider fault location. The main reason is that the current intelligent diagnostic algorithms mostly use neural networks represented by BP (Back Propagation), and the number of photovoltaic cell modules in photovoltaic arrays. There are many. When one or more photovoltaic cell modules fail, there are many forms of arrangement and combination of faults. Coupled with different fault types, sometimes there are even hundreds of fault combinations, which will require more training. The sample and training time will increase accordingly, the diagnostic accuracy cannot be guaranteed, and the diagnosis is difficult.
发明内容Summary of the invention
本发明人针对上述问题及技术需求,提出了一种基于改进随机森林算法的光伏阵列故障诊断方法,该方法通过采集阵列间光伏电池组件的电压和电流确定光伏电池组件的运行情况,从而实现故障定位。In view of the above problems and technical requirements, the present inventor proposes a photovoltaic array fault diagnosis method based on an improved random forest algorithm. This method determines the operating condition of a photovoltaic cell module by collecting the voltage and current of the photovoltaic cell modules between the arrays, thereby realizing a fault. Positioning.
本发明的技术方案如下:The technical solution of the present invention is as follows:
一种基于改进随机森林算法的光伏阵列故障诊断方法,该方法包括:A photovoltaic array fault diagnosis method based on an improved random forest algorithm, the method includes:
确定光伏阵列在运行过程中的典型运行状态,光伏阵列包括n个支路,每个支路中包括m个光伏电池组件,光伏阵列的典型运行状态用于指示光伏阵列中的各个光伏电池组件的运行状态,m和n均为正整数;Determine the typical operating state of the photovoltaic array during operation. The photovoltaic array includes n branches, and each branch includes m photovoltaic cell modules. The typical operating state of the photovoltaic array is used to indicate the status of each photovoltaic cell module in the photovoltaic array. Running status, m and n are positive integers;
在光伏阵列处于每个典型运行状态时,采集光伏阵列的阵列干路参数、各个支路对应的阵列支路参数以及不同支路之间的阵列间电压差;When the photovoltaic array is in each typical operating state, collect the array trunk parameters of the photovoltaic array, the corresponding array branch parameters of each branch, and the voltage difference between the arrays between different branches;
根据阵列干路参数、阵列支路参数以及阵列间电压差构建故障特征向量,根据故障特征向量构建得到光伏阵列的数据样本集,将数据样本集划分为训练样本集和测试样本集;Construct fault feature vectors according to the array trunk parameters, array branch parameters, and the voltage difference between the arrays. According to the fault feature vectors, a data sample set of the photovoltaic array is constructed, and the data sample set is divided into a training sample set and a test sample set;
基于随机森林算法利用训练样本集训练得到光伏阵列故障诊断模型,并利用测试样本集对光伏阵列故障诊断模型进行测试,光伏阵列故障诊断模型中包括N个决策树,N为正整数且N≥2;Based on the random forest algorithm, the photovoltaic array fault diagnosis model is trained by using the training sample set, and the photovoltaic array fault diagnosis model is tested by using the test sample set. The photovoltaic array fault diagnosis model includes N decision trees, where N is a positive integer and N ≥ 2 ;
利用测试完成的光伏阵列故障诊断模型对待诊断光伏阵列进行诊断,得到N个决策树对各个典型运行状态的投票结果;The PV array fault diagnosis model completed by the test is used to diagnose the PV array to be diagnosed, and the voting results of N decision trees for each typical operating state are obtained;
根据各个典型运行状态对应的投票结果得到待诊断光伏阵列的故障诊断结果,故障诊断结果用于指示光伏阵列中的各个光伏电池组件的运行状态。The fault diagnosis result of the photovoltaic array to be diagnosed is obtained according to the voting results corresponding to each typical operating state, and the fault diagnosis result is used to indicate the operating state of each photovoltaic cell module in the photovoltaic array.
其进一步的技术方案为,阵列干路参数和每个阵列支路参数分别包括电路的最大功率点电压、最大功率点电流、开路电压和短路电流。Its further technical solution is that the array trunk parameters and each array branch parameter include the maximum power point voltage, the maximum power point current, the open circuit voltage and the short circuit current of the circuit, respectively.
其进一步的技术方案为,光伏阵列故障诊断模型中包括N个决策树及每个决策树对应的权值,则利用测试完成的光伏阵列故障诊断模型对待诊断光伏阵列进行诊断,包括:Its further technical solution is that the photovoltaic array fault diagnosis model includes N decision trees and the corresponding weights of each decision tree, and the photovoltaic array fault diagnosis model completed by the test is used to diagnose the PV array to be diagnosed, including:
N个决策树对各个典型运行状态进行投票,每个决策树将自身对应的权值 作为票数投出;N decision trees vote for each typical running state, and each decision tree casts its own weight as the number of votes;
统计得到各个典型运行状态对应的投票结果。Statistics obtained the voting results corresponding to each typical operating state.
其进一步的技术方案为,基于随机森林算法利用训练样本集训练得到光伏阵列故障诊断模型,包括:Its further technical solution is to obtain a photovoltaic array fault diagnosis model by training on a training sample set based on a random forest algorithm, including:
对包含p个样本的训练样本集进行N轮抽样,对于每一轮抽样,采用有放回的抽样方式抽样p次得到该轮抽样对应的子训练样本集和包外样本集,子训练集中包括该轮抽样中抽取到的p个样本,包外样本集中包括训练样本集中在该轮抽样中未被抽取的样本,p为正整数;Perform N rounds of sampling on the training sample set containing p samples. For each round of sampling, use the sampling method with replacement to sample p times to obtain the sub-training sample set and out-of-package sample set corresponding to the round of sampling. The sub-training set includes For the p samples selected in this round of sampling, the out-of-package sample set includes the samples that were not sampled in this round of sampling, and p is a positive integer;
根据每一轮抽样对应的子训练样本集训练得到一个决策树,并利用该轮抽样对应的包外样本集对决策树进行测试得到包外准确率,根据包外准确率计算得到决策树对应的权值;A decision tree is trained according to the sub-training sample set corresponding to each round of sampling, and the decision tree is tested by using the out-of-package sample set corresponding to the round of sampling to obtain the out-of-package accuracy rate. Weight
对训练得到的N个决策树和每个决策树对应的权值进行汇总得到光伏阵列故障诊断模型。The training N decision trees and the weights corresponding to each decision tree are summarized to obtain a photovoltaic array fault diagnosis model.
其进一步的技术方案为,根据包外准确率计算得到决策树对应的权值,包括计算:Its further technical solution is to calculate the weight corresponding to the decision tree according to the out-of-package accuracy rate, including calculation:
Figure PCTCN2018101743-appb-000001
Figure PCTCN2018101743-appb-000001
其中,w(i)为第i个决策树对应的权值,Hoob(i)为第i个决策树对应的包外准确率,i和j均为参数,1≤i≤N且i为正整数。Among them, w (i) is the weight corresponding to the i-th decision tree, Hoob (i) is the out-of-package accuracy rate corresponding to the i-th decision tree, i and j are parameters, 1≤i≤N and i is positive Integer.
其进一步的技术方案为,该方法还包括:In a further technical solution, the method further includes:
将N个包外样本集进行汇总得到总包外样本集,对于总包外样本集中的每个样本,利用训练得到的光伏阵列故障诊断模型对样本进行测试得到初始包外准确率;The N out-of-package sample sets are aggregated to obtain a total out-of-package sample set. For each sample in the out-of-package sample set, the samples are tested using the trained photovoltaic array fault diagnosis model to obtain the initial out-of-package accuracy rate;
样本共包括K个故障特征,在其中第k个故障特征中加入噪声,并利用光伏阵列故障诊断模型重新对样本进行测试得到新的包外准确率,计算初始包外准确率和新的包外准确率之间的差值确定第k个特征的重要性度量值,K为正整数,k为参数且k的起始值为1;The sample includes a total of K fault features. Noise is added to the k-th fault feature, and the sample is tested again using the photovoltaic array fault diagnosis model to obtain a new out-of-package accuracy rate. The initial out-of-package accuracy rate and the new out-of-package accuracy rate are calculated. The difference between the accuracy rates determines the importance metric of the kth feature, where K is a positive integer, k is a parameter, and the starting value of k is 1;
在k<K时,令k=k+1,并再次执行在其中第k个故障特征中加入噪声的步骤;直至k=K时得到K个故障特征的重要性度量值;When k <K, let k = k + 1, and perform the step of adding noise to the k-th fault feature again; until k = K, obtain the importance measure values of K fault features;
对于训练样本集中的每个样本,删除样本的K个故障特征中重要性度量值 最小的t个故障特征,得到处理后的训练样本集;For each sample in the training sample set, delete the t fault features with the smallest importance metric among the K fault features of the sample to obtain a processed training sample set;
基于随机森林算法利用处理后的训练样本集构建得到光伏阵列故障诊断模型。Based on the random forest algorithm, the processed training sample set is used to construct a photovoltaic array fault diagnosis model.
其进一步的技术方案为,根据各个典型运行状态对应的投票结果得到待诊断光伏阵列的故障诊断结果,包括:Its further technical solution is to obtain the fault diagnosis result of the photovoltaic array to be diagnosed according to the voting results corresponding to each typical operating state, including:
若最高的投票数对应的典型运行状态仅包括一个,则将最高的投票数对应的典型运行状态作为故障诊断结果输出;If the typical operating state corresponding to the highest number of votes includes only one, the typical operating state corresponding to the highest number of votes is output as a fault diagnosis result;
若最高的投票数对应的典型运行状态包括至少两个,则选取权值最高的L个决策树对各个典型运行状态重新进行投票,若存在一个典型运行状态获得最多个决策树的选择,则将典型运行状态作为故障诊断结果输出;否则将权值最高的决策树选择的典型运行状态作为故障诊断结果输出,L为正整数且2≤L<N。If the typical running state corresponding to the highest number of votes includes at least two, the L decision trees with the highest weights are selected to vote for each typical running state. If there is a typical running state to obtain the selection of the most decision trees, then The typical running state is output as the fault diagnosis result; otherwise, the typical running state selected by the decision tree with the highest weight is output as the fault diagnosis result, where L is a positive integer and 2≤L <N.
本发明的有益技术效果是:The beneficial technical effects of the present invention are:
1、本申请公开了一种基于改进随机森林算法的光伏阵列故障诊断方法,采集光伏阵列总干路和各个支路的参数以及不同支路之间的阵列间电压差,光伏阵列总干路和各个支路的参数可以反映光伏阵列中各条支路的运行状态,从而诊断出光伏阵列中哪条支路发生了何种故障,而不同支路之间的阵列间电压差可以反映出支路中各个光伏电池组件的运行状态,从而诊断出具体是哪个光伏电池组件发生了何种故障,两类信息的结合有效地实现了光伏阵列的故障定位;同时利用随机森林算法,该算法适用于实际光伏阵列的特点,克服了传统神经网络算法需要数据量大、训练时间长等问题,能够简单快速地完成诊断任务。1. The present application discloses a photovoltaic array fault diagnosis method based on an improved random forest algorithm, which collects parameters of the main trunk of the photovoltaic array and each branch, and the voltage difference between the arrays between different branches. The parameters of each branch can reflect the operating status of each branch in the photovoltaic array, thereby diagnosing which branch in the photovoltaic array has what kind of failure, and the voltage difference between the arrays between different branches can reflect the branch The operating status of each photovoltaic cell module in the system, so as to diagnose which photovoltaic cell module is faulty. The combination of the two types of information effectively achieves the fault location of the photovoltaic array. At the same time, the random forest algorithm is used, which is suitable for practical applications. The characteristics of photovoltaic arrays overcome the problems of large amount of data and long training time required by traditional neural network algorithms, and can complete diagnostic tasks simply and quickly.
2、本申请公开的方法基于数据驱动的思想,将随机森林算法、决策树赋权、投票平局处理和变量重要性度量结合起来,实现光伏阵列的故障诊断并有效提升诊断精度。2. The method disclosed in this application is based on a data-driven idea, and combines random forest algorithm, decision tree weighting, voting tie processing, and variable importance measurement to realize fault diagnosis of photovoltaic arrays and effectively improve diagnostic accuracy.
3、本申请公开的方法采用改进随机森林的故障诊断模型结构,通过多个决策树的集成,构建了一个强分类器,在利用决策树赋权、投票平局处理和故障特征重要性度量等一系列步骤进行改进,最后构建改进随机森林算法,诊断结果也表明改进后的算法虽然训练时间稍有延长,但精度更高,诊断可靠性更强。3. The method disclosed in this application uses an improved random forest fault diagnosis model structure. Through the integration of multiple decision trees, a strong classifier is constructed, which uses decision tree weighting, voting tie processing, and fault feature importance measures. A series of steps were improved, and finally an improved random forest algorithm was constructed. The diagnosis results also show that although the improved algorithm has slightly longer training time, it has higher accuracy and more reliable diagnosis.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本申请公开的光伏阵列故障诊断方法的流程图。FIG. 1 is a flowchart of a photovoltaic array fault diagnosis method disclosed in the present application.
图2是3*3的光伏阵列的搭建示意图。Figure 2 is a schematic diagram of the construction of a 3 * 3 photovoltaic array.
图3-a是单个太阳能电池组件在不同光照强度下的U-I输出特性曲线。Figure 3-a is the U-I output characteristic curve of a single solar cell module under different light intensities.
图3-b是单个太阳能电池组件在不同光照强度下的U-P输出特性曲线。Figure 3-b is the U-P output characteristic curve of a single solar cell module under different light intensities.
图3-c是单个太阳能电池组件在不同温度情况下的U-I输出特性曲线。Figure 3-c is the U-I output characteristic curve of a single solar cell module under different temperature conditions.
图3-d是单个太阳能电池组件在不同温度情况下的U-P输出特性曲线。Figure 3-d is the U-P output characteristic curve of a single solar cell module under different temperature conditions.
图4是单个太阳能电池组件在光照强度600W/m2、800W/m2、1000W/m2情况下的输出特性曲线。FIG. 4 is an output characteristic curve of a single solar cell module at a light intensity of 600 W / m2, 800 W / m2, and 1000 W / m2.
图5-a是光伏阵列在无故障和发生单模式故障时的输出特性曲线比较图。Figure 5-a is a comparison of the output characteristic curves of a photovoltaic array when there is no fault and when a single-mode fault occurs.
图5-b是光伏阵列在无故障和发生多模式故障时的输出特性曲线比较图。Figure 5-b is a comparison diagram of the output characteristic curves of a photovoltaic array when there is no fault and when a multi-mode fault occurs.
图6-a是光伏阵列在一个支路中有一个光伏电池组件短路时的输出特性曲线。Figure 6-a is the output characteristic curve of a photovoltaic array when a photovoltaic cell module is shorted in a branch.
图6-b是光伏阵列在一个支路开路时的输出特性曲线。Figure 6-b is the output characteristic curve of a photovoltaic array when one branch is open.
图6-c是光伏阵列在一个支路中有两个光伏电池组件存在阴影时的输出特性曲线。Figure 6-c is the output characteristic curve of a photovoltaic array when two photovoltaic cell modules in a branch have shadows.
图6-d是光伏阵列在一个支路中有一个光伏电池组件短路,另一个支路中有一个光伏电池组件存在阴影时的输出特性曲线。Figure 6-d shows the output characteristic curve of a photovoltaic array when one photovoltaic cell module is short-circuited in one branch, and one photovoltaic cell module is shadowed in the other branch.
具体实施方式detailed description
下面结合附图对本发明的具体实施方式做进一步说明。The specific embodiments of the present invention are further described below with reference to the accompanying drawings.
本申请公开了一种基于改进随机森林算法的光伏阵列故障诊断方法,请参考图1,整个故障诊断流程包括如下步骤:This application discloses a photovoltaic array fault diagnosis method based on an improved random forest algorithm. Please refer to FIG. 1. The entire fault diagnosis process includes the following steps:
第一步,搭建m*n形式的光伏阵列,该光伏阵列包括并联的n个支路,每个支路包括串联的m个光伏电池组件,m和n均为正整数。光伏阵列中的光伏电池组件通常是太阳能电池组件,各个光伏电池组件规格均相同,以保证光伏阵列的电位平衡,并且每个光伏电池组件并联旁路二极管、每个支路串联隔离二极管,同时快速熔断器、电压和电流传感器等保护、检测装置均安装正确并正常工作。The first step is to build a photovoltaic array in the form of m * n. The photovoltaic array includes n branches in parallel, and each branch includes m photovoltaic cell modules in series, where m and n are positive integers. The photovoltaic cell modules in a photovoltaic array are usually solar cell modules. The specifications of each photovoltaic cell module are the same to ensure the potential balance of the photovoltaic array. Each photovoltaic cell module is connected with a bypass diode in parallel and each branch in series with an isolation diode. Protection, detection devices such as fuses, voltage and current sensors are correctly installed and working properly.
本申请以n=3、m=3为例,搭建的3*3的光伏阵列的示意图请参考图2,共包括支路A、支路B和支路C,每个支路中包括3个串联的光伏电池组件,图2未示出旁路二极管、隔离二极管等器件。本申请光伏阵列中的光伏电池组件采用无锡尚德太阳能有限公司的SunTech STP 270-24/Vd型太阳能电池组件, 该型号的太阳能电池组件的参数规格表如下表所示:This application takes n = 3 and m = 3 as examples. For a schematic diagram of a 3 * 3 photovoltaic array, please refer to FIG. 2. It includes branch A, branch B, and branch C. Each branch includes 3 branches. For photovoltaic cells connected in series, Figure 2 does not show devices such as bypass diodes and isolation diodes. The photovoltaic cell module in the photovoltaic array of this application uses SunTech STP 270-24 / Vd solar cell module of Suntech Solar Co., Ltd. The parameter specification table of this type of solar cell module is shown in the following table:
参数类型Parameter Type 参数值Parameter value
开路电压(Voc)Open circuit voltage (Voc) 44.5V44.5V
最大功率点电压(Vm)Maximum power point voltage (Vm) 35.0V35.0V
短路电流(Isc)Short-circuit current (Isc) 8.20A8.20A
最大功率点电流(Im)Maximum power point current (Im) 7.71A7.71A
最大功率(Pmax)Maximum power (Pmax) 270W270W
运行温度Operating temperature -40℃~+85℃-40 ℃ ~ + 85 ℃
系统最大承受电压Maximum withstand voltage of the system 1000V DC1000V DC
串联后保险丝的额定电流Rated current of fuse after series connection 20A20A
功率误差Power error ±3%± 3%
在保持温度不变的情况下,光照强度从200W/m 2上升到1000W/m 2(分别为200、400、600、800和1000W/m 2)时的U-I(电压-电流)输出特性曲线如图3-a所示、U-P(电压-功率)输出特性曲线如图3-b所示。在保持光照强度不变的情况下,温度从35℃下降到15℃(分别为35℃、30℃、25℃、20℃和15℃)时的U-I输出特性曲线如图3-c所示、U-P(电压-功率)输出特性曲线如图3-d所示。特别的,产品手册中给出了该型号的太阳能电池组件在光照强度为600W/m 2、800W/m 2和1000W/m 2情况下的输出特性曲线如图4所示。 With the temperature kept constant, the UI (voltage-current) output characteristic curve when the light intensity rises from 200 W / m 2 to 1000 W / m 2 (200, 400, 600, 800, and 1000 W / m 2 respectively ) is as follows Figure 3-a shows the UP (voltage-power) output characteristic curve shown in Figure 3-b. Under the condition of keeping the light intensity constant, the UI output characteristic curve when the temperature drops from 35 ° C to 15 ° C (respectively 35 ° C, 30 ° C, 25 ° C, 20 ° C, and 15 ° C) is shown in Fig. The UP (voltage-power) output characteristic curve is shown in Figure 3-d. In particular, the product manual gives the output characteristic curve of this type of solar cell module under the light intensity of 600W / m 2 , 800W / m 2 and 1000W / m 2 as shown in FIG. 4.
第二步,确定光伏阵列在运行过程中的典型运行状态,本申请中的光伏电池组件的典型运行状态用于指示光伏电池组件中的各个光伏电池组件的运行状态。光伏阵列在实际运行时会出现较多类型的故障,主要有四类故障:短路故障、开路故障、阴影故障以及混合故障。当光伏阵列中的一个支路或多个支路中存在光伏电池组件发生故障时,该发生故障的支路以及整个光伏阵列的输出特性曲线都会发生变化,如图5-a示出了光伏阵列在无故障和发生单模式故障时的故障支路、故障阵列、正常支路和正常阵列的输出特性曲线,图5-b示出了光伏阵列在无故障和发生多模式故障时的故障支路、故障阵列、正常支路和正常阵列的输出特性曲线,其中正常支路是指支路中各个光伏电池组件均正常运行的支路,正常阵列是指各个支路均正常运行的光伏阵列,故障支路是支路中存在光伏电池组件发生故障的支路,故障阵列是指有支路发生故障的光伏阵列。由图中可以看出,正常支路的输出电流大,各个光伏电池组件的输出也相等,而故障支路的输出电流能力较弱,能看到明显的电流较低。输出特性曲线存在两个区域:高电压区域和大电流区域,在高电压区域可以通过检测各个支路的输出电流来判断支路的情况,而在大电流区域,可以检测光伏电池组件的输出电压来实现故障定位。The second step is to determine the typical operating state of the photovoltaic array during operation. The typical operating state of the photovoltaic cell module in this application is used to indicate the operating state of each photovoltaic cell module in the photovoltaic cell module. There are many types of faults in photovoltaic arrays during actual operation. There are four main types of faults: short-circuit faults, open-circuit faults, shadow faults, and mixed faults. When one or more branches in a photovoltaic array have a photovoltaic cell component fault, the faulty branch and the output characteristic curve of the entire photovoltaic array will change, as shown in Figure 5-a. Output characteristics of faulty branches, faulty arrays, normal branches, and normal arrays when there are no faults and single-mode faults. Figure 5-b shows the faulty branches of photovoltaic arrays when there are no faults and multi-mode faults occur. , Fault array, normal branch and normal array output characteristic curve, where the normal branch refers to the branch in which each photovoltaic cell module is operating normally, the normal array refers to the photovoltaic array in which each branch is operating normally, the fault A branch is a branch in which a photovoltaic cell module fails, and a fault array refers to a photovoltaic array in which a branch fails. It can be seen from the figure that the output current of the normal branch is large, and the output of each photovoltaic cell module is also equal, while the output current capacity of the faulty branch is weak, and it can be seen that the current is significantly lower. The output characteristic curve has two regions: high voltage region and high current region. In the high voltage region, the condition of the branch can be determined by detecting the output current of each branch. In the high current region, the output voltage of the photovoltaic cell module can be detected. To achieve fault location.
光伏阵列在运行过程中可能出现上述四类故障,再加上正常运行的情况, 光伏阵列的运行状态主要包括五大类:正常运行状态、短路故障状态、开路故障状态、阴影故障状态以及混合故障状态:The above four types of faults may occur during the operation of photovoltaic arrays. In addition to the normal operating conditions, the operating states of photovoltaic arrays mainly include five categories: normal operating states, short circuit fault states, open circuit fault states, shadow fault states, and mixed fault states :
(1)、正常运行状态,即所有支路均正常运行。(1) Normal operating state, that is, all branches are operating normally.
(2)、短路故障状态,即任意一条支路或多条支路短路。需要说明的是,一条支路中不可能所有光伏电池组件都短路,若全部光伏电池组件都短路,则熔断装置熔断,该支路会开路。(2) Short circuit fault state, that is, any one branch or multiple branches are short-circuited. It should be noted that it is impossible for all photovoltaic cell modules to be short-circuited in a branch. If all photovoltaic cell modules are short-circuited, the fuse device is blown and the branch will be opened.
(3)、开路故障状态,即任意一条支路或多条支路开路。(3) Open circuit fault state, that is, any branch or multiple branches are open.
(4)、阴影故障状态,即任意一条支路或多条支路存在阴影,与短路故障状态不同,可以存在一条支路中所有光伏电池组件都存在阴影的情况,因此该状态的数量较多。(4) The shadow fault state, that is, any branch or multiple branches have shadows. Unlike the short-circuit fault state, there can be shadows of all photovoltaic cell modules in a branch, so the number of this state is large. .
(5)、混合故障状态,主要包括光伏阵列中同时发生短路故障和阴影故障的情况。(5) Mixed fault states, which mainly include the case where a short-circuit fault and a shadow fault occur simultaneously in the photovoltaic array.
对于这五大类运行状态,可以采用两级状态分类的方式,首先进行粗略分类从而定位到光伏阵列中的支路,这一步骤不涉及故障定位,可以根据实际情况和经验选择若干最为常见的运行状态。需要说明的是,在光伏系统中,系统的重度故障都是在轻度故障的基础上演变而来,直接发生重度故障的几率微乎其微,并且重度故障的故障类型排列组合过多,所以本申请主要考虑轻、中度故障,即考虑光伏阵列中至少存在一条支路处于正常运行状态的情况。比如在图2所示的光伏阵列中,假设光伏阵列中的支路C始终处于正常运行状态,则可以选择如下20种运行状态作为第一级状态分类的结果,如下表所示:For these five types of operating states, two-level state classification can be adopted. First, rough classification is performed to locate the branches in the photovoltaic array. This step does not involve fault location. You can choose some of the most common operations based on actual conditions and experience. status. It should be noted that in photovoltaic systems, the major faults of the system have evolved on the basis of minor faults. The probability of direct major faults is very small, and the types and combinations of major faults are too many, so this application mainly Consider light and moderate faults, that is, consider that at least one branch in the photovoltaic array is in a normal operating state. For example, in the photovoltaic array shown in FIG. 2, assuming that the branch C in the photovoltaic array is always in a normal operating state, the following 20 operating states can be selected as the result of the first-level state classification, as shown in the following table:
Figure PCTCN2018101743-appb-000002
Figure PCTCN2018101743-appb-000002
Figure PCTCN2018101743-appb-000003
Figure PCTCN2018101743-appb-000003
以状态标签F2、F7、F10和F18对应的运行状态为例,他们的输出特性曲线依次如图6-a、6-b、6-c和6-d所示。在第一级状态分类的基础上,可以进行第二级状态分类从而定位到支路中的光伏电池组件,从而实现故障定位。以状态标签F2为例,该情况表示一个支路中有一个光伏电池组件短路,在图2所示的3*3的光伏阵列中,假设光伏阵列中的支路C处于正常运行状态,则该情况表示支路A或支路B上存在一个光伏电池组件短路,共包括6种情况,则在此基础上的第二级状态分类的结果如下表所示:Taking the operating states corresponding to the status labels F2, F7, F10, and F18 as examples, their output characteristic curves are shown in order in Figures 6-a, 6-b, 6-c, and 6-d. On the basis of the first-level status classification, the second-level status classification can be performed to locate the photovoltaic cell module in the branch, thereby achieving fault location. Taking the status label F2 as an example, this situation indicates that a photovoltaic cell module in a branch is short-circuited. In the 3 * 3 photovoltaic array shown in FIG. 2, assuming that the branch C in the photovoltaic array is in a normal operating state, the The condition indicates that there is a short-circuit of a photovoltaic cell module on branch A or branch B. There are 6 cases in total, and the results of the second-level state classification based on this are shown in the following table:
Figure PCTCN2018101743-appb-000004
Figure PCTCN2018101743-appb-000004
同样的,在状态标签F3对应的第一级状态分类的基础上进行的第二级状态分类的结果可以如下表所示:Similarly, the results of the second-level status classification based on the first-level status classification corresponding to the status label F3 can be shown in the following table:
Figure PCTCN2018101743-appb-000005
Figure PCTCN2018101743-appb-000005
其余各种情况都可以以此类推。至此,通过两级状态分类就得到了光伏阵列的各个典型运行状态。The rest can be deduced by analogy. At this point, the typical operating states of the photovoltaic array are obtained through two-level state classification.
第三步,光伏阵列的电路参数的采集。在光伏阵列处于每个典型运行状态时,采集光伏阵列的阵列干路参数、各个支路对应的阵列支路参数以及不同支路之间的阵列间电压差。其中,光伏阵列的阵列干路参数是光伏阵列的总干路上的参数,本申请中的阵列干路参数包括总干路的最大功率点电压V m、最大功 率点电流I m、开路电压V oc和短路电流I sc,这是由于这四个参数能够很好的刻画光伏电池组件的U-I输出特性曲线。类似的,对于光伏阵列中的每个支路,采集支路的最大功率点电压V m、最大功率点电流I m、开路电压V oc和短路电流I sc作为阵列支路参数。如图2所示,在实际操作时,光伏阵列的电路参数可以通过在光伏阵列中配置电压传感器和电流传感器测得,对于m×n规模的光伏阵列,需要配置[n×(m-1)/2]个电压传感器和n+1个电流传感器,其中符号[]表示对n×(m-1)/2进行向上取整。电流传感器设置在各个支路中以及总干路中,用于检测支路的电流和总干路的电流,电压传感器设置在不同支路间,用于检测不同支路间的阵列间电压差,如图2中设置有3个电压传感器U a、U b和U c,电压传感器U a设置在支路A和支路B之间,电压传感器U b设置在支路B和支路C之间,电压传感器U c设置在支路C和支路A之间。支路电流的检测和支路间电压差的检测实质上都是为了检测每个光伏电池组件的电压和电流,通过电压和电流等参数数值的变化反映光伏电池组件的运行情况,从而实现行故障定位,其原理如下: The third step is to collect the circuit parameters of the photovoltaic array. When the photovoltaic array is in each typical operating state, collect the array trunk parameters of the photovoltaic array, the corresponding array branch parameters of each branch, and the voltage difference between the arrays between different branches. The array trunk parameters of the photovoltaic array are parameters on the main trunk of the photovoltaic array. The array trunk parameters in this application include the maximum power point voltage V m , the maximum power point current I m , and the open circuit voltage V oc of the main trunk. And short-circuit current I sc , which is because these four parameters can well characterize the UI output characteristic curve of the photovoltaic cell module. Similarly, for each branch in the photovoltaic array, the maximum power point voltage V m , the maximum power point current I m , the open circuit voltage V oc, and the short-circuit current I sc of the branch are collected as the array branch parameters. As shown in Figure 2, in actual operation, the circuit parameters of a photovoltaic array can be measured by configuring a voltage sensor and a current sensor in the photovoltaic array. For an m × n scale photovoltaic array, it is necessary to configure [n × (m-1) / 2] voltage sensors and n + 1 current sensors, where the symbol [] means rounding up n × (m-1) / 2. The current sensor is set in each branch and the main trunk to detect the current of the branch and the current of the main trunk. The voltage sensor is set between different branches to detect the voltage difference between the arrays between different branches. As shown in FIG. 2, three voltage sensors U a , U b and U c are provided . The voltage sensor U a is disposed between the branch A and the branch B, and the voltage sensor U b is disposed between the branch B and the branch C. The voltage sensor U c is arranged between the branch C and the branch A. The detection of the branch current and the detection of the voltage difference between the branches are essentially to detect the voltage and current of each photovoltaic cell module. The changes in the parameter values such as voltage and current reflect the operation of the photovoltaic cell module, thereby achieving line faults. The principle of positioning is as follows:
在光伏阵列中,光伏阵列的电压矩阵U pv  array为: In a photovoltaic array, the voltage matrix U pv array of the photovoltaic array is:
Figure PCTCN2018101743-appb-000006
Figure PCTCN2018101743-appb-000006
其中,U 11表示光伏阵列中第1个支路的第1个光伏电池组件的输出电压,也即图2中的PV1的输出电压,U mn表示光伏阵列中第n个支路的第m个光伏电池组件的输出电压,也即图2中的PV9的输出电压,依次类推。则光伏阵列有等式成立如下: Among them, U 11 represents the output voltage of the first photovoltaic cell module of the first branch in the photovoltaic array, that is, the output voltage of PV1 in FIG. 2, and U mn represents the m-th branch of the n-th branch in the photovoltaic array The output voltage of the photovoltaic cell module, that is, the output voltage of PV9 in FIG. 2, and so on. Then the photovoltaic array equation holds as follows:
Figure PCTCN2018101743-appb-000007
Figure PCTCN2018101743-appb-000007
其中,U array是光伏阵列的输出电压,则
Figure PCTCN2018101743-appb-000008
也即,每个光伏电池组件的输出电压应该是整个光伏阵列的输出电压的1/m。
Where U array is the output voltage of the photovoltaic array, then
Figure PCTCN2018101743-appb-000008
That is, the output voltage of each photovoltaic cell module should be 1 / m of the output voltage of the entire photovoltaic array.
定义光伏阵列的权值矩阵A pv  array,即支路间的电压传感器放置点电位为: Define the PV matrix weight matrix A pv array , that is, the potential of the voltage sensor placement point between the branches is:
Figure PCTCN2018101743-appb-000009
Figure PCTCN2018101743-appb-000009
其中,u 11=U 21-U 11,u 21=U 31-U 21,u (m-1)1=U m1-U (m-1)1,依次类推,则当光伏阵列无故障、处于正常运行状态时,权值矩阵A pv  array中各列元素的值是等差数列、各行元素的值是相等的。若某个支路中的光伏电池组件发生故障,其他光伏电 池组件的电压也会发生变化,利用如下公式计算变化后的光伏电池组件的输出电压为: Among them, u 11 = U 21 -U 11 , u 21 = U 31 -U 21 , u (m-1) 1 = U m1 -U (m-1) 1 , and so on. When the photovoltaic array is fault-free, In the normal running state, the values of the elements in each column in the weight matrix A pv array are equal difference sequences, and the values of the elements in each row are equal. If a photovoltaic cell module in one branch fails, the voltage of other photovoltaic cell modules will also change. Use the following formula to calculate the output voltage of the changed photovoltaic cell module as
U×m=U bad×(m-x) U × m = U bad × (mx)
其中,U表示正常运行的光伏电池组件的输出电压,m为支路中包含的光伏电池组件的总个数,x为故障支路中发生故障的光伏电池组件的个数,U bad为故障支路中正常运行的光伏电池组件的输出电压。比如在图2中,光伏电池组件PV1、PV2和PV3的输出电压分别为
Figure PCTCN2018101743-appb-000010
当光伏电池组件PV1发生故障时,它会被旁路二极管短路,则光伏电池组件PV1输出为0,上式中的x=1,则有
Figure PCTCN2018101743-appb-000011
则光伏电池组件PV2和PV3两端的输出电压即为
Figure PCTCN2018101743-appb-000012
也即相应的光伏电池组件PV2和PV3的输出电压将会增加。则当光伏阵列处于故障状态(包括短路、开路、阴影和混合故障)时,权值矩阵A pv  array中无故障的支路的各行元素相等、存在故障的支路的各行元素与无故障的支路的各行元素不相等;且存在故障的支路的各列元素不是等差数列。下表示出了部分短路故障情况下,各个电压传感器采集到的阵列间电压差的特征值:
Among them, U is the output voltage of the photovoltaic cell module in normal operation, m is the total number of photovoltaic cell modules included in the branch, x is the number of photovoltaic cell modules that have failed in the faulty branch, and U bad is the faulty branch. The output voltage of a photovoltaic cell module operating normally in the circuit. For example, in Figure 2, the output voltages of photovoltaic cell modules PV1, PV2 and PV3 are
Figure PCTCN2018101743-appb-000010
When the PV cell module PV1 fails, it will be short-circuited by the bypass diode, then the output of the PV cell module PV1 is 0, and x = 1 in the above formula, then
Figure PCTCN2018101743-appb-000011
Then the output voltage across the photovoltaic cell modules PV2 and PV3 is
Figure PCTCN2018101743-appb-000012
That is, the output voltage of the corresponding photovoltaic cell modules PV2 and PV3 will increase. When the photovoltaic array is in a fault state (including short circuit, open circuit, shadow, and mixed faults), the elements of the rows of the non-faulty branch in the weight matrix A pv array are equal, and the elements of the rows of the faulty branch and the non-faulty branch The elements of each row of the road are not equal; and the elements of each column of the branch with a fault are not equal difference sequences. The following table shows the characteristic values of the voltage difference between the arrays collected by each voltage sensor in the case of a partial short-circuit fault:
状态标签Status label UaUa UbUb UcUc 状态标签Status label UaUa UbUb UcUc
F1F1 Uarray/3Uarray / 3 Uarray/3Uarray / 3 Uarray/3Uarray / 3 F3-46F3-46 2Uarray/32Uarray / 3 2Uarray/32Uarray / 3 Uarray/3Uarray / 3
F2-1F2-1 Uarray/6Uarray / 6 Uarray/3Uarray / 3 2Uarray/32Uarray / 3 F3-56F3-56 -Uarray/3-Uarray / 3 2Uarray/32Uarray / 3 Uarray/3Uarray / 3
F2-2F2-2 Uarray/6Uarray / 6 Uarray/3Uarray / 3 Uarray/6Uarray / 6 F4-14F4-14 Uarray/2Uarray / 2 Uarray/6Uarray / 6 2Uarray/32Uarray / 3
F2-3F2-3 2Uarray/32Uarray / 3 Uarray/3Uarray / 3 Uarray/6Uarray / 6 F4-15F4-15 00 Uarray/6Uarray / 6 2Uarray/32Uarray / 3
F2-4F2-4 2Uarray/32Uarray / 3 Uarray/6Uarray / 6 Uarray/3Uarray / 3 F4-16F4-16 00 2Uarray/32Uarray / 3 2Uarray/32Uarray / 3
F2-5F2-5 Uarray/6Uarray / 6 Uarray/6Uarray / 6 Uarray/3Uarray / 3 F4-24F4-24 Uarray/2Uarray / 2 Uarray/6Uarray / 6 Uarray/6Uarray / 6
F2-6F2-6 Uarray/6Uarray / 6 2Uarray/32Uarray / 3 Uarray/3Uarray / 3 F4-25F4-25 00 Uarray/6Uarray / 6 Uarray/6Uarray / 6
F3-12F3-12 -Uarray/3-Uarray / 3 Uarray/3Uarray / 3 2Uarray/32Uarray / 3 F4-26F4-26 00 2Uarray/32Uarray / 3 Uarray/6Uarray / 6
F3-13F3-13 2Uarray/32Uarray / 3 Uarray/3Uarray / 3 2Uarray/32Uarray / 3 F4-34F4-34 UarrayUarray Uarray/6Uarray / 6 Uarray/6Uarray / 6
F3-23F3-23 2Uarray/32Uarray / 3 Uarray/3Uarray / 3 -Uarray/3-Uarray / 3 F4-35F4-35 Uarray/2Uarray / 2 Uarray/6Uarray / 6 Uarray/6Uarray / 6
F3-45F3-45 2Uarray/32Uarray / 3 -Uarray/3-Uarray / 3 Uarray/3Uarray / 3 F4-36F4-36 Uarray/2Uarray / 2 2Uarray/32Uarray / 3 Uarray/6Uarray / 6
上述状态标签的表示含义可以参考上述第二步中的内容,本领域技术人员理解的是,状态标签F4-14表示的情况是支路A上的PV1和支路B上的PV4短路,以此类推,本申请不一一赘述。For the meaning of the above status label, please refer to the content in the second step above. Those skilled in the art understand that the situation indicated by status label F4-14 is that PV1 on branch A and PV4 on branch B are short-circuited. By analogy, this application does not repeat them one by one.
当光伏阵列存在阴影故障时,各个电压传感器采集到的阵列间电压差也会在一个范围内,部分阴影故障情况的特征值如下表所示:When the photovoltaic array has a shadow fault, the voltage difference between the arrays collected by each voltage sensor will also be within a range. The characteristic values of some shadow fault conditions are shown in the following table:
状态标签Status label UaUa UbUb UcUc
F9-1F9-1 Uarray/6<Ua<Uarray/3Uarray / 6 <Ua <Uarray / 3 Uarray/3Uarray / 3 Uarray/3<Uc<2Uarray/3Uarray / 3 <Uc <2Uarray / 3
F9-2F9-2 Uarray/6<Ua<Uarray/3Uarray / 6 <Ua <Uarray / 3 Uarray/3Uarray / 3 Uarray/6<Uc<Uarray/3Uarray / 6 <Uc <Uarray / 3
F9-3F9-3 Uarray/3<Ua<2Uarray/3Uarray / 3 <Ua <2Uarray / 3 Uarray/3Uarray / 3 Uarray/6<Uc<Uarray/3Uarray / 6 <Uc <Uarray / 3
F9-4F9-4 Uarray/3<Ua<2Uarray/3Uarray / 3 <Ua <2Uarray / 3 Uarray/6<Ub<Uarray/3Uarray / 6 <Ub <Uarray / 3 Uarray/3Uarray / 3
F9-5F9-5 Uarray/6<Ua<Uarray/3Uarray / 6 <Ua <Uarray / 3 Uarray/6<Ub<Uarray/3Uarray / 6 <Ub <Uarray / 3 Uarray/3Uarray / 3
F9-6F9-6 Uarray/6<Ua<Uarray/3Uarray / 6 <Ua <Uarray / 3 Uarray/3<Ub<2Uarray/3Uarray / 3 <Ub <2Uarray / 3 Uarray/3Uarray / 3
状态标签F9-1表示的情况是:支路A上的PV1存在阴影,同样的,本领域技术人员可以理解其余各个状态标签表示的含义,本申请不一一赘述。The situation indicated by the status label F9-1 is: there is a shadow on PV1 on the branch A. Similarly, those skilled in the art can understand the meanings of the remaining status labels, which are not described in detail in this application.
由上述论述可知,光伏阵列发生故障时会在权值矩阵A pv  array中有所体现,进而会导致电压传感器采集到的阵列间电压差发生变化,因此本申请通过对采集阵列间电压差进行分析就能实现光伏阵列的故障定位。 It can be known from the above discussion that when a photovoltaic array fails, it will be reflected in the weight matrix A pv array , which will cause the voltage difference between the arrays collected by the voltage sensor to change. Therefore, this application analyzes the voltage difference between the collection arrays. Can realize the fault location of photovoltaic array.
另外,由上可知,电压传感器采集到的阵列间电压差都是与光伏阵列的输出电压U array成比例关系的,因此可以进一步简化,不直接利用采集到的阵列间电压差,而是将阵列间电压差除以光伏阵列的输出电压U array得到电压系数,以电压系数来表示阵列间电压差。 In addition, as can be seen from the above, the voltage difference between the arrays collected by the voltage sensor is proportional to the output voltage U array of the photovoltaic array, so it can be further simplified. Instead of directly using the collected voltage difference between the arrays, the array The voltage difference is divided by the output voltage U array of the photovoltaic array to obtain a voltage coefficient, and the voltage difference is used to represent the voltage difference between the arrays.
第四步,故障特征向量和数据样本集的构建。根据采集到的阵列干路参数、阵列支路参数以及阵列间电压差构建故障特征向量,得到故障特征向量后,就能汇总并构建成光伏阵列的数据样本集,由故障特征向量汇总得到数据样本集的方法是本领域技术人员都知道的方法,本申请对此不进行展开叙述。对数据样本集进行划分得到训练样本集和测试样本集,通常可以按照2:1的比例划分成为训练样本集和测试样本集。The fourth step is the construction of fault feature vectors and data sample sets. Construct fault feature vectors based on the collected array trunk parameters, array branch parameters, and the voltage difference between the arrays. After the fault feature vectors are obtained, they can be summarized and constructed into a data sample set of photovoltaic arrays. Data samples are collected from the fault feature vectors. The collection method is a method known to those skilled in the art, which is not described in this application. The data sample set is divided to obtain a training sample set and a test sample set, which can usually be divided into a training sample set and a test sample set according to a ratio of 2: 1.
第五步,基于改进随机森林算法的光伏阵列故障诊断模型的构建。假设训练样本集为X,故障特征向量的维度为K,也即样本的故障特征的数量为K,K为正整数;故障诊断中需要建立的决策树的数目为N,N≥2且N为正整数,则基于改进随机森林算法利用训练样本集构建得到光伏阵列故障诊断模型的步骤可以简述为:The fifth step is the construction of a photovoltaic array fault diagnosis model based on an improved random forest algorithm. Suppose the training sample set is X, and the dimension of the fault feature vector is K, that is, the number of fault features of the sample is K, K is a positive integer; the number of decision trees to be established in fault diagnosis is N, N≥2, and N is Positive integer, the steps of constructing a photovoltaic array fault diagnosis model based on an improved random forest algorithm using training sample sets can be briefly described as:
1、对包含p个样本的训练样本集X进行N轮抽样,对于第i轮抽样,采用有放回的抽样方式抽样p次,由于是进行有放回地抽取,因此会出现部分样本被重复抽取而部分样本没有被抽到的情况,该轮抽样中抽取到的p个样本可以构成该轮抽样对应的子训练样本集X i,训练样本集中在该轮抽样中未被抽到的样本构成该轮抽样对应的包外样本集oob i(out-of-bag),p为正整数,i为参数,1≤i≤N且i为正整数。 1. Perform N rounds of sampling on the training sample set X containing p samples. For the i-th round of sampling, use sampling with replacement sampling p times. Because the sampling is performed with replacement, some samples will be repeated. In the case where some samples are not drawn, the p samples extracted in this round of sampling can constitute the sub-training sample set X i corresponding to this round of sampling, and the training samples are concentrated in the unsampled samples in this round of sampling. the samples corresponding to the outer wheel sets of samples packet oob i (out-of-bag ), p is a positive integer, i is the parameter, 1≤i≤N and i is a positive integer.
2、利用每一轮抽样对应的子训练样本集训练得到一个决策树,也即利用子训练样本集X i训练决策树Q i。在决策树的节点分裂时,从K个故障特征中随机选择q个故障特征,q为正整数且q≤K,计算q个故障特征中的各个故障特 征对应的基尼系数,将与最小的基尼系数对应的故障特征作为最佳分裂特征来进行节点分裂,基尼系数的计算方法可以采用现有的计算公式,本申请对此不作详细介绍。 2. A decision tree is obtained by training the sub-training sample set corresponding to each round of sampling, that is, the sub-training sample set X i is used to train the decision tree Q i . When the nodes of the decision tree are split, q fault features are randomly selected from K fault features, where q is a positive integer and q≤K, and the Gini coefficient corresponding to each fault feature in the q fault features is calculated. The fault feature corresponding to the coefficient is used as the best split feature for node splitting. The calculation method of the Gini coefficient can use the existing calculation formula, which is not described in detail in this application.
3、对于第i轮抽样,其对应的包外样本集oob i中的样本由于没有参与决策树Q i的训练,因此可以将其当做测试集使用,利用包外样本集oob i对决策树Q i进行测试从而得到该决策树Q i的包外准确率H oob(i)。 3. For the i-th round of sampling, since the samples in the corresponding out-of-package sample set oob i do not participate in the training of the decision tree Q i , it can be used as a test set, and the out-of-package sample set oob i is used for the decision tree Q. i is tested to obtain the out-of-packet accuracy H oob (i) of the decision tree Q i .
在普通的随机森林算法中,决策树的权值默认是相等的,因此部分性能优秀的决策树可能无法表现自身优势,甚至会被淹没在多数投票中,所以本申请对此进行改进,对决策树进行了加权,利用决策树的包外准确率定义评价该决策树的性能优劣,在所有决策树训练完成后,计算得到各个决策树的包外准确率,对于第i轮抽样训练得到决策树Q i,利用其对应的包外准确率H oob(i)计算该决策树Q i的权值的计算公式如下: In ordinary random forest algorithms, the weights of decision trees are equal by default. Therefore, some decision trees with excellent performance may not show their advantages and may even be overwhelmed by a majority vote. The tree is weighted, and the out-of-package accuracy rate of the decision tree is used to evaluate the performance of the decision tree. After all the decision tree training is completed, the out-of-package accuracy rate of each decision tree is calculated, and the decision is obtained for the i-th sampling training. Tree Q i , using its corresponding out-of-package accuracy rate H oob (i) to calculate the weight of the decision tree Q i is as follows:
Figure PCTCN2018101743-appb-000013
Figure PCTCN2018101743-appb-000013
其中,w(i)即第i个决策树对应的权值,j为参数。Among them, w (i) is the weight corresponding to the i-th decision tree, and j is a parameter.
4、将所有决策树和每个决策树对应的权值追加到决策树集合中,从而汇总构成光伏阵列故障诊断模型。4. Add all decision trees and the corresponding weights of each decision tree to the decision tree set, so as to form a photovoltaic array fault diagnosis model.
第六步,光伏阵列故障诊断模型的进一步优化。随着光伏阵列规模的扩大,其同时运行的支路和电压传感器的数目也会随之增加,故障特征向量的维度K也会增加,这将增加模型训练的难度,而通过包外样本能够对样本的故障特征进行重要性度量从而删减部分故障特征,具体步骤如下:In the sixth step, the photovoltaic array fault diagnosis model is further optimized. As the size of a photovoltaic array expands, the number of branches and voltage sensors running at the same time will increase, and the dimension K of the fault feature vector will increase, which will increase the difficulty of model training. The fault characteristics of the sample are measured for importance to delete some fault characteristics. The specific steps are as follows:
1、将N个包外样本集oob i进行汇总得到总包外样本集,对于总包外样本集中的每个样本,利用训练得到的光伏阵列故障诊断模型对该样本进行测试得到该样本对应的初始包外准确率e,具体的:光伏阵列故障诊断模型中共包括N个决策树,利用其中未使用该样本进行训练的各个决策树对该样本进行测试,就能确定测试正确和测试错误的决策树的个数,从而得到初始包外准确率e。 1. Aggregate N out-of-package sample sets oob i to obtain a total out-of-package sample set. For each sample in the out-of-package sample set, use the trained photovoltaic array fault diagnosis model to test the sample to obtain the corresponding sample. The initial out-of-package accuracy e, specifically: The PV array fault diagnosis model includes a total of N decision trees. Using each decision tree that is not trained with the sample to test the sample, you can determine the correct test and the wrong decision The number of trees to obtain the initial out-of-package accuracy e.
2、该样本共包括K个故障特征,在其中第k个故障特征中加入噪声,并利用光伏阵列故障诊断模型重新对该样本进行测试得到新的包外准确率e1,计算初始包外准确率和新的包外准确率之间的差值e-e1,从而确定第k个特征的重要性度量值,K为正整数,k为参数且k的起始值为1。2. The sample includes a total of K fault characteristics. Noise is added to the k-th fault characteristic, and the sample is tested again using the photovoltaic array fault diagnosis model to obtain a new out-of-package accuracy rate e1, and the initial out-of-package accuracy rate is calculated. And the new out-of-package accuracy rate, e-e1, to determine the importance metric of the k-th feature, where K is a positive integer, k is a parameter, and the starting value of k is 1.
3、在k<K时,令k=k+1,并再次执行上述步骤2,也即执行在其中第k个故障特征中加入噪声的步骤。直至k=K时,得到该样本的全部K个故障特征的重要性度量值。3. When k <K, let k = k + 1, and perform step 2 again, that is, the step of adding noise to the k-th fault feature. Until k = K, importance metric values of all K fault features of the sample are obtained.
4、利用总包外样本集中的各个样本计算得到的K个故障特征的重要性度量值总的来说是趋于一致的,但实际操作时难免有误差,因此在利用总包外样本集中的各个样本分别计算得到K个故障特征的重要性度量值后,对其汇总从而确定K个故障特征中重要性度量值最小的t个故障特征。然后对于训练样本集中的每个样本,删除样本的K个故障特征中重要性度量值最小的t个故障特征,得到处理后的训练样本集,1≤t<K。4. The importance metrics of the K fault features calculated using the samples in the outsourced sample set are generally consistent, but there are errors in actual operation. Therefore, the After the importance metric values of the K fault features are calculated for each sample, they are summed to determine the t fault features with the smallest importance metric value among the K fault features. Then for each sample in the training sample set, t fault features with the smallest importance metric among the K fault features of the sample are deleted to obtain a processed training sample set, 1 ≦ t <K.
5、基于随机森林算法利用处理后的训练样本集构建得到光伏阵列故障诊断模型。5. Based on the random forest algorithm, the processed training sample set is used to construct a photovoltaic array fault diagnosis model.
第七步,光伏阵列故障诊断模型的测试。利用测试样本集对光伏阵列故障诊断模型进行测试,可以最终得到诊断精度、平均训练时间和测试时间等指标。按照上述列举的F1-F20,对于光伏阵列的第一级状态类型进行诊断,没有进行故障特征删减,采用改进随机森林算法和普通的随机森林算法进行故障诊断,其中按照2:1的比例划分训练样本集和测试样本集,两种算法使用相同的决策树模型,决策树数目为40,实验结果所得的指标对比如下:The seventh step is to test the fault diagnosis model of the photovoltaic array. By using the test sample set to test the PV array fault diagnosis model, indicators such as diagnostic accuracy, average training time, and test time can be obtained. According to the F1-F20 listed above, the first-level state type of the photovoltaic array is diagnosed without fault feature deletion. The improved random forest algorithm and ordinary random forest algorithm are used for fault diagnosis. The training sample set and the test sample set. Both algorithms use the same decision tree model. The number of decision trees is 40. The experimental results are compared as follows:
诊断方法diagnosis method 训练时间/sTraining time / s 测试时间/sTest time / s 诊断精度/%Diagnostic accuracy /%
普通随机森林Ordinary random forest 0.9761200.976120 0.380790.38079 8989
改进随机森林Improve Random Forest 1.4723871.472387 0.387110.38711 9191
由上表可以看出,本申请提供的基于改进随机森林的算法虽然在训练时间上稍有增加,但诊断精度更高。As can be seen from the above table, although the algorithm based on the improved random forest provided in this application slightly increases the training time, the diagnosis accuracy is higher.
二级状态分类可以看成是一级状态分类的一个子类,当光伏阵列规模较大时,其状态的排列组合较多,以短路故障状态为例,统计可得短路故障状态进行二级状态分类后的数目为49,实验结果所得的指标对比如下:同样设置决策树数目为40,决策树的平均诊断精度为64.13%,故障特征向量的维度K=19,在删除重要性度量值最小的4维故障特征后,决策树的平均诊断精度上升至72.95%,整个光伏阵列故障诊断模型的诊断精度也随之提高。The second-level status classification can be regarded as a sub-class of the first-level status classification. When the size of a photovoltaic array is large, there are many permutations and combinations of the states. Taking the short-circuit fault status as an example, the short-circuit fault status can be obtained as the second-level status. The number after classification is 49, and the comparison of the experimental results is as follows: Similarly, the number of decision trees is set to 40, the average diagnosis accuracy of the decision tree is 64.13%, and the dimension of the fault feature vector is K = 19. After the 4-dimensional fault feature, the average diagnostic accuracy of the decision tree rose to 72.95%, and the diagnostic accuracy of the entire photovoltaic array fault diagnostic model also improved.
诊断方法diagnosis method 训练时间/sTraining time / s 测试时间/sTest time / s 诊断精度/%Diagnostic accuracy /%
普通随机森林Ordinary random forest 2.06462.0646 0.491090.49109 94.4994.49
改进随机森林Improve Random Forest 2.11712.1171 0.509350.50935 96.1296.12
第八步,利用测试完成的光伏阵列故障诊断模型对待诊断光伏阵列进行诊断,也即利用N个决策树对各个典型运行状态进行投票,投票时采用赋权投票的方式,也即每个决策树将自身对应的权值作为票数投出。The eighth step is to use the completed photovoltaic array fault diagnosis model to diagnose the PV array to be diagnosed, that is, to use N decision trees to vote for each typical operating state. The voting method is used to vote, that is, each decision tree The weight corresponding to itself is cast as the number of votes.
在投票完成后,统计得到各个典型运行状态对应的投票结果,然后根据各个典型运行状态对应的投票结果得到待诊断光伏阵列的故障诊断结果,得到的故障诊断结果用于指示光伏阵列中的各个光伏电池组件的运行状态,具体的:After the voting is completed, the voting results corresponding to each typical operating state are statistically obtained, and then the fault diagnosis results of the photovoltaic array to be diagnosed are obtained according to the voting results corresponding to each typical operating state. The obtained fault diagnosis results are used to indicate each photovoltaic in the photovoltaic array. The operating status of the battery assembly, specifically:
1)、检测得到最高的投票数的典型运行状态的数量。1) The number of typical operating states that detect the highest number of votes.
2)、若最高的投票数对应的典型运行状态仅包括一个,则直接将最高的投票数对应的典型运行状态作为故障诊断结果输出,否则执行如下步骤3)。2) If the typical running state corresponding to the highest number of votes includes only one, the typical running state corresponding to the highest number of votes is directly output as the fault diagnosis result, otherwise, the following step 3) is performed.
3)、在最高的投票数对应的典型运行状态包括至少两个时,本申请创造性的加入了平局处理策略,也即选取权值最高的L个决策树对各个典型运行状态重新进行投票,L为正整数且2≤L<N,根据实际需要选择。若存在一个典型运行状态获得最多个决策树的选择,则将典型运行状态作为故障诊断结果输出;否则将权值最高的决策树选择的典型运行状态作为故障诊断结果输出。比如,以L=3为例,在重新投票时,若存在两个决策树的投票结果相同,则输出该结果,否则输出权值最高的决策树的投票结果。3) When the typical running state corresponding to the highest number of votes includes at least two, this application creatively adds a draw processing strategy, that is, the L decision trees with the highest weights are selected to vote again for each typical running state, L It is a positive integer and 2≤L <N, and it is selected according to actual needs. If there is a typical operating state to obtain the selection of the most decision trees, the typical operating state is output as the fault diagnosis result; otherwise, the typical operating state selected by the decision tree with the highest weight is output as the fault diagnosis result. For example, taking L = 3 as an example, when re-voting, if the voting results of two decision trees are the same, the result is output, otherwise the voting result of the decision tree with the highest weight is output.
以上所述的仅是本申请的优选实施方式,本发明不限于以上实施例。可以理解,本领域技术人员在不脱离本发明的精神和构思的前提下直接导出或联想到的其他改进和变化,均应认为包含在本发明的保护范围之内。What has been described above are only preferred embodiments of the present application, and the present invention is not limited to the above embodiments. It can be understood that other improvements and changes directly derived or associated by those skilled in the art without departing from the spirit and concept of the present invention should be considered to be included in the protection scope of the present invention.

Claims (7)

  1. 一种基于改进随机森林算法的光伏阵列故障诊断方法,其特征在于,所述方法包括:A photovoltaic array fault diagnosis method based on an improved random forest algorithm, characterized in that the method includes:
    确定光伏阵列在运行过程中的典型运行状态,所述光伏阵列包括n个支路,每个支路中包括m个光伏电池组件,所述光伏阵列的典型运行状态用于指示所述光伏阵列中的各个光伏电池组件的运行状态,m和n均为正整数;Determine a typical operating state of a photovoltaic array during operation, the photovoltaic array includes n branches, and each branch includes m photovoltaic cell components, and the typical operating state of the photovoltaic array is used to indicate The operating state of each photovoltaic cell module, m and n are positive integers;
    在所述光伏阵列处于每个所述典型运行状态时,采集所述光伏阵列的阵列干路参数、各个支路对应的阵列支路参数以及不同支路之间的阵列间电压差;When the photovoltaic array is in each of the typical operating states, collecting an array trunk parameter of the photovoltaic array, an array branch parameter corresponding to each branch, and a voltage difference between the arrays between different branches;
    根据所述阵列干路参数、阵列支路参数以及阵列间电压差构建故障特征向量,根据所述故障特征向量构建得到所述光伏阵列的数据样本集,将所述数据样本集划分为训练样本集和测试样本集;A fault feature vector is constructed according to the array trunk parameters, array branch parameters, and voltage difference between the arrays. A data sample set of the photovoltaic array is constructed based on the fault feature vectors, and the data sample set is divided into training sample sets. And test sample sets;
    基于随机森林算法利用所述训练样本集训练得到光伏阵列故障诊断模型,并利用所述测试样本集对所述光伏阵列故障诊断模型进行测试,所述光伏阵列故障诊断模型中包括N个决策树,N为正整数且N≥2;Use the training sample set to train a photovoltaic array fault diagnosis model based on a random forest algorithm, and use the test sample set to test the photovoltaic array fault diagnosis model. The photovoltaic array fault diagnosis model includes N decision trees. N is a positive integer and N≥2;
    利用测试完成的所述光伏阵列故障诊断模型对待诊断光伏阵列进行诊断,得到所述N个决策树对各个所述典型运行状态的投票结果;Diagnose the photovoltaic array to be diagnosed by using the photovoltaic array fault diagnosis model completed by the test, and obtain voting results of the N decision trees for each of the typical operating states;
    根据各个所述典型运行状态对应的投票结果得到所述待诊断光伏阵列的故障诊断结果,所述故障诊断结果用于指示所述光伏阵列中的各个光伏电池组件的运行状态。A fault diagnosis result of the photovoltaic array to be diagnosed is obtained according to a voting result corresponding to each of the typical operating states, and the fault diagnosis result is used to indicate an operating state of each photovoltaic cell module in the photovoltaic array.
  2. 根据权利要求1所述的方法,其特征在于,The method according to claim 1, wherein:
    所述阵列干路参数和每个所述阵列支路参数分别包括电路的最大功率点电压、最大功率点电流、开路电压和短路电流。The array trunk parameters and each of the array branch parameters include a maximum power point voltage, a maximum power point current, an open circuit voltage, and a short circuit current of a circuit, respectively.
  3. 根据权利要求1或2所述的方法,其特征在于,所述光伏阵列故障诊断模型中包括N个决策树及每个所述决策树对应的权值,则所述利用测试完成的所述光伏阵列故障诊断模型对待诊断光伏阵列进行诊断,包括:The method according to claim 1 or 2, wherein the photovoltaic array fault diagnosis model includes N decision trees and weights corresponding to each of the decision trees, then the photovoltaics completed by the test are used. The array fault diagnosis model diagnoses the PV array to be diagnosed, including:
    所述N个决策树对各个所述典型运行状态进行投票,每个所述决策树将自身对应的权值作为票数投出;The N decision trees vote for each of the typical operating states, and each of the decision trees casts its corresponding weight as the number of votes;
    统计得到各个所述典型运行状态对应的投票结果。The voting results corresponding to each of the typical operating states are obtained by statistics.
  4. 根据权利要求3所述的方法,其特征在于,所述基于随机森林算法利用 所述训练样本集训练得到光伏阵列故障诊断模型,包括:The method according to claim 3, wherein the random forest algorithm based on the training sample set is used to obtain a photovoltaic array fault diagnosis model, comprising:
    对包含p个样本的训练样本集进行N轮抽样,对于每一轮抽样,采用有放回的抽样方式抽样p次得到该轮抽样对应的子训练样本集和包外样本集,所述子训练集中包括该轮抽样中抽取到的p个样本,所述包外样本集中包括所述训练样本集中在该轮抽样中未被抽取的样本,p为正整数;Perform N rounds of sampling on the training sample set containing p samples. For each round of sampling, use the sampling method with replacement to sample p times to obtain the sub-training sample set and out-of-package sample set corresponding to the round of sampling. The set includes p samples selected in the round of sampling, and the out-of-package sample set includes samples that are not drawn in the round of sampling in the training sample set, and p is a positive integer;
    根据每一轮抽样对应的子训练样本集训练得到一个决策树,并利用该轮抽样对应的包外样本集对所述决策树进行测试得到包外准确率,根据所述包外准确率计算得到所述决策树对应的权值;A decision tree is trained according to the sub-training sample set corresponding to each round of sampling, and the decision tree is tested using the out-of-package sample set corresponding to the round of sampling to obtain an out-of-package accuracy rate, which is calculated according to the out-of-package accuracy rate A weight corresponding to the decision tree;
    对训练得到的N个决策树和每个决策树对应的权值进行汇总得到所述光伏阵列故障诊断模型。The training N decision trees and the weights corresponding to each decision tree are summarized to obtain the photovoltaic array fault diagnosis model.
  5. 根据权利要求4所述的方法,其特征在于,所述根据所述包外准确率计算得到所述决策树对应的权值,包括计算:The method according to claim 4, wherein the calculating the weight value corresponding to the decision tree according to the out-of-package accuracy rate comprises calculating:
    Figure PCTCN2018101743-appb-100001
    Figure PCTCN2018101743-appb-100001
    其中,w(i)为第i个决策树对应的权值,Hoob(i)为第i个决策树对应的包外准确率,i和j均为参数,1≤i≤N且i为正整数。Among them, w (i) is the weight corresponding to the i-th decision tree, Hoob (i) is the out-of-package accuracy rate corresponding to the i-th decision tree, i and j are parameters, 1≤i≤N and i is positive Integer.
  6. 根据权利要求4所述的方法,其特征在于,所述方法还包括:The method according to claim 4, further comprising:
    将N个所述包外样本集进行汇总得到总包外样本集,对于所述总包外样本集中的每个样本,利用训练得到的所述光伏阵列故障诊断模型对所述样本进行测试得到初始包外准确率;The N out-of-package sample sets are aggregated to obtain a total out-of-package sample set. For each sample in the out-of-package sample set, the samples are tested by using the trained photovoltaic array fault diagnosis model to obtain an initial Out-of-package accuracy
    所述样本共包括K个故障特征,在其中第k个故障特征中加入噪声,并利用所述光伏阵列故障诊断模型重新对所述样本进行测试得到新的包外准确率,计算所述初始包外准确率和新的包外准确率之间的差值确定所述第k个特征的重要性度量值,K为正整数,k为参数且k的起始值为1;The sample includes a total of K fault characteristics, noise is added to the k-th fault characteristic, and the sample is retested using the photovoltaic array fault diagnosis model to obtain a new out-of-package accuracy rate, and the initial package is calculated. The difference between the external accuracy rate and the new out-of-package accuracy rate determines the importance metric of the k-th feature, K is a positive integer, k is a parameter, and the starting value of k is 1;
    在k<K时,令k=k+1,并再次执行所述在其中第k个故障特征中加入噪声的步骤;直至k=K时得到所述K个故障特征的重要性度量值;When k <K, let k = k + 1, and perform the step of adding noise to the k-th fault feature again; until k = K, obtain the importance metric values of the K fault features;
    对于所述训练样本集中的每个样本,删除所述样本的K个故障特征中重要性度量值最小的t个故障特征,得到处理后的训练样本集;For each sample in the training sample set, delete t fault features with the smallest importance metric among the K fault features of the sample to obtain a processed training sample set;
    基于随机森林算法利用处理后的所述训练样本集构建得到光伏阵列故障诊断模型。A photovoltaic array fault diagnosis model is constructed by using the processed training sample set based on a random forest algorithm.
  7. 根据权利要求3所述的方法,其特征在于,所述根据各个所述典型运行状态对应的投票结果得到所述待诊断光伏阵列的故障诊断结果,包括:The method according to claim 3, wherein the obtaining a fault diagnosis result of the photovoltaic array to be diagnosed according to a voting result corresponding to each of the typical operating states comprises:
    若最高的投票数对应的典型运行状态仅包括一个,则将最高的投票数对应的典型运行状态作为所述故障诊断结果输出;If the typical operating state corresponding to the highest number of votes includes only one, outputting the typical operating state corresponding to the highest number of votes as the fault diagnosis result;
    若最高的投票数对应的典型运行状态包括至少两个,则选取权值最高的L个决策树对各个所述典型运行状态重新进行投票,若存在一个典型运行状态获得最多个决策树的选择,则将所述典型运行状态作为所述故障诊断结果输出;否则将权值最高的决策树选择的典型运行状态作为所述故障诊断结果输出,L为正整数且2≤L<N。If the typical running state corresponding to the highest number of votes includes at least two, the L decision trees with the highest weights are selected to vote again for each of the typical running states. If there is a typical running state to obtain the selection of the most decision trees, Then output the typical operating state as the fault diagnosis result; otherwise, output the typical operating state selected by the decision tree with the highest weight as the fault diagnosis result, where L is a positive integer and 2 ≦ L <N.
PCT/CN2018/101743 2018-08-10 2018-08-22 Photovoltaic array fault diagnosis method based on improved random forest algorithm WO2020029327A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
RU2019127993A RU2734017C1 (en) 2018-08-10 2018-08-22 Method of diagnosing faults in photovoltaic arrays, based on an improved random forest algorithm

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810907602.3A CN109039281A (en) 2018-08-10 2018-08-10 A kind of diagnosing failure of photovoltaic array method based on improvement random forests algorithm
CN201810907602.3 2018-08-10

Publications (1)

Publication Number Publication Date
WO2020029327A1 true WO2020029327A1 (en) 2020-02-13

Family

ID=64632631

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/101743 WO2020029327A1 (en) 2018-08-10 2018-08-22 Photovoltaic array fault diagnosis method based on improved random forest algorithm

Country Status (3)

Country Link
CN (1) CN109039281A (en)
RU (1) RU2734017C1 (en)
WO (1) WO2020029327A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111506998A (en) * 2020-04-15 2020-08-07 哈尔滨工业大学 Method for constructing parameter drift fault feature sample library in manufacturing process of electromechanical product
CN112578794A (en) * 2020-12-12 2021-03-30 云南昆船智能装备有限公司 AGV fault detection method based on machine learning, storage medium and system
CN112949198A (en) * 2021-03-10 2021-06-11 阳光电源股份有限公司 Fault detection method and device for photovoltaic string and computer storage medium
CN113159601A (en) * 2021-04-30 2021-07-23 国网湖北省电力有限公司武汉供电公司 Photovoltaic power station operation state analysis method based on DBSCAN clustering algorithm
CN113489457A (en) * 2021-06-01 2021-10-08 厦门科灿信息技术有限公司 Photovoltaic module fault detection method and device
CN113554177A (en) * 2020-04-02 2021-10-26 北京航空航天大学 Satellite power supply system autonomous fault diagnosis method based on soft decision
CN113552443A (en) * 2021-06-08 2021-10-26 广西大学 Hybrid connection high-voltage line fault identification method based on alternating current-direct current pulse and random forest
CN113625108A (en) * 2021-08-02 2021-11-09 四川轻化工大学 Flexible direct current power distribution network fault identification method
CN114022782A (en) * 2021-11-01 2022-02-08 中国石油大学(华东) Sea fog detection method based on MODIS satellite data
CN115940806A (en) * 2023-01-10 2023-04-07 柏腾数科(杭州)科技有限公司 Photovoltaic power generation system and fault diagnosis method thereof
CN116910668A (en) * 2023-09-11 2023-10-20 国网浙江省电力有限公司余姚市供电公司 Lightning arrester fault early warning method, device, equipment and storage medium
CN117235617A (en) * 2023-11-10 2023-12-15 兰州理工大学 ML-RFKNN-based photovoltaic array fault diagnosis method in sand and dust weather

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111245365B (en) * 2020-01-09 2020-12-15 南京工业大学 Photovoltaic module fault diagnosis method based on dynamic weighted depth forest
CN112818587B (en) * 2021-01-08 2023-03-21 北京理工大学 Photovoltaic array reconstruction method based on divide-and-conquer Q-learning algorithm
CN114336742B (en) * 2021-12-22 2023-10-27 浙江大学 Solar photovoltaic controller based on self-adaptive neural network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104753461A (en) * 2015-04-10 2015-07-01 福州大学 Method for diagnosing and classifying faults of photovoltaic power generation arrays on basis of particle swarm optimization support vector machines
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
US20180131190A1 (en) * 2016-11-08 2018-05-10 Sunpower Corporation Energy flow prediction for electric systems including photovoltaic solar systems
CN108062571A (en) * 2017-12-27 2018-05-22 福州大学 Diagnosing failure of photovoltaic array method based on differential evolution random forest grader

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5196607B2 (en) * 2011-06-10 2013-05-15 株式会社システム・ジェイディー Fault diagnosis method and diagnostic object
JP6209412B2 (en) * 2013-09-27 2017-10-04 株式会社日立製作所 Fault diagnosis system and fault diagnosis method for photovoltaic power generation system
RU2582064C1 (en) * 2014-12-16 2016-04-20 Общество с ограниченной ответственностью "Аби Девелопмент" Methods and systems for effective automatic recognition of symbols using forest solutions
CN106997669B (en) * 2017-05-31 2020-06-09 青岛大学 Method for judging traffic jam cause based on feature importance
CN108510507A (en) * 2018-03-27 2018-09-07 哈尔滨理工大学 A kind of 3D vertebra CT image active profile dividing methods of diffusion-weighted random forest

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104753461A (en) * 2015-04-10 2015-07-01 福州大学 Method for diagnosing and classifying faults of photovoltaic power generation arrays on basis of particle swarm optimization support vector machines
US20180131190A1 (en) * 2016-11-08 2018-05-10 Sunpower Corporation Energy flow prediction for electric systems including photovoltaic solar systems
CN106961249A (en) * 2017-03-17 2017-07-18 广西大学 A kind of diagnosing failure of photovoltaic array and method for early warning
CN108062571A (en) * 2017-12-27 2018-05-22 福州大学 Diagnosing failure of photovoltaic array method based on differential evolution random forest grader

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113554177A (en) * 2020-04-02 2021-10-26 北京航空航天大学 Satellite power supply system autonomous fault diagnosis method based on soft decision
CN113554177B (en) * 2020-04-02 2023-10-03 北京航空航天大学 Satellite power supply system autonomous fault diagnosis method based on soft decision
CN111506998A (en) * 2020-04-15 2020-08-07 哈尔滨工业大学 Method for constructing parameter drift fault feature sample library in manufacturing process of electromechanical product
CN111506998B (en) * 2020-04-15 2022-02-08 哈尔滨工业大学 Method for constructing parameter drift fault feature sample library in manufacturing process of electromechanical product
CN112578794A (en) * 2020-12-12 2021-03-30 云南昆船智能装备有限公司 AGV fault detection method based on machine learning, storage medium and system
CN112578794B (en) * 2020-12-12 2023-09-01 云南昆船智能装备有限公司 AGV fault detection method, storage medium and system based on machine learning
CN112949198A (en) * 2021-03-10 2021-06-11 阳光电源股份有限公司 Fault detection method and device for photovoltaic string and computer storage medium
CN112949198B (en) * 2021-03-10 2024-04-12 阳光电源股份有限公司 Method and device for detecting faults of photovoltaic string and computer storage medium
CN113159601A (en) * 2021-04-30 2021-07-23 国网湖北省电力有限公司武汉供电公司 Photovoltaic power station operation state analysis method based on DBSCAN clustering algorithm
CN113489457B (en) * 2021-06-01 2022-08-16 厦门科华数能科技有限公司 Photovoltaic module fault detection method and device
CN113489457A (en) * 2021-06-01 2021-10-08 厦门科灿信息技术有限公司 Photovoltaic module fault detection method and device
CN113552443A (en) * 2021-06-08 2021-10-26 广西大学 Hybrid connection high-voltage line fault identification method based on alternating current-direct current pulse and random forest
CN113625108A (en) * 2021-08-02 2021-11-09 四川轻化工大学 Flexible direct current power distribution network fault identification method
CN114022782A (en) * 2021-11-01 2022-02-08 中国石油大学(华东) Sea fog detection method based on MODIS satellite data
CN114022782B (en) * 2021-11-01 2024-05-10 中国石油大学(华东) Sea fog detection method based on MODIS satellite data
CN115940806A (en) * 2023-01-10 2023-04-07 柏腾数科(杭州)科技有限公司 Photovoltaic power generation system and fault diagnosis method thereof
CN115940806B (en) * 2023-01-10 2023-05-23 柏腾数科(杭州)科技有限公司 Photovoltaic power generation system and fault diagnosis method thereof
CN116910668A (en) * 2023-09-11 2023-10-20 国网浙江省电力有限公司余姚市供电公司 Lightning arrester fault early warning method, device, equipment and storage medium
CN116910668B (en) * 2023-09-11 2024-04-02 国网浙江省电力有限公司余姚市供电公司 Lightning arrester fault early warning method, device, equipment and storage medium
CN117235617A (en) * 2023-11-10 2023-12-15 兰州理工大学 ML-RFKNN-based photovoltaic array fault diagnosis method in sand and dust weather

Also Published As

Publication number Publication date
RU2734017C1 (en) 2020-10-12
CN109039281A (en) 2018-12-18

Similar Documents

Publication Publication Date Title
WO2020029327A1 (en) Photovoltaic array fault diagnosis method based on improved random forest algorithm
WO2020029344A1 (en) Random forest algorithm-based photovoltaic array fault diagnosis method
Dhimish et al. Comparing Mamdani Sugeno fuzzy logic and RBF ANN network for PV fault detection
Spataru et al. Diagnostic method for photovoltaic systems based on light I–V measurements
Dhimish et al. Novel open-circuit photovoltaic bypass diode fault detection algorithm
CN108805256A (en) Photovoltaic module method for diagnosing faults based on cuckoo algorithm and BP neural network
Reddy et al. A MATLAB based PV module models analysis under conditions of nonuniform irradiance
Chine et al. ANN-based fault diagnosis technique for photovoltaic stings
da Costa et al. A comparison of machine learning-based methods for fault classification in photovoltaic systems
Djalab et al. Robust method for diagnosis and detection of faults in photovoltaic systems using artificial neural networks
CN113708726B (en) Photovoltaic array fault discrimination method based on photovoltaic module voltage real-time calculation comparison
Zaki et al. Detection and localization the open and short circuit faults in PV system: A MILP approach
Aallouche et al. Online fault detection and identification for an isolated PV system using ANN
Nilsson Fault detection in photovoltaic systems
Guerriero et al. Innovative algorithm for true maximum detection based on individual PV panel sensor network
Al-Kashashnehand et al. Wireless Sensor Network Based Real-Time Monitoring and Fault Detection for Photovoltaic Systems
CN113872526B (en) Photovoltaic array fault diagnosis method based on minimum mismatch fault current prediction
Hachana et al. Fault diagnosis of building integrated PV generator: a metaheuristic technique
Gong et al. Fault diagnosis method of photovoltaic array based on random forest algorithm
Sapountzoglou et al. A grid connected pv system fault diagnosis method
Ghazali et al. A multi-scale dual-stage model for PV array fault detection, classification, and monitoring technique
CN113919426A (en) Photovoltaic array fault identification method based on k-medoids algorithm
TW202207617A (en) Photovoltaic array fault diagnosing method based on electrical timing waveform
Wang et al. Fault diagnosis method based on an improved KNN algorithm for PV strings
Dhimish et al. Fault detection algorithm for multiple GCPV array configurations

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18929821

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18929821

Country of ref document: EP

Kind code of ref document: A1