CN110795690A - Wind power plant operation abnormal data detection method - Google Patents

Wind power plant operation abnormal data detection method Download PDF

Info

Publication number
CN110795690A
CN110795690A CN201911018724.8A CN201911018724A CN110795690A CN 110795690 A CN110795690 A CN 110795690A CN 201911018724 A CN201911018724 A CN 201911018724A CN 110795690 A CN110795690 A CN 110795690A
Authority
CN
China
Prior art keywords
data
wind speed
wind
power
turbine generator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911018724.8A
Other languages
Chinese (zh)
Inventor
陈子新
刘永前
王金山
丛智慧
李硕
马亮
韩爽
阎洁
李莉
张�浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DATANG (CHIFENG) NEW ENERGY Co Ltd
Original Assignee
DATANG (CHIFENG) NEW ENERGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DATANG (CHIFENG) NEW ENERGY Co Ltd filed Critical DATANG (CHIFENG) NEW ENERGY Co Ltd
Priority to CN201911018724.8A priority Critical patent/CN110795690A/en
Publication of CN110795690A publication Critical patent/CN110795690A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Business, Economics & Management (AREA)
  • Economics (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Software Systems (AREA)
  • Tourism & Hospitality (AREA)
  • Computing Systems (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • General Health & Medical Sciences (AREA)
  • Algebra (AREA)
  • Water Supply & Treatment (AREA)
  • Databases & Information Systems (AREA)
  • General Business, Economics & Management (AREA)
  • Public Health (AREA)
  • Probability & Statistics with Applications (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Wind Motors (AREA)
  • Control Of Eletrric Generators (AREA)

Abstract

A method for detecting abnormal operation data of a wind power plant is provided, which belongs to the technical field of wind power and comprises the following steps: collecting operating data, operating environment parameters and wind turbine generator parameters of a wind turbine generator; performing data preliminary processing according to the power characteristics of the wind turbine generator; removing discrete abnormal data points of the wind turbine generator operation by adopting a quartile method; introducing pitch angle, wind direction, temperature and humidity data at corresponding moments into wind speed and power operation data of an original unit to form multidimensional data, sorting the multidimensional data according to the wind speed, and dividing the multidimensional data into an inter-cell data set at a wind speed interval of a first threshold; normalizing the data set of each wind speed cell; and taking the normalized data in each wind speed interval as the input of a multi-dimensional data clustering model, and detecting abnormal data in the multi-dimensional data clustering model. The method solves the technical problem of low accuracy rate of identifying abnormal data.

Description

Wind power plant operation abnormal data detection method
Technical Field
The invention relates to the technical field of wind power, in particular to a method for detecting abnormal operation data of a wind turbine generator.
Background
The wind power curve is one of the main curves describing the corresponding relation between wind speed and power output and reflecting the performance of the wind turbine. The equivalent power curve of the wind turbine can be used for evaluating the performance and the operation condition of the wind turbine and the wind power plant, and in addition, the data are important for wind power prediction and directly influence the final precision of a prediction result, so that the formulation of a power system scheduling plan is influenced. The equivalent power curve of the wind turbine generator set is determined to have high requirements on wind speed and power quality, but due to sensor precision, electromagnetic interference, information processing errors, storage or communication faults and wind power station wind abandoning and electricity limiting, a large amount of abnormal data exist in original recorded data, and the integral distribution rule and the corresponding relation of the wind speed and the power are seriously damaged. If the data are directly used without being processed, the analysis of the operation rule of the wind turbine generator is influenced to destroy the operation rule of the wind turbine generator, and further, the performance analysis of the wind turbine generator and the prediction precision of the power of the wind turbine generator are influenced.
The identification and cleaning of the operating power data of the wind turbine generator becomes a necessary step in the research of the wind turbine generator, and the cleaning of the abnormal operating data of the existing wind turbine generator can be divided into three types in principle: identifying anomalous data based on the density or distance of the data points; establishing a mathematical model of a wind power curve to identify abnormal data; and identifying abnormal data according to the position distribution characteristics of the abnormal data. Research on the existing method shows that abnormal data are commonly identified according to data point density or distance in practical application, and the method mainly comprises a quartile method, a clustering method, an intra-group optimal variance method, a 3 sigma method and the like. The method is simple to operate and good in effect when identifying the abnormal data of the wind turbine generator set, but the method is limited in effect when identifying the situations that the normal data and the abnormal data of the wind turbine generator set are equivalent, namely the intensive accumulation type abnormal data.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a method for detecting abnormal operation data of a wind power plant, which comprises the following steps:
step 1: collecting operating data, operating environment parameters and wind turbine generator parameters of a wind turbine generator;
step 2: performing data primary processing on the power and wind speed data according to the power characteristics of the wind turbine generator;
and step 3: identifying and processing the wind speed and power abnormal data after primary processing by adopting a quartile method, and clearing discrete abnormal data points of the wind turbine generator;
and 4, step 4: after discrete abnormal data points of the wind turbine generator during operation are eliminated, introducing pitch angle, wind direction, temperature and humidity data at corresponding moments into wind speed and power operation data of an original wind turbine generator to form multidimensional data, sequencing the multidimensional data according to the wind speed, and dividing the multidimensional data into a data set between cells at a wind speed interval of a first threshold;
and 5: normalizing the data set of each wind speed cell;
step 6: and taking the normalized data in each wind speed interval as the input of a multi-dimensional data clustering model, and detecting abnormal data in the multi-dimensional data clustering model.
According to one aspect of the invention, the wind turbine operating data in step 1 includes wind speed, wind direction, power and pitch angle, the operating environment parameters include temperature and humidity, and the wind turbine parameters include cut-in wind speed, cut-out wind speed and turbine rated power.
According to an aspect of the invention, the step 2 comprises:
step 21: clearing data that the output power of the wind turbine is larger than zero when the wind speed is smaller than the cut-in wind speed; and
step 22: and clearing data of which the power is less than or equal to zero when the wind speed is greater than the cut-in wind speed and less than the cut-out wind speed.
According to an aspect of the invention, said step 3 comprises:
step 31: clearing wind speed abnormal data by a quartile method, comprising the following steps: sorting the data from large power to small power, dividing the data into cells by a power interval unit of a second threshold value, and removing abnormal values of wind speed in the cells by a quartile method;
step 32: clearing power anomaly data by a quartile method, comprising: sorting the data processed in the step 31 according to the wind speed, dividing the data into cells at a wind speed interval of a third threshold value, and removing the abnormal power value in the cells by a quartile method.
According to an aspect of the invention, the first threshold, the second threshold and the third threshold are respectively: 0.5m/s, 25KW and 0.5 m/s.
According to an aspect of the present invention, the normalization in step 5 is performed by:
Figure BDA0002246514130000021
wherein XiFor the data set of the ith wind speed cell,
Figure BDA0002246514130000022
the data is normalized; x is the original data of the selected variable in the interval; xminIs the minimum value of the selected variable within the interval; xmaxThe maximum value of the variable selected within the interval.
According to one aspect of the invention, said step 6 comprises:
step 61: given a wind speed inter-cell dataset X containing n samplesi={x1,x2,…,xnD-dimensional real feature vectors are taken as each sample;
step 62: constructing a similarity matrix S epsilon R according to input sample datan×nWherein the similarity matrix Sij=Wij,WijIs a contiguous matrix representing the ithA weight vector between the sample and the jth sample;
Figure BDA0002246514130000031
where σ is a parameter controlling the propagation of the domain, where the element Sij=Sji
And step 63: according to the adjacency matrix W, constructing a degree matrix D:
Figure BDA0002246514130000032
wherein:
Figure BDA0002246514130000033
step 64: calculating a Laplace matrix:
L=D-W
constructing a standardized laplacian matrix:
L=D-1/2LD-1/2
step 65: calculating all eigenvalues and eigenvectors of a Laplace matrix L, wherein L has n real eigenvalues greater than or equal to 0, i.e. 0 ═ λ1≤λ2≤…≤λnThe corresponding feature vector is α12,…,αn
And step 66: calculating eigenvalue interval value { C1,C2,…,Cn-2Get C outimax=max{C1,C2,…,Cn-2Let p be argCimax
Figure BDA0002246514130000034
Step 67, the first k eigenvectors are selected and k is n-p-1, i.e., f is α12,…,αn-p-1};
Step 68: standardizing a matrix formed by the selected characteristic vectors F according to rows to finally form an n multiplied by k dimensional characteristic matrix F;
Figure BDA0002246514130000035
step 68: clustering the selected K-dimensional matrix F, wherein each row of data is a data sample, the clustering number is K, and the clustering algorithm is K-means to obtain cluster division { U1,U2,…,UkThe cluster center is { w }1,w2,…,wk}。
Step 69: clustering the data in each wind speed interval by the same steps to find out each { w1,w2,…,wkCenter of maximum cluster wmaxCorresponding UmaxThe data in the rest data clusters are abnormal data and are cleared.
And 6, the clustering method in the multi-dimensional data clustering model uses spectral clustering. And by calculating the distance between characteristic values of the Laplace matrix in the spectral clustering process, the defect that the clustering category number needs to be input in advance is solved in an optimized mode, so that the value of a clustering parameter k can be automatically determined in the clustering process, the clustering accuracy is improved, and the operation is simplified.
Compared with the prior art, the invention has the following effects:
1. the principle is reasonable, and the identification efficiency is high. Due to wind abandoning and electricity limiting or communication faults, a large amount of accumulation type abnormal data often appear in the operation data of the wind turbine generator in certain specific wind speed intervals, the normal data and the abnormal data are equivalent in quantity, and the existing research is not ideal for identifying the data. Aiming at the defect, after the dispersion abnormal data are removed by using a quartile method, environmental factors and other operation data are introduced into a wind speed interval, so that more information of the abnormal data is provided, data identification dimensionality is increased, a wind power plant operation abnormal data identification model based on multidimensional clustering is established, and the accumulation type abnormal data identification effect is improved. Therefore, the method can not only clear the conventional scattered abnormal data points, but also effectively identify the abnormal data clusters.
2. The process is simple and the universality is good. The method does not depend on training and learning by using a normal data set, has higher automatic processing capability and stronger universality, and can meet the data processing requirements of different types and different data volume abnormal values of the wind turbine generator and the wind power plant. In addition, aiming at the defect that the input parameter k value in each wind turbine generator set clustering method cannot be determined in advance in the existing method, the invention adopts a multidimensional spectral clustering method for automatically determining the clustering parameter k value, thereby simplifying the operation steps.
Drawings
FIG. 1 is a block diagram of the overall steps of a wind farm abnormal operation data detection method based on multidimensional clustering according to an embodiment.
FIG. 2 is a detailed flowchart of the multidimensional clustering model in the step of identifying abnormal data of the wind turbine generator according to one embodiment.
Detailed Description
The technical solution of the present invention will be described in further detail with reference to the accompanying fig. 1-2 and the specific embodiments.
As shown in FIG. 1, the invention discloses a method for detecting abnormal operation data of a wind power plant.
Step 1: the method comprises the steps of collecting operation data, operation environment parameters and wind turbine generator parameters of the wind turbine generator. The wind turbine generator system operating data comprises wind speed, wind direction, power and pitch angle, the operating environment parameters comprise temperature and humidity, and the wind turbine generator system parameters comprise cut-in wind speed, cut-out wind speed and set rated power.
Step 2: and performing data preliminary processing on the power and wind speed data according to the power characteristics of the wind turbine generator.
And step 3: and identifying and processing the wind speed and power abnormal data after preliminary processing by adopting a quartile method, and clearing discrete abnormal data points of the wind turbine generator. In order to remove the conventional scattered abnormal data points of the unit operation, the abnormal data is firstly identified and processed by the wind speed and power data which are primarily processed by adopting a quartile method.
And 4, step 4: after discrete abnormal data points of the wind turbine generator during operation are removed, pitch angle, wind direction, temperature and humidity data at corresponding moments are introduced into wind speed and power operation data of an original wind turbine generator to form multidimensional data, the multidimensional data are sorted according to the wind speed, and are divided into data sets among cells according to the wind speed interval of a first threshold value.
According to one embodiment, as shown in FIG. 1, the data sets between cells are divided into wind speed intervals with a first threshold of 0.5m/s, and the resulting data sets are as follows:
Figure BDA0002246514130000051
wherein XiThe data set of the ith wind speed interval is shown as v, wind speed, d, wind direction, p, power, α, pitch angle, t and h, the temperature and humidity of the environment at the corresponding moment, m is the number of wind speed intervals, and n is the data volume of the ith wind speed interval.
And 5: and carrying out normalization processing on the data set of each wind speed cell.
According to one embodiment, the normalization process is performed by:
Figure BDA0002246514130000052
wherein
Figure BDA0002246514130000053
The data is normalized; x is the original data of the selected variable in the interval; xminIs the minimum value of the selected variable within the interval; xmaxThe maximum value of the variable selected within the interval.
Step 6: and taking the normalized data in each wind speed interval as the input of a multi-dimensional data clustering model, and detecting abnormal data in the multi-dimensional data clustering model. Since the multidimensional clustering method uses dimension reduction processing in processing multidimensional clustering, it is good for the conventional clustering method in processing high-dimensional clustering complexity, and according to one embodiment, the clustering method in the present application uses a spectral clustering model.
The preliminary treatment in the step 2 specifically includes:
step 21: and clearing data that the output power of the group is larger than zero when the wind speed is smaller than the cut-in wind speed. When the wind speed is lower than the cut-in wind speed, the output power of the unit is less than or equal to zero, so the data with the power greater than zero are abnormal data, and the data are often directly cleared due to communication faults. In order to not affect the overall data characteristics of the v-p data and facilitate subsequent processing of abnormal data, the subsequent processing can not consider the data.
Step 22: and clearing data of which the power is less than or equal to zero when the wind speed is greater than the cut-in wind speed and less than the cut-out wind speed. And when the wind speed is greater than the cut-in wind speed and less than the cut-out wind speed, the data with the power less than or equal to zero is regarded as abnormal data of the unit shutdown, and the abnormal data are directly cleared.
The step 3 specifically comprises the following steps:
step 31: and clearing wind speed abnormal data by a quartile method. The method specifically comprises the following steps: the data are sorted from high power to low power, divided into small regions by a second threshold value, for example, a power interval unit of 25KW, and an abnormal value of the wind speed in the region is removed by a quartile method. When the power is higher than the rated power, the wind speed data are concentrated and less, normal data are mainly distributed on the right side, namely a wind speed large area, and therefore the identification of abnormal values does not set an upper limit.
Figure BDA0002246514130000061
Wherein Q3Wind speed quartile 3, Q1Is the 1 st quartile, prThe rated power of the wind turbine generator is obtained. I isQRFor a quartile range, the following is calculated:
IQR=Q3-Q1
step 32: and clearing the power abnormal data by a quartile method. The method specifically comprises the following steps: the data processed in step 31 are sorted according to wind speed, the data are divided into small sections at intervals of a third threshold value, for example, the wind speed is 0.5m/s, and abnormal power values in the sections are removed by a quartile method. Because abnormal data are distributed more under the power curve, the amount of normal data and abnormal data is possibly equivalent, and the normal data are prevented from being deleted by mistake, so that the upper limit is not set when the abnormal value is identified.
P<Q3-1.5IQR,v>vin
The step 6 specifically comprises the following steps:
step 61: given a wind speed inter-cell dataset X containing n samplesi={x1,x2,…,xnEach sample is a d-dimensional real feature vector, where d is 5.
Step 62: constructing a similarity matrix S epsilon R according to input sample datan×nWherein the similarity matrix Sij=Wij,WijIs a adjacency matrix, representing the weight vector between the ith sample and the jth sample.
Figure BDA0002246514130000062
Where σ is a parameter controlling the propagation of the domain, where the element Sij=Sji
And step 63: according to the adjacency matrix W, constructing a degree matrix D:
Figure BDA0002246514130000063
wherein:
Figure BDA0002246514130000064
step 64: calculating a Laplace matrix:
L=D-W
constructing a standardized laplacian matrix:
L=D-1/2LD-1/2
step 65: calculating all eigenvalues and eigenvectors of a Laplace matrix L, wherein L has n real eigenvalues greater than or equal to 0, i.e. 0 ═ λ1≤λ2≤…≤λnThe corresponding feature vector is α12,…,αn. And in the calculation of the later step, the characteristic value 0 and the corresponding characteristic vector are ignored, and no practical significance is realized in the calculation.
And step 66: calculating eigenvalue interval value { C1,C2,…,Cn-2Get C outimax=max{C1,C2,…,Cn-2Let p be argCimaxThe symbol p of this formula is not power, and represents a numerical value.
Figure BDA0002246514130000071
Step 67, the first k eigenvectors are selected and k is n-p-1, i.e., f is α12,…,αn-p-1P in this formula is not power, and represents the value in step 66.
Step 68: and normalizing the matrix formed by the selected feature vectors F according to rows to finally form the feature matrix F with the dimension of n multiplied by k.
Figure BDA0002246514130000072
Step 68: clustering the selected K-dimensional matrix F, wherein each row of data is a data sample, the clustering number is K, and the clustering algorithm is K-means to obtain cluster division { U1,U2,…,UkThe cluster center is { w }1,w2,…,wk}。
Step 69: clustering the data in each wind speed interval by the same steps to find out each { w1,w2,…,wkCenter of maximum cluster wmaxCorresponding UmaxThe data in the rest data clusters are abnormal data and are cleared.
According to the invention, environmental factors and other unit operation parameters are introduced into the existing wind speed and power data, model modeling dimensions are increased, an abnormal data identification model based on a multi-dimensional clustering method is further established, a clustering parameter k value is automatically determined by optimizing a clustering algorithm, the operation is simplified, and the accumulation type abnormal data identification effect is improved. The method considers the influence of various factors on the power of the wind turbine generator, and combines a quartile method to establish a multidimensional clustering model to identify abnormal operation data of the wind turbine generator, but the traditional clustering method has low data dimension and low accuracy rate of identifying abnormal data. The abnormal data model based on multi-dimensional clustering identification provided by the invention can automatically determine the category number, and the vast majority of clustering methods have the defect that the k value of the clustering parameter in each wind turbine generator set needs to be input in advance. Therefore, the method is simple to operate and high in calculation efficiency in practice.
The embodiments set forth in the foregoing description are exemplary only, and modifications may be made therein by those skilled in the art without departing from the spirit of the present application, and such modifications are intended to be within the scope of the present application.

Claims (7)

1. A wind power plant operation abnormal data detection method is characterized by comprising the following steps:
step 1: collecting operating data, operating environment parameters and wind turbine generator parameters of a wind turbine generator;
step 2: performing data primary processing on the power and wind speed data according to the power characteristics of the wind turbine generator;
and step 3: identifying and processing the wind speed and power abnormal data after primary processing by adopting a quartile method, and clearing discrete abnormal data points of the wind turbine generator;
and 4, step 4: after discrete abnormal data points of the wind turbine generator during operation are eliminated, introducing pitch angle, wind direction, temperature and humidity data at corresponding moments into wind speed and power operation data of an original wind turbine generator to form multidimensional data, sequencing the multidimensional data according to the wind speed, and dividing the multidimensional data into a data set between cells at a wind speed interval of a first threshold;
and 5: normalizing the data set of each wind speed cell;
step 6: and taking the normalized data in each wind speed interval as the input of a multi-dimensional data clustering model, and detecting abnormal data in the multi-dimensional data clustering model.
2. The method of claim 1, wherein:
the wind turbine generator operation data in the step 1 comprise wind speed, wind direction, power and pitch angle, the operation environment parameters comprise temperature and humidity, and the wind turbine generator parameters comprise cut-in wind speed, cut-out wind speed and set rated power.
3. The method of claim 1, wherein:
the step 2 comprises the following steps:
step 21: clearing data that the output power of the wind turbine is larger than zero when the wind speed is smaller than the cut-in wind speed; and
step 22: and clearing data of which the power is less than or equal to zero when the wind speed is greater than the cut-in wind speed and less than the cut-out wind speed.
4. The method of claim 1, wherein:
the step 3 comprises the following steps:
step 31: clearing wind speed abnormal data by a quartile method, comprising the following steps: sorting the data from large power to small power, dividing the data into cells by a power interval unit of a second threshold value, and removing abnormal values of wind speed in the cells by a quartile method;
step 32: clearing power anomaly data by a quartile method, comprising: sorting the data processed in the step 31 according to the wind speed, dividing the data into cells at a wind speed interval of a third threshold value, and removing the abnormal power value in the cells by a quartile method.
5. The method of claim 4, wherein:
the first threshold, the second threshold and the third threshold are respectively: 0.5m/s, 25KW and 0.5 m/s.
6. The method of claim 1, wherein:
the normalization processing mode in the step 5 is as follows:
Figure FDA0002246514120000021
wherein XiFor the data set of the ith wind speed cell,
Figure FDA0002246514120000022
the data is normalized; x is the original data of the selected variable in the interval; xminIs the minimum value of the selected variable within the interval; xmaxThe maximum value of the variable selected within the interval.
7. The method of claim 1, wherein:
the step 6 comprises the following steps:
step 61: given a wind speed inter-cell dataset X containing n samplesi={x1,x2,…,xnD-dimensional real feature vectors are taken as each sample;
step 62: constructing a similarity matrix S epsilon R according to input sample datan×nWherein the similarity matrix Sij=Wij,WijRepresenting a weight vector between the ith sample and the jth sample for the adjacency matrix;
Figure FDA0002246514120000023
where σ is a parameter controlling the propagation of the domain, where the element Sij=Sji
And step 63: according to the adjacency matrix W, constructing a degree matrix D:
Figure FDA0002246514120000024
wherein:
Figure FDA0002246514120000025
step 64: calculating a Laplace matrix:
L=D-W
constructing a standardized laplacian matrix:
L=D-1/2LD-1/2
step 65: calculating all eigenvalues and eigenvalues of the Laplace matrix LVector calculating all eigenvalues and eigenvectors of matrix L, where L has n real eigenvalues greater than or equal to 0, i.e. 0 ═ λ1≤λ2≤…≤λnThe corresponding feature vector is α12,…,αn
And step 66: calculating eigenvalue interval value { C1,C2,…,Cn-2Get C outimax=max{C1,C2,…,Cn-2Let p be argCimax
Figure FDA0002246514120000031
Step 67, the first k eigenvectors are selected and k is n-p-1, i.e., f is α12,…,αn-p-1};
Step 68: standardizing a matrix formed by the selected characteristic vectors F according to rows to finally form an n multiplied by k dimensional characteristic matrix F;
step 68: clustering the selected K-dimensional matrix F, wherein each row of data is a data sample, the clustering number is K, and the clustering algorithm is K-means to obtain cluster division { U1,U2,…,UkThe cluster center is { w }1,w2,…,wk}。
Step 69: clustering the data in each wind speed interval by the same steps to find out each { w1,w2,…,wkCenter of maximum cluster wmaxCorresponding UmaxThe data in the rest data clusters are abnormal data and are cleared.
CN201911018724.8A 2019-10-24 2019-10-24 Wind power plant operation abnormal data detection method Pending CN110795690A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911018724.8A CN110795690A (en) 2019-10-24 2019-10-24 Wind power plant operation abnormal data detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911018724.8A CN110795690A (en) 2019-10-24 2019-10-24 Wind power plant operation abnormal data detection method

Publications (1)

Publication Number Publication Date
CN110795690A true CN110795690A (en) 2020-02-14

Family

ID=69441152

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911018724.8A Pending CN110795690A (en) 2019-10-24 2019-10-24 Wind power plant operation abnormal data detection method

Country Status (1)

Country Link
CN (1) CN110795690A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111881617A (en) * 2020-07-02 2020-11-03 上海电气风电集团股份有限公司 Data processing method, and performance evaluation method and system of wind generating set
CN112507616A (en) * 2020-12-03 2021-03-16 重庆大学 Wind power plant flagship fan optimization selection method based on spectral clustering
CN112682269A (en) * 2020-12-21 2021-04-20 浙江工业大学 Wind turbine generator state monitoring method based on OC-RKELM
CN113569399A (en) * 2021-07-20 2021-10-29 华能四平风力发电有限公司 Wind turbine generator operation data processing method, system, equipment and storage medium
CN114969017A (en) * 2022-07-28 2022-08-30 深圳量云能源网络科技有限公司 Wind power data cleaning method, cleaning device and prediction method
CN115150295A (en) * 2022-07-05 2022-10-04 中国银行股份有限公司 Method and device for detecting working condition data
CN115478991A (en) * 2022-09-21 2022-12-16 中节能风力发电股份有限公司 Method, equipment and medium for detecting abnormal form of power curve of wind turbine generator

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239964A (en) * 2014-08-18 2014-12-24 华北电力大学 Ultra-short-period wind speed prediction method based on spectral clustering type and genetic optimization extreme learning machine
CN108412710A (en) * 2018-01-30 2018-08-17 同济大学 A kind of Wind turbines wind power data cleaning method
CN109783486A (en) * 2019-01-17 2019-05-21 华北电力大学 Data cleaning method, device and server
CN109918364A (en) * 2019-02-28 2019-06-21 华北电力大学 A kind of data cleaning method based on two-dimensional probability density estimation and quartile method
CN110134919A (en) * 2019-04-30 2019-08-16 华北电力大学 Wind turbines abnormal data cleaning method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239964A (en) * 2014-08-18 2014-12-24 华北电力大学 Ultra-short-period wind speed prediction method based on spectral clustering type and genetic optimization extreme learning machine
CN108412710A (en) * 2018-01-30 2018-08-17 同济大学 A kind of Wind turbines wind power data cleaning method
CN109783486A (en) * 2019-01-17 2019-05-21 华北电力大学 Data cleaning method, device and server
CN109918364A (en) * 2019-02-28 2019-06-21 华北电力大学 A kind of data cleaning method based on two-dimensional probability density estimation and quartile method
CN110134919A (en) * 2019-04-30 2019-08-16 华北电力大学 Wind turbines abnormal data cleaning method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
卢鹏丽 等: "一种自动确定特征向量与类别数目的谱聚类算法" *
王昊: "风力发电功率曲线非线性建模研究" *
赵永宁 等: "风电场弃风异常数据簇的特征及处理方法" *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111881617A (en) * 2020-07-02 2020-11-03 上海电气风电集团股份有限公司 Data processing method, and performance evaluation method and system of wind generating set
CN111881617B (en) * 2020-07-02 2024-03-26 上海电气风电集团股份有限公司 Data processing method, performance evaluation method and system of wind generating set
CN112507616A (en) * 2020-12-03 2021-03-16 重庆大学 Wind power plant flagship fan optimization selection method based on spectral clustering
CN112507616B (en) * 2020-12-03 2023-02-03 重庆大学 Wind power plant flagship fan optimization selection method based on spectral clustering
CN112682269A (en) * 2020-12-21 2021-04-20 浙江工业大学 Wind turbine generator state monitoring method based on OC-RKELM
CN113569399A (en) * 2021-07-20 2021-10-29 华能四平风力发电有限公司 Wind turbine generator operation data processing method, system, equipment and storage medium
CN115150295A (en) * 2022-07-05 2022-10-04 中国银行股份有限公司 Method and device for detecting working condition data
CN115150295B (en) * 2022-07-05 2024-04-09 中国银行股份有限公司 Method and device for detecting working condition data
CN114969017A (en) * 2022-07-28 2022-08-30 深圳量云能源网络科技有限公司 Wind power data cleaning method, cleaning device and prediction method
CN115478991A (en) * 2022-09-21 2022-12-16 中节能风力发电股份有限公司 Method, equipment and medium for detecting abnormal form of power curve of wind turbine generator

Similar Documents

Publication Publication Date Title
CN110795690A (en) Wind power plant operation abnormal data detection method
CN109871860B (en) Daily load curve dimension reduction clustering method based on kernel principal component analysis
CN112699913A (en) Transformer area household variable relation abnormity diagnosis method and device
CN111444615B (en) Photovoltaic array fault diagnosis method based on K nearest neighbor and IV curve
CN110674120A (en) Wind power plant data cleaning method and device
CN114114039B (en) Method and device for evaluating consistency of single battery cells of battery system
CN116522268B (en) Line loss anomaly identification method for power distribution network
CN110717610A (en) Wind power prediction method based on data mining
CN114861788A (en) Load abnormity detection method and system based on DBSCAN clustering
CN113570200A (en) Power grid operation state monitoring method and system based on multidimensional information
CN116148753A (en) Intelligent electric energy meter operation error monitoring system
CN113125903A (en) Line loss anomaly detection method, device, equipment and computer-readable storage medium
CN115021679A (en) Photovoltaic equipment fault detection method based on multi-dimensional outlier detection
CN117113235A (en) Cloud computing data center energy consumption optimization method and system
CN110427019B (en) Industrial process fault classification method and control device based on multivariate discriminant analysis
CN112288157A (en) Wind power plant power prediction method based on fuzzy clustering and deep reinforcement learning
CN110084301B (en) Hidden Markov model-based multi-working-condition process working condition identification method
CN110020680B (en) PMU data classification method based on random matrix theory and fuzzy C-means clustering algorithm
CN107274025B (en) System and method for realizing intelligent identification and management of power consumption mode
CN114167730B (en) Method, computer and storage medium for evaluating overall health state of blower of thermal power generating unit
CN113595242B (en) Non-invasive load identification method based on depth CNN-HMM
CN115017988A (en) Competitive clustering method for state anomaly diagnosis
CN116150687A (en) Fluid pipeline leakage identification method based on multi-classification G-WLSTSVM model
CN115169654A (en) Short-term power load prediction method based on unsupervised clustering and support vector machine
CN113554079A (en) Electric power load abnormal data detection method and system based on secondary detection method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200214