WO2021027294A1 - Method and apparatus for improving wind power system data quality - Google Patents

Method and apparatus for improving wind power system data quality Download PDF

Info

Publication number
WO2021027294A1
WO2021027294A1 PCT/CN2020/082958 CN2020082958W WO2021027294A1 WO 2021027294 A1 WO2021027294 A1 WO 2021027294A1 CN 2020082958 W CN2020082958 W CN 2020082958W WO 2021027294 A1 WO2021027294 A1 WO 2021027294A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
error
time point
parameter
model
Prior art date
Application number
PCT/CN2020/082958
Other languages
French (fr)
Chinese (zh)
Inventor
杨晓茹
鲍亭文
王旻轩
樊静
金超
Original Assignee
北京天泽智云科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京天泽智云科技有限公司 filed Critical 北京天泽智云科技有限公司
Publication of WO2021027294A1 publication Critical patent/WO2021027294A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]

Definitions

  • the invention relates to the field of data processing, in particular to a method and device for improving the data quality of a wind power system.
  • the predictive maintenance system in the field of wind power has formed a full life cycle management system from terminal data collection, algorithm model building, and predictive maintenance. It provides early warning of wind power failure, stable operation of wind power, safe grid connection, and saving operation and maintenance costs. All provided technical guidance.
  • the embodiment of the present invention provides a method and device for improving the data quality of a wind power system, which can effectively process collected system data and improve the quality of source data.
  • a method for improving data quality of a wind power system comprising:
  • the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more data segments corresponding to different sensors;
  • the processed system data is corrected to obtain the corrected system data.
  • the error types include any one or more of the following: missing errors, type errors, numerical errors, rule errors, and repeated errors;
  • the processing of the error data in the system according to the error type includes any one or more of the following:
  • the repeated data within the set time will be deleted; if the error type is repeated error data segment is the wind speed parameter , Or wind direction parameter, or temperature parameter, delete the repeated data segment.
  • the performing data padding on the data segment whose error type is a missing error includes:
  • the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
  • the correcting the processed system data based on the mechanism model includes:
  • the energy conservation model is used to determine the abnormal temperature parameter in the processed system data, and delete the abnormal temperature parameter.
  • the using the wind power model to correct the power parameter in the processed system data includes:
  • the power parameters in the current time point data are corrected according to the wind speed parameters and the wind turbine status code parameters in the current time point data.
  • the correcting the power parameter in the data at the current time point according to the wind speed parameter and the wind turbine status code parameter in the data at the current time point includes:
  • the method further includes:
  • the determining abnormal data in the corrected system data based on a mathematical model includes:
  • the abnormal data in the corrected system data is determined based on any one or more of the following mathematical models: a cluster model, a residual model, and a cluster model.
  • the clustering model includes any one or more of the following: k-means model, DBSCAN model;
  • the input of the clustering model is the revised system data, and the output is the data category and quantity corresponding to different time points.
  • the residual model includes any one or more of the following: linear regression, support vector machine, decision tree, neural network;
  • the input of the residual model is the corrected system data, and the output is the residuals corresponding to different time points.
  • determining the abnormal data in the corrected system based on the cluster system includes:
  • the wind turbine system determines whether the difference between each column of data of all wind turbines contained in each model in the cluster model and the corresponding average parameter of the wind farm exceeds the set threshold. If it exceeds the set threshold, determine the column of data The data at the time point is abnormal data.
  • a device for improving the data quality of a wind power system comprising:
  • the error data detection module is used to determine the error data and the error type in the collected system data;
  • the system data includes: data corresponding to each time point within a certain period of time, and each time point data includes one or more corresponding Data segments of different sensors;
  • a data processing module configured to process the error data in the system data according to the error type to obtain processed system data
  • the data correction module is used to correct the processed system data based on the mechanism model to obtain the corrected system data.
  • the error types include any one or more of the following: missing errors, type errors, numerical errors, rule errors, and repeated errors;
  • the data processing module is specifically configured to perform any one or more of the following processing on the error data in the system:
  • the data segment whose error type is repeated error is wind speed parameter, or wind direction parameter, or temperature parameter, delete the repeated data segment.
  • the data processing module performs data padding on data segments whose error types are missing errors in the following manner:
  • the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
  • the data correction module includes:
  • the wind speed parameter correction unit is used to determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter.
  • a power parameter correction unit configured to use a wind power model to correct the power parameters in the processed system data
  • the temperature parameter correction unit is used to determine the abnormal temperature parameter in the processed system data by using the energy conservation model, and delete the abnormal temperature parameter;
  • the power parameter correction unit includes:
  • the check subunit is used to check whether the data at each time point conforms to the wind power model
  • the correction subunit is used to correct the power parameters in the data at the current time point according to the wind speed parameters and the wind turbine status code parameters in the data at the current time point after the checking subunit determines that the data at the current time point does not conform to the wind power model .
  • the correction subunit is specifically configured to modify the power parameter in the data at the current time point to 0 when the wind speed parameter in the data at the current time point is less than the cut-in wind speed;
  • the wind speed parameter is greater than the cut-in wind speed and less than the rated wind speed, delete the power parameter in the data at the current time point;
  • the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the fan status code in the data at the current time point is It is a limited power code; if it is, the power parameter in the data at the current time point is modified to the limited power; otherwise, the power parameter in the data at the current time point is modified to the full power.
  • the device further includes:
  • An abnormal data detection module for determining abnormal data in the corrected system data based on a mathematical model
  • the abnormal data cleaning module is used to remove the abnormal data to obtain optimized system data.
  • the abnormal data detection module includes any one or more of the following modules:
  • the first detection module is configured to detect abnormal data in the corrected system data based on the clustering model
  • the second detection module is configured to detect abnormal data in the corrected system data based on the residual model
  • the third detection module is used to detect abnormal data in the corrected system data based on the cluster model.
  • the clustering model includes any one or more of the following: k-means model, DBSCAN model;
  • the input of the clustering model is the corrected system data, and the output is the data category and its quantity corresponding to different time points.
  • the residual model includes any one or more of the following: linear regression, support vector machine, decision tree, neural network;
  • the input of the residual model is the corrected system data, and the output is the residuals corresponding to different time points.
  • the third detection module is specifically configured to use the wind turbine system as a cluster model to determine whether the difference between each column of data of all wind turbines included in each model in the cluster model and the corresponding average parameter of the wind field exceeds the set value. Set the threshold. If it exceeds the set threshold, the data at the time point of the column of data is determined to be abnormal data.
  • An electronic device including: one or more processors and memories;
  • the memory is used to store computer-executable instructions
  • the processor is used to execute the computer-executable instructions to implement the aforementioned method.
  • a readable storage medium having instructions stored thereon, and the instructions are executed to implement the aforementioned method.
  • the method and device for improving the data quality of the wind power system determine the error data and the error type in the collected wind power system data, and first process the error data according to the different error types to obtain the processed data System data; Then, according to the processed system data, the data is corrected based on the mechanism model to obtain the corrected system data.
  • the solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
  • abnormal data detection can be performed on the corrected system data based on one or more mathematical models, and the detected abnormal data can be removed, so that the system data can be further optimized, and the quality of the system data can be better guaranteed.
  • Fig. 1 is a flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention
  • FIG. 2 is a flowchart of using a wind power model to correct power parameters in system data in an embodiment of the present invention
  • FIG. 3 is another flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention
  • Fig. 4 is a structural block diagram of the device for improving the data quality of the wind power system implemented by the present invention.
  • FIG. 5 is a structural block diagram of a data correction module in an embodiment of the present invention.
  • Fig. 6 is another structural block diagram of a device for improving data quality of a wind power system according to an embodiment of the present invention.
  • the embodiment of the present invention provides a method and device for improving the data quality of a wind power system.
  • the error data and the error type are determined, and the error data is first processed according to different error types.
  • FIG. 1 it is a flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention, which includes the following steps:
  • Step 101 Determine the error data and the error type in the collected system data.
  • the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more data segments corresponding to different sensors.
  • the existing technology can be used to determine the error data and the error type in the system data.
  • the error types are mainly as follows:
  • Missing error that is, the data in a certain data segment is empty
  • Type error that is, the data type of a data segment does not match the actual type of the data segment.
  • the value of a data segment should be 12.5, but it is recorded as the string ‘12.5’;
  • Rule error that is, a certain data segment does not meet the set rules, such as the fan status code should be an integer between 0 and n, and the fan start and stop status should be 0 or 1;
  • Duplicate error that is, a certain data repeats within a certain period of time.
  • the embodiment of the present invention also provides a method for determining the error data and its error type.
  • This method is not only performed horizontally for the system data collected within a certain period of time. Detection is to detect the data at each time point, and also to perform longitudinal detection, that is, to detect each data segment within a certain time period.
  • the method specifically includes: horizontal detection and vertical detection; among them:
  • Horizontal detection refers to the detection of data at each time point in sequence with time points as a unit, marking the position of abnormal data segments in the data, and recording error types;
  • Longitudinal detection refers to sequentially detecting each data segment within a certain period of time using data segments as a unit, and recording abnormal data segments and error types.
  • the detection of data at each time point in the horizontal detection mainly includes:
  • detecting each data segment within the certain time period includes:
  • Step 102 Process the error data in the system data according to the error type to obtain processed system data.
  • the missing data segment is the wind speed parameter or power parameter
  • the data can be filled according to the wind power model, that is to say, the value of the missing data segment is calculated according to the other data segment and the wind power model corresponding to the time point of the missing data segment, and the value is filled to the corresponding position;
  • the wind power model is a functional relationship between wind speed and active power of a wind turbine, and is related to the factory design parameters of the wind turbine itself.
  • the error type is missing and the wrong data segment is not a wind speed parameter or a power parameter, since wind power system data usually uses a high sampling rate for data collection, data can be filled by interpolation, and the time point of the missing data segment can be used.
  • the corresponding data segments before and after are interpolated and filled. If the corresponding data segments before and after the time point to which the missing data segment belongs are also missing, random numbers can be used for interpolation and filling, where the random numbers come from the 3sigma range of the normal distribution of the data segment.
  • the fan status code column should be an integer. If the corresponding data segment is not an integer, then its value will be rounded. If the corresponding data segment is a character string and the rounding process cannot be performed, the data segment will be deleted.
  • any one or more of the above-mentioned processing methods can be adopted as required, and the processing sequence of different types of error data is not required.
  • the embodiments of the present invention are not limited to the above-mentioned processing, and there may be corresponding processing methods for other error types.
  • Step 103 Correct the processed system data based on the mechanism model to obtain corrected system data.
  • the corresponding wind direction parameter is abnormal data and it is deleted.
  • Step 201 Check in turn whether the data at each time point conforms to the wind power model; if not, perform step 202; otherwise, end.
  • Step 202 Determine whether the wind speed parameter in the data at the current time point is less than the cut-in wind speed; if yes, go to step 203; otherwise, go to step 204;
  • Step 203 Modify the power parameter in the data at the current time point to 0;
  • Step 204 Determine whether the wind speed parameter in the data at the current time point is less than the rated wind speed; if yes, go to step 205; otherwise, go to step 206;
  • Step 205 Delete the power parameter in the data at the current time point
  • Step 206 check whether the wind turbine status code in the data at the current time point is a limited power code; if yes, go to step 207; otherwise, go to step 208;
  • Step 207 Modify the power parameter in the data at the current time point to a limited power
  • Step 208 Modify the power parameter in the data at the current time point to full power.
  • an energy conservation model is established for the approximately enclosed space such as the nacelle, control cabinet, and gear box in the wind turbine system.
  • the actual data collected does not conform to the energy conservation model, the corresponding data is deleted.
  • the method for improving the data quality of the wind power system determines the error data and the error type in the collected wind power system data.
  • the error data is processed according to the different error types to obtain the processed system data ;
  • data correction is made based on the mechanism model to obtain the corrected system data.
  • the solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
  • FIG. 3 it is another flow chart of the method for improving the data quality of the wind power system according to the embodiment of the present invention, which includes the following steps:
  • Step 301 Determine the error data and the error type in the collected system data.
  • Step 302 Process the error data in the system data according to the error type to obtain processed system data.
  • Step 303 Correct the processed system data based on the mechanism model to obtain corrected system data.
  • Step 304 Determine abnormal data in the corrected system data based on a mathematical model.
  • the abnormal data in the revised system data may be determined based on any one or more of the following mathematical models: a clustering model, a residual model, and a clustering model.
  • the union of the abnormal data obtained according to different models can be used as the final abnormal data, or the abnormal data obtained from different models can be merged Calculation (such as weighted calculation), according to the calculation result to determine the final abnormal data.
  • the clustering model may specifically include any one or more of the following: k-means model, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) model; the input of the clustering model is the corrected system data , The output is the data category and quantity corresponding to different time points.
  • the category data is determined to be abnormal data.
  • the residual model may specifically include any one or more of the following: linear regression, support vector machine, decision tree, neural network; the input of the residual model is the corrected system data, and the output is different time points The corresponding residual.
  • the wind turbine system is taken as a cluster model, and it is judged that each column of data of all wind turbines contained in each model in the cluster model and the corresponding average parameters of the wind field (such as temperature, wind speed, rotation speed, power) Etc.) Whether the difference value exceeds the set threshold, if it exceeds the set threshold, it is determined that the data at the time point of the column of data is abnormal data.
  • Step 305 Remove the abnormal data to obtain optimized system data.
  • the method for improving the data quality of the wind power system not only processes the erroneous data caused in the data collection and transmission process of the wind power system, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data.
  • One or more mathematical models perform abnormal data detection on the corrected system data, and remove the detected abnormal data, so that the system data is further optimized, and the quality of the system data can be better guaranteed.
  • the embodiment of the present invention also provides a device for improving the data quality of the wind power system.
  • FIG. 4 it is a structural block diagram of the device for improving the data quality of the wind power system implemented by the present invention.
  • the device includes the following modules:
  • the error data detection module 401 is used to determine the error data and the error type in the collected system data; the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more Corresponding to the data segment of different sensors;
  • the data processing module 402 is configured to process the error data in the system data according to the error type to obtain processed system data;
  • the data correction module 403 is used to correct the processed system data based on the mechanism model to obtain the corrected system data.
  • the above-mentioned error data detection module 401 can use existing technology to determine the error data and the error type in the system data.
  • the error types mainly include the following types: missing errors, type errors, numerical errors, rule errors, and repeated errors.
  • the above-mentioned error data detection module 401 can also determine the error data and its error type in the following manner:
  • Horizontal detection using time point as a unit, sequentially detect data at each time point, mark the position of the abnormal data segment in the data, and record the error type;
  • Longitudinal detection Take the data segment as a unit, sequentially detect each data segment within the certain time period, and record the abnormal data segment and the error type.
  • the data processing module 402 can adopt different processing methods. For example, any one or more of the following processing can be performed on the error data in the system:
  • Data filling is performed on the data segment whose error type is missing error. For example, if the data segment with the error type of missing error is a wind speed parameter or power parameter, data filling is performed according to the wind power model; otherwise, data filling is performed through interpolation;
  • the repeated data segment is deleted.
  • the data correction module 403 can use various mechanism models to correct some data segments in the system data. For example, a specific structure of the data correction module 403 is shown in Figure 5. Show, including the following units:
  • the wind speed parameter correction unit 433 is configured to determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter;
  • the power parameter correction unit 431 is configured to correct the power parameters in the processed system data by using the wind power model
  • the temperature parameter correction unit 432 is configured to determine an abnormal temperature parameter in the processed system data by using an energy conservation model, and delete the abnormal temperature parameter.
  • the power parameter correction unit includes: an inspection unit and a correction subunit, wherein:
  • the checking subunit is used to check whether the data at each time point conforms to the wind power model in turn;
  • the correction subunit is used for correcting the power in the data at the current time point according to the wind speed parameters and the wind turbine status code parameters in the data at the current time point after the checking subunit determines that the data at the current time point does not conform to the wind power model parameter.
  • the power parameter in the data at the current time point is modified to 0; when the wind speed parameter in the data at the current time point is greater than the cut-in wind speed and less than the rated wind speed , Delete the power parameter in the data at the current time point; when the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the status code of the wind turbine in the data at the current time point is a limited power code; if so, change The power parameter in the data at the current time point is modified to the limited power; otherwise, the power parameter in the data at the current time point is modified to the full power.
  • the specific correction process can refer to the embodiment shown in Figure 2 above.
  • the device for improving the data quality of the wind power system determines the error data and the error type in the collected wind power system data, and first processes the error data according to the different error types to obtain the processed system data ; Then for the processed system data, data correction is made based on the mechanism model to obtain the corrected system data.
  • the solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
  • FIG. 6 it is another structural block diagram of an apparatus for improving data quality of a wind power system according to an embodiment of the present invention.
  • the device further includes the following modules:
  • An abnormal data detection module 404 configured to determine abnormal data in the corrected system data based on a mathematical model
  • the abnormal data cleaning module 405 is used to remove the abnormal data to obtain optimized system data.
  • the abnormal data detection module 404 may determine the abnormal data in the corrected system data based on one or more mathematical models.
  • a specific structure of the abnormal data detection module 404 may include But not limited to any one or more of the following modules:
  • the first detection module is configured to detect abnormal data in the corrected system data based on the clustering model
  • the second detection module is configured to detect abnormal data in the corrected system data based on the residual model
  • the third detection module is used to detect abnormal data in the corrected system data based on the cluster model.
  • the clustering model may include, but is not limited to, any one or more of the following: k-means model, DBSCAN model; the input of the clustering model is the corrected system data, and the output is corresponding to different time points The data category and its quantity.
  • the residual model may include, but is not limited to, any one or more of the following: linear regression, support vector machine, decision tree, neural network; the input of the residual model is the corrected system data, and the output is different The residual error corresponding to the time point.
  • the third detection module is specifically used to use the wind turbine system as a cluster model to determine whether the difference between the data of all the wind turbines contained in each model in the cluster model and the corresponding average parameter of the wind farm exceeds a set threshold, and if it exceeds Set the threshold to determine that the data at the time point of the column of data is abnormal data.
  • each embodiment of the device for improving the data quality of the wind power system since the function implementation of each module and unit is similar to that of the corresponding method, the description of each embodiment of the dialog generating device is relatively simple. For related details, please refer to the description of the corresponding part of the method embodiment.
  • the device for improving the data quality of the wind power system not only processes the erroneous data caused in the data collection and transmission process of the wind power system, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data.
  • One or more mathematical models perform abnormal data detection on the corrected system data, and remove the detected abnormal data, so that the system data is further optimized, and the quality of the system data can be better guaranteed.
  • the program can be stored in a computer-readable storage medium, which is referred to as storage herein. Medium, such as: ROM/RAM, magnetic disk, optical disk, etc.
  • an embodiment of the present invention also provides a device for improving the data quality of a wind power system.
  • the device is an electronic device, such as a mobile terminal, a computer, a tablet device, a medical device, a fitness device, or a personal computer. Digital assistants, etc.
  • the electronic device may include one or more processors and memories; wherein the memory is used to store computer executable instructions, and the processor is used to execute the computer executable instructions to implement the foregoing method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Wind Motors (AREA)

Abstract

Disclosed in the present invention are a method and apparatus for improving wind power system data quality. The method comprises: determining error data in acquired system data and an error type thereof, the system data comprising data corresponding to each time point within a specific period of time, and the data of each time point comprising one or more data segments corresponding to different sensors; processing the error data in the system data according to the error type to obtain processed system data; and correcting the processed system data on the basis of a mechanism model to obtain corrected system data. By using the present invention, the quality of system data acquired by a wind power system can be effectively improved.

Description

提高风电系统数据质量的方法及装置Method and device for improving data quality of wind power system 技术领域Technical field
本发明涉及数据处理领域,具体涉及一种提高风电系统数据质量的方法及装置。The invention relates to the field of data processing, in particular to a method and device for improving the data quality of a wind power system.
背景技术Background technique
风电领域预测性维护系统从终端数据采集、算法模型搭建、再到预测性维护,形成了全生命周期管理系统,为风力发电故障预警,风力发电稳定运行,安全并网,节省运维成本等方向均提供了技术指导。The predictive maintenance system in the field of wind power has formed a full life cycle management system from terminal data collection, algorithm model building, and predictive maintenance. It provides early warning of wind power failure, stable operation of wind power, safe grid connection, and saving operation and maintenance costs. All provided technical guidance.
然而风电机组运行工况复杂多变,机组状态监测数据量大,由于机组停机、减载、通信噪声和设备故障等因素,会产生大量异常数据,如采集到的源数据长时间出现重复值、零值等无效数据,或者采集到的数据不符合物理规律等,严重影响风电预测性维护模型精度,从而导致模型预警结果出现误报,漏报等情形。同时,由于风电领域专家知识和算法建模人员的知识不匹配,导致模型建模时异常数据筛选不完全,增加了预警模型建模难度。However, the operating conditions of wind turbines are complex and changeable, and the amount of data for the condition monitoring of the turbines is large. Due to factors such as turbine shutdown, load shedding, communication noise, and equipment failures, a large number of abnormal data will be generated, such as repeated values, Invalid data such as zero value, or collected data that does not conform to physical laws, etc., seriously affects the accuracy of the predictive maintenance model of wind power, leading to false alarms and omissions in the model early warning results. At the same time, due to the mismatch between the knowledge of experts in the field of wind power and the knowledge of algorithm modelers, the abnormal data screening during model modeling is incomplete, which increases the difficulty of early warning model modeling.
综上,现有技术中对风电机组运行数据的收集、管理、分析和挖掘仍存在诸多不足,不能准确辨识所采集数据的质量差异,进而不能有效支撑粗糙数据的正确筛选和合理优化,使得数据质量得不到保障。如果这些数据不经处理直接使用,则会使得风力发电统计特性发生畸变,进而会影响风电机组的运行状态和运行特性的预测性维护结果。In summary, there are still many shortcomings in the collection, management, analysis, and mining of wind turbine operating data in the prior art. The quality difference of collected data cannot be accurately identified, and the correct selection and reasonable optimization of rough data cannot be effectively supported. Quality cannot be guaranteed. If these data are used directly without processing, the statistical characteristics of wind power will be distorted, which will affect the operation status of the wind turbine and the predictive maintenance results of the operation characteristics.
发明内容Summary of the invention
本发明实施例提供一种提高风电系统数据质量的方法及装置,可以对采集的系统数据进行有效处理,提高源数据的质量。The embodiment of the present invention provides a method and device for improving the data quality of a wind power system, which can effectively process collected system data and improve the quality of source data.
为此,本发明提供如下技术方案:To this end, the present invention provides the following technical solutions:
一种提高风电系统数据质量的方法,所述方法包括:A method for improving data quality of a wind power system, the method comprising:
确定采集的系统数据中的错误数据及其错误类型;所述系统数据包括:一定时间段内对应各时间点的数据,每个时间点的数据包括一个或多个对应不同传感器的数据段;Determine the error data and error types in the collected system data; the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more data segments corresponding to different sensors;
根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据;Processing the error data in the system data according to the error type to obtain processed system data;
基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。Based on the mechanism model, the processed system data is corrected to obtain the corrected system data.
可选地,所述错误类型包括以下任意一种或多种:缺失错误、类型错误、数值错误、规则错误、重复错误;Optionally, the error types include any one or more of the following: missing errors, type errors, numerical errors, rule errors, and repeated errors;
所述根据所述错误类型对所述系统中的错误数据进行处理包括以下任意一种或多种:The processing of the error data in the system according to the error type includes any one or more of the following:
对错误类型为缺失错误的数据段进行数据填补;Data filling is performed on the data segment whose error type is missing error;
对错误类型为类型错误的数据段进行数据类型转换或删除;Data type conversion or deletion of the data segment whose error type is type error;
对错误类型为数值错误的数据段进行删除;Delete the data segment whose error type is numerical error;
对错误类型为规则错误的数据段进行数据转换或删除;Data conversion or deletion of data segments whose error types are rule errors;
对于错误类型为重复错误的数据段,如果对应不同传感器的所有数据段均持续重复超过设定时间,则删除所述设定时间内重复的数据;如果错误类型为重复错误的数据段为风速参数、或风向参数、或温度参数,则删除重复的数据段。For the data segment whose error type is repeated error, if all data segments corresponding to different sensors continue to repeat for more than the set time, the repeated data within the set time will be deleted; if the error type is repeated error data segment is the wind speed parameter , Or wind direction parameter, or temperature parameter, delete the repeated data segment.
可选地,所述对错误类型为缺失错误的数据段进行数据填补包括:Optionally, the performing data padding on the data segment whose error type is a missing error includes:
如果错误类型为缺失错误的数据段是风速参数或功率参数,则根据风功率模型进行数据填补;If the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
否则,通过插值进行数据填补。Otherwise, the data is filled by interpolation.
可选地,所述基于机理模型对所述处理后的系统数据进行修正包括:Optionally, the correcting the processed system data based on the mechanism model includes:
利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数;Determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter;
利用风功率模型对所述处理后的系统数据中的功率参数进行修正;Correcting the power parameters in the processed system data by using the wind power model;
利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数。The energy conservation model is used to determine the abnormal temperature parameter in the processed system data, and delete the abnormal temperature parameter.
可选地,所述利用风功率模型对所述处理后的系统数据中的功率参数进行修正包括:Optionally, the using the wind power model to correct the power parameter in the processed system data includes:
依次检查每个时间点的数据是否符合风功率模型;Check in turn whether the data at each time point conforms to the wind power model;
如果当前时间点的数据不符合风功率模型,则根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数。If the data at the current time point does not conform to the wind power model, the power parameters in the current time point data are corrected according to the wind speed parameters and the wind turbine status code parameters in the current time point data.
可选地,所述根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数包括:Optionally, the correcting the power parameter in the data at the current time point according to the wind speed parameter and the wind turbine status code parameter in the data at the current time point includes:
如果当前时间点的数据中的风速参数小于切入风速,则将当前时间点的数据中的功率参数修改为0;If the wind speed parameter in the data at the current time point is less than the cut-in wind speed, modify the power parameter in the data at the current time point to 0;
如果当前时间点的数据中的风速参数大于切入风速并且小于额定风速,则将当前时间点的数据中的功率参数删除;If the wind speed parameter in the data at the current time point is greater than the cut-in wind speed and less than the rated wind speed, delete the power parameter in the data at the current time point;
如果当前时间点的数据中的风速参数大于额定风速,则检查当前时间点的数据中的风机状态码是否为限定功率码;If the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the wind turbine status code in the data at the current time point is a limited power code;
如果是,则将当前时间点的数据中的功率参数修改为限定功率;If so, modify the power parameter in the data at the current time point to the limited power;
否则,将当前时间点的数据中的功率参数修改为满发功率。Otherwise, modify the power parameter in the data at the current time point to full power.
可选地,所述方法还包括:Optionally, the method further includes:
基于数学模型确定所述修正后的系统数据中的异常数据;Determining abnormal data in the revised system data based on a mathematical model;
去除所述异常数据,得到优化后的系统数据。Remove the abnormal data to obtain optimized system data.
可选地,所述基于数学模型确定所述修正后的系统数据中的异常数据包括:Optionally, the determining abnormal data in the corrected system data based on a mathematical model includes:
基于以下任意一种或多种数学模型确定所述修正后的系统数据中的异常数据:聚类模型、残差模型、以及集群模型。The abnormal data in the corrected system data is determined based on any one or more of the following mathematical models: a cluster model, a residual model, and a cluster model.
可选地,所述聚类模型包括以下任意一种或多种:k-means模型,DBSCAN模型;Optionally, the clustering model includes any one or more of the following: k-means model, DBSCAN model;
所述聚类模型的输入为所述修正后的系统数据,输出为不同时间点对 应的数据类别及其数量。The input of the clustering model is the revised system data, and the output is the data category and quantity corresponding to different time points.
可选地,所述残差模型包括以下任意一种或多种:线性回归,支持向量机,决策树,神经网络;Optionally, the residual model includes any one or more of the following: linear regression, support vector machine, decision tree, neural network;
所述残差模型的输入为所述修正后的系统数据,输出为不同时间点对应的残差。The input of the residual model is the corrected system data, and the output is the residuals corresponding to different time points.
可选地,基于集群系统确定所述修正后的系统中的异常数据包括:Optionally, determining the abnormal data in the corrected system based on the cluster system includes:
将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。Regarding the wind turbine system as a cluster model, determine whether the difference between each column of data of all wind turbines contained in each model in the cluster model and the corresponding average parameter of the wind farm exceeds the set threshold. If it exceeds the set threshold, determine the column of data The data at the time point is abnormal data.
一种提高风电系统数据质量的装置,所述装置包括:A device for improving the data quality of a wind power system, the device comprising:
错误数据检测模块,用于确定采集的系统数据中的错误数据及其错误类型;所述系统数据包括:一定时间段内对应各时间点的数据,每个时间点的数据包括一个或多个对应不同传感器的数据段;The error data detection module is used to determine the error data and the error type in the collected system data; the system data includes: data corresponding to each time point within a certain period of time, and each time point data includes one or more corresponding Data segments of different sensors;
数据处理模块,用于根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据;A data processing module, configured to process the error data in the system data according to the error type to obtain processed system data;
数据修正模块,用于基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。The data correction module is used to correct the processed system data based on the mechanism model to obtain the corrected system data.
可选地,所述错误类型包括以下任意一种或多种:缺失错误、类型错误、数值错误、规则错误、重复错误;Optionally, the error types include any one or more of the following: missing errors, type errors, numerical errors, rule errors, and repeated errors;
所述数据处理模块,具体用于对所述系统中的错误数据进行以下任意一种或多种处理:The data processing module is specifically configured to perform any one or more of the following processing on the error data in the system:
对错误类型为缺失错误的数据段进行数据填补;Data filling is performed on the data segment whose error type is missing error;
对错误类型为类型错误的数据段进行数据类型转换或删除;Data type conversion or deletion of the data segment whose error type is type error;
对错误类型为数值错误的数据段进行删除;Delete the data segment whose error type is numerical error;
对错误类型为规则错误的数据段进行数据转换或删除;Data conversion or deletion of data segments whose error types are rule errors;
如果对应不同传感器的所有数据段均持续重复超过设定时间,则删除所述设定时间内重复的数据;If all data segments corresponding to different sensors continue to repeat more than the set time, then delete the repeated data within the set time;
如果错误类型为重复错误的数据段为风速参数、或风向参数、或温度 参数,则删除重复的数据段。If the data segment whose error type is repeated error is wind speed parameter, or wind direction parameter, or temperature parameter, delete the repeated data segment.
可选地,所述数据处理模块按以下方式对错误类型为缺失错误的数据段进行数据填补:Optionally, the data processing module performs data padding on data segments whose error types are missing errors in the following manner:
如果错误类型为缺失错误的数据段是风速参数或功率参数,则根据风功率模型进行数据填补;If the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
否则,通过插值进行数据填补。Otherwise, the data is filled by interpolation.
可选地,所述数据修正模块包括:Optionally, the data correction module includes:
风速参数修正单元,用于利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数。The wind speed parameter correction unit is used to determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter.
功率参数修正单元,用于利用风功率模型对所述处理后的系统数据中的功率参数进行修正;A power parameter correction unit, configured to use a wind power model to correct the power parameters in the processed system data;
温度参数修正单元,用于利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数;The temperature parameter correction unit is used to determine the abnormal temperature parameter in the processed system data by using the energy conservation model, and delete the abnormal temperature parameter;
可选地,所述功率参数修正单元包括:Optionally, the power parameter correction unit includes:
检查子单元,用于依次检查每个时间点的数据是否符合风功率模型;The check subunit is used to check whether the data at each time point conforms to the wind power model;
修正子单元,用于在所述检查子单元确定当前时间点的数据不符合风功率模型后,根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数。The correction subunit is used to correct the power parameters in the data at the current time point according to the wind speed parameters and the wind turbine status code parameters in the data at the current time point after the checking subunit determines that the data at the current time point does not conform to the wind power model .
可选地,所述修正子单元,具体用于在当前时间点的数据中的风速参数小于切入风速时,将当前时间点的数据中的功率参数修改为0;在当前时间点的数据中的风速参数大于切入风速并且小于额定风速时,将当前时间点的数据中的功率参数删除;在当前时间点的数据中的风速参数大于额定风速时,检查当前时间点的数据中的风机状态码是否为限定功率码;如果是,则将当前时间点的数据中的功率参数修改为限定功率;否则,将当前时间点的数据中的功率参数修改为满发功率。Optionally, the correction subunit is specifically configured to modify the power parameter in the data at the current time point to 0 when the wind speed parameter in the data at the current time point is less than the cut-in wind speed; When the wind speed parameter is greater than the cut-in wind speed and less than the rated wind speed, delete the power parameter in the data at the current time point; when the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the fan status code in the data at the current time point is It is a limited power code; if it is, the power parameter in the data at the current time point is modified to the limited power; otherwise, the power parameter in the data at the current time point is modified to the full power.
可选地,所述装置还包括:Optionally, the device further includes:
异常数据检测模块,用于基于数学模型确定所述修正后的系统数据中的异常数据;An abnormal data detection module for determining abnormal data in the corrected system data based on a mathematical model;
异常数据清理模块,用于去除所述异常数据,得到优化后的系统数据。The abnormal data cleaning module is used to remove the abnormal data to obtain optimized system data.
可选地,所述异常数据检测模块包括以下任意一种或多种模块:Optionally, the abnormal data detection module includes any one or more of the following modules:
第一检测模块,用于基于聚类模型检测所述修正后的系统数据中的异常数据;The first detection module is configured to detect abnormal data in the corrected system data based on the clustering model;
第二检测模块,用于基于残差模型检测所述修正后的系统数据中的异常数据;The second detection module is configured to detect abnormal data in the corrected system data based on the residual model;
第三检测模块,用于基于集群模型检测所述修正后的系统数据中的异常数据。The third detection module is used to detect abnormal data in the corrected system data based on the cluster model.
可选地,所述聚类模型包括以下任意一种或多种:k-means模型,DBSCAN模型;Optionally, the clustering model includes any one or more of the following: k-means model, DBSCAN model;
所述聚类模型的输入为所述修正后的系统数据,输出为不同时间点对应的数据类别及其数量。The input of the clustering model is the corrected system data, and the output is the data category and its quantity corresponding to different time points.
可选地,所述残差模型包括以下任意一种或多种:线性回归,支持向量机,决策树,神经网络;Optionally, the residual model includes any one or more of the following: linear regression, support vector machine, decision tree, neural network;
所述残差模型的输入为所述修正后的系统数据,输出为不同时间点对应的残差。The input of the residual model is the corrected system data, and the output is the residuals corresponding to different time points.
可选地,所述第三检测模块,具体用于将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。Optionally, the third detection module is specifically configured to use the wind turbine system as a cluster model to determine whether the difference between each column of data of all wind turbines included in each model in the cluster model and the corresponding average parameter of the wind field exceeds the set value. Set the threshold. If it exceeds the set threshold, the data at the time point of the column of data is determined to be abnormal data.
一种电子设备,包括:一个或多个处理器、存储器;An electronic device including: one or more processors and memories;
所述存储器用于存储计算机可执行指令,所述处理器用于执行所述计算机可执行指令,以实现前面所述的方法。The memory is used to store computer-executable instructions, and the processor is used to execute the computer-executable instructions to implement the aforementioned method.
一种可读存储介质,其上存储有指令,所述指令被执行以实现前面所述的方法。A readable storage medium having instructions stored thereon, and the instructions are executed to implement the aforementioned method.
本发明实施例提供的提高风电系统数据质量的方法及装置,针对采集的风电系统数据,确定其中的错误数据及错误类型,首先根据不同的错误类型对其中的错误数据进行处理,得到处理后的系统数据;然后再针对处 理后的系统数据,基于机理模型进行数据修正,得到修正后的系统数据。本发明实施例的方案不仅对数据采集及传送过程中导致的错误数据进行处理,而且根据风电系统数据的特点,基于机理模型对系统数据进行修正,从而使最终得到的系统数据具有更高的质量,有效地避免了低质量的源数据对后续系统运维管理等工作产生干扰,保证风电机组的正常运行。The method and device for improving the data quality of the wind power system provided by the embodiments of the present invention determine the error data and the error type in the collected wind power system data, and first process the error data according to the different error types to obtain the processed data System data; Then, according to the processed system data, the data is corrected based on the mechanism model to obtain the corrected system data. The solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
进一步地,还可基于一种或多种数学模型对修正后的系统数据进行异常数据检测,并去除检测到的异常数据,使系统数据得到进一步优化,可以更好地保证系统数据的质量。Further, abnormal data detection can be performed on the corrected system data based on one or more mathematical models, and the detected abnormal data can be removed, so that the system data can be further optimized, and the quality of the system data can be better guaranteed.
附图说明Description of the drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings needed in the embodiments. Obviously, the drawings in the following description are only those described in the present invention. For some of the embodiments, for those of ordinary skill in the art, other drawings may be obtained based on these drawings.
图1是本发明实施例提高风电系统数据质量的方法的一种流程图;Fig. 1 is a flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention;
图2是本发明实施例中利用风功率模型对系统数据中的功率参数进行修正的流程图;FIG. 2 is a flowchart of using a wind power model to correct power parameters in system data in an embodiment of the present invention;
图3是本发明实施例提高风电系统数据质量的方法的另一种流程图;FIG. 3 is another flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention;
图4是本发明实施提高风电系统数据质量的装置的一种结构框图;Fig. 4 is a structural block diagram of the device for improving the data quality of the wind power system implemented by the present invention;
图5是本发明实施例中数据修正模块的一种结构框图;FIG. 5 is a structural block diagram of a data correction module in an embodiment of the present invention;
图6是本发明实施例提高风电系统数据质量的装置的另一种结构框图。Fig. 6 is another structural block diagram of a device for improving data quality of a wind power system according to an embodiment of the present invention.
具体实施方式detailed description
为了使本技术领域的人员更好地理解本发明实施例的方案,下面结合附图和实施方式对本发明实施例作进一步的详细说明。In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below with reference to the accompanying drawings and implementation manners.
本发明实施例提供一种提高风电系统数据质量的方法及装置,针对采集的风电系统数据,确定其中的错误数据及错误类型,首先根据不同的错 误类型对其中的错误数据进行处理,得到处理后的系统数据;然后再针对处理后的系统数据,基于机理模型进行数据修正,得到修正后的系统数据。The embodiment of the present invention provides a method and device for improving the data quality of a wind power system. According to the collected wind power system data, the error data and the error type are determined, and the error data is first processed according to different error types. The system data; then for the processed system data, the data is corrected based on the mechanism model to obtain the corrected system data.
如图1所示,是本发明实施例提高风电系统数据质量的方法的一种流程图,包括以下步骤:As shown in Figure 1, it is a flowchart of a method for improving data quality of a wind power system according to an embodiment of the present invention, which includes the following steps:
步骤101,确定采集的系统数据中的错误数据及其错误类型。Step 101: Determine the error data and the error type in the collected system data.
所述系统数据包括:一定时间段内对应各时间点的数据,每个时间点的数据包括一个或多个对应不同传感器的数据段。The system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more data segments corresponding to different sensors.
由于风电系统机组众多、运行状态各异,因此需要监测采集的数据量很大;另外在数据传送过程中难免会受到噪声干扰等因素的影响,使得采集的系统数据中的有些数据发生错误,产生错误数据。由于产生错误数据的原因、对象等不同,因此不同错误数据的错误类型也会有所不同。Due to the large number of wind power system units and different operating states, the amount of data collected by monitoring is very large; in addition, the data transmission process will inevitably be affected by factors such as noise interference, causing errors in some data in the collected system data. Bad data. Due to the different causes and objects of the error data, the error type of different error data will be different.
在实际应用中,可以采用现有技术来确定所述系统数据中的错误数据及其错误类型。其中,所述错误类型主要有以下几种:In practical applications, the existing technology can be used to determine the error data and the error type in the system data. Among them, the error types are mainly as follows:
缺失错误,即某个数据段的数据为空;Missing error, that is, the data in a certain data segment is empty;
类型错误,即某个数据段的数据类型与该数据段的实际类型不相符,比如某数据段的数值应为12.5,但被记录为字符串‘12.5’;Type error, that is, the data type of a data segment does not match the actual type of the data segment. For example, the value of a data segment should be 12.5, but it is recorded as the string ‘12.5’;
数值错误,即某个数据段的数值不在规定范围内,如环境温度值被记录为100℃,显然不符合常理;Numerical error, that is, the value of a certain data segment is not within the specified range. For example, the ambient temperature value is recorded as 100°C, which obviously does not conform to common sense;
规则错误,即某个数据段不符合设定规则,如风机状态码应为0-n之间的整数、风机启停机状态应为0或1等规则;Rule error, that is, a certain data segment does not meet the set rules, such as the fan status code should be an integer between 0 and n, and the fan start and stop status should be 0 or 1;
重复错误,即某个数据在一定时间内产生重复。Duplicate error, that is, a certain data repeats within a certain period of time.
当然,根据实际应用需要,还可以有其他错误类型,对此本发明实施例不做限定。Of course, according to actual application requirements, there may be other error types, which are not limited in the embodiment of the present invention.
另外,为了更准确、全面地确定其中的错误数据,本发明实施例还提供了一种确定错误数据及其错误类型的方法,该方法针对一定时间段内采集的系统数据,不仅对其进行横向检测,即针对各时间点的数据进行检测,而且还对其进行纵向检测,即针对一定时间段内的各数据段进行检测。该方法具体包括:横向检测和纵向检测;其中:In addition, in order to more accurately and comprehensively determine the error data therein, the embodiment of the present invention also provides a method for determining the error data and its error type. This method is not only performed horizontally for the system data collected within a certain period of time. Detection is to detect the data at each time point, and also to perform longitudinal detection, that is, to detect each data segment within a certain time period. The method specifically includes: horizontal detection and vertical detection; among them:
横向检测是指以时间点为单位,依次对各时间点的数据进行检测,标记所述数据中异常数据段的位置,并记录错误类型;Horizontal detection refers to the detection of data at each time point in sequence with time points as a unit, marking the position of abnormal data segments in the data, and recording error types;
纵向检测是指以数据段为单位,依次对所述一定时间段内的各数据段进行检测,并记录异常数据段及错误类型。Longitudinal detection refers to sequentially detecting each data segment within a certain period of time using data segments as a unit, and recording abnormal data segments and error types.
其中,所述横向检测中对各时间点的数据进行检测主要包括:Wherein, the detection of data at each time point in the horizontal detection mainly includes:
检测所述数据中是否有缺失数据段;如果有,则标记缺失数据段的位置,并记录错误类型为:缺失错误;Detect whether there is a missing data segment in the data; if so, mark the position of the missing data segment, and record the error type as: missing error;
检测所述数据中各数据段的类型是否正确;如果不正确,则标记所述数据段的位置,并记录错误类型为:类型错误;Detect whether the type of each data segment in the data is correct; if not, mark the position of the data segment, and record the error type as: type error;
检测所述数据中各数据段的数值是否在规定范围内;如果不是,则标记所述数据段的位置,并记录错误类型为:数值错误;Detect whether the value of each data segment in the data is within the specified range; if not, mark the position of the data segment, and record the type of error as: numerical error;
检测所述数据中各数据段是否符合设定规则;如果不符合,则标记数据段的位置,并记录错误类型为:规则错误。It is detected whether each data segment in the data meets the set rule; if it does not, the position of the data segment is marked, and the error type is recorded as: rule error.
所述纵向检测中对所述一定时间段内的各数据段进行检测包括:In the longitudinal detection, detecting each data segment within the certain time period includes:
检测所述数据段在所述一定时间段内的重复性,并标记重复数据段,记录错误类型为:重复错误;Detect the repeatability of the data segment in the certain time period, mark the repeated data segment, and record the error type as: repeated error;
检测所述数据段在所述一定时间段内的连续性,并标记缺失的数据段,记录错误类型为:缺失错误。Detect the continuity of the data segment within the certain time period, mark the missing data segment, and record the error type as: missing error.
步骤102,根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据。Step 102: Process the error data in the system data according to the error type to obtain processed system data.
对于不同错误类型的错误数据,可以采用不同的处理方式。在本发明实施例中,针对前面提到的几种错误类型,可以分别采用以下处理方式:Different types of error data can be handled in different ways. In the embodiment of the present invention, the following processing methods can be adopted for the several types of errors mentioned above:
(1)对错误类型为缺失错误的数据段进行数据填补,具体有以下几种情况:(1) Data filling is performed on the data segment whose error type is missing error, specifically in the following situations:
如果所述系统数据中某行数据(即对应某一时间点的数据)或某列数据(即对应一定时间段内的某个数据段)全部缺失,则对该行数据或该列数据不进行填补,直接删除该行或该列。If a certain row of data (that is, data corresponding to a certain point in time) or a certain column of data (that is, corresponding to a certain data segment within a certain period of time) in the system data are all missing, then the row or column of data is not Fill, delete the row or column directly.
如果某行或某列中的个别数据段缺失,则需要根据缺失的数据段所代 表的系统参数不同采取不同的填补措施,比如:如果错误类型为缺失错误的数据段是风速参数或功率参数,则可以根据风功率模型进行数据填补,也就是说,根据缺失数据段所属时间点对应的其它数据段及风功率模型,计算得到该缺失数据段的数值,将该数值填补到相应的位置;所述风功率模型是一种风电机组风速和有功功率的函数关系,和风机本身的出厂设计参数有关。If an individual data segment in a row or a column is missing, different filling measures need to be taken according to the system parameters represented by the missing data segment. For example, if the error type is the missing data segment is the wind speed parameter or power parameter, Then the data can be filled according to the wind power model, that is to say, the value of the missing data segment is calculated according to the other data segment and the wind power model corresponding to the time point of the missing data segment, and the value is filled to the corresponding position; The wind power model is a functional relationship between wind speed and active power of a wind turbine, and is related to the factory design parameters of the wind turbine itself.
如果错误类型为缺失错误的数据段不是风速参数,也不是功率参数,由于风电系统数据通常会采用高采样率进行数据采集,因此可以通过插值方式进行数据填补,具体可以使用缺失数据段所属时间点前、后相应的数据段进行插值填补。如果缺失数据段所属时间点前、后相应的数据段也缺失,则可以利用随机数进行插值填补,其中随机数来自该数据段正态分布3sigma范围内。If the error type is missing and the wrong data segment is not a wind speed parameter or a power parameter, since wind power system data usually uses a high sampling rate for data collection, data can be filled by interpolation, and the time point of the missing data segment can be used. The corresponding data segments before and after are interpolated and filled. If the corresponding data segments before and after the time point to which the missing data segment belongs are also missing, random numbers can be used for interpolation and filling, where the random numbers come from the 3sigma range of the normal distribution of the data segment.
(2)对错误类型为类型错误的数据段进行数据类型转换或删除。(2) Data type conversion or deletion is performed on the data segment whose error type is type error.
(3)对错误类型为数值错误的数据段进行删除。(3) Delete the data segment whose error type is numerical error.
(4)对错误类型为规则错误的数据段进行数据转换或删除。(4) Data conversion or deletion is performed on the data segment whose error type is rule error.
比如,风机状态码列应为整数,若对应的数据段不是整数,则对其值进行取整处理,若对应的数据段是字符串,无法进行取整处理,则删除该数据段。For example, the fan status code column should be an integer. If the corresponding data segment is not an integer, then its value will be rounded. If the corresponding data segment is a character string and the rounding process cannot be performed, the data segment will be deleted.
(5)对于错误类型为重复错误的数据段:检查对应不同传感器的所有数据段是否均持续重复超过设定时间(比如30分钟),如果是,则删除所述设定时间内重复的数据;否则检查错误类型为重复错误的数据段是否为风速参数、或风向参数、或温度参数,如果是,则删除重复的数据段。(5) For the data segment whose error type is repeated error: check whether all data segments corresponding to different sensors continue to repeat for more than a set time (for example, 30 minutes), if so, delete the repeated data within the set time; Otherwise, check whether the data segment whose error type is repeated error is a wind speed parameter, or a wind direction parameter, or a temperature parameter, and if it is, delete the repeated data segment.
需要说明的是,在实际应用中,可以根据需要采用上述任意一种或多种处理方式,而且对不同类型的错误数据的处理顺序不做要求。当然,本发明实施例也并不仅限于上述这些处理,针对其他的错误类型,还可以有相应的处理方式。It should be noted that in practical applications, any one or more of the above-mentioned processing methods can be adopted as required, and the processing sequence of different types of error data is not required. Of course, the embodiments of the present invention are not limited to the above-mentioned processing, and there may be corresponding processing methods for other error types.
步骤103,基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。Step 103: Correct the processed system data based on the mechanism model to obtain corrected system data.
针对风电系统采集的数据特点,比如,风速、有功功率、齿轮箱油温、齿轮箱轴承温度、发电机轴承温度、机舱温度、机舱控制柜温度等,这些数据理论上应该符合一定的关系,满足相应的机理模型,比如:风功率模型、能量守恒模型、环境风模型等。因此,本发明实施例的方法中,可以分别利用上述各机理模型对所述系统数据中的一些数据段进行修正,主要有以下三种处理方式:According to the characteristics of the data collected by the wind power system, such as wind speed, active power, gearbox oil temperature, gearbox bearing temperature, generator bearing temperature, engine room temperature, engine room control cabinet temperature, etc., these data should theoretically conform to a certain relationship and satisfy Corresponding mechanism models, such as: wind power model, energy conservation model, environmental wind model, etc. Therefore, in the method of the embodiment of the present invention, the above-mentioned mechanism models can be used to modify some data segments in the system data, and there are mainly the following three processing methods:
(1)利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数。(1) Using the environmental wind model to determine the abnormal wind speed parameter in the processed system data, and delete the abnormal wind speed parameter.
比如,风速无变化,但风向变化超过1度时,则相应的风向参数为异常数据,对其进行删除。For example, if there is no change in wind speed, but the change in wind direction exceeds 1 degree, the corresponding wind direction parameter is abnormal data and it is deleted.
(2)利用风功率模型对所述处理后的系统数据中的功率参数进行修正。(2) Using the wind power model to correct the power parameters in the processed system data.
利用风功率模型对所述处理后的系统数据中的功率参数进行修正的具体流程可参见图2所示,包括以下步骤:The specific process of using the wind power model to correct the power parameters in the processed system data can be seen in Figure 2, which includes the following steps:
步骤201,依次检查每个时间点的数据是否符合风功率模型;如果否,则执行步骤202;否则结束。Step 201: Check in turn whether the data at each time point conforms to the wind power model; if not, perform step 202; otherwise, end.
步骤202,判断当前时间点的数据中的风速参数是否小于切入风速;如果是,则执行步骤203;否则,执行步骤204;Step 202: Determine whether the wind speed parameter in the data at the current time point is less than the cut-in wind speed; if yes, go to step 203; otherwise, go to step 204;
步骤203,将当前时间点的数据中的功率参数修改为0;Step 203: Modify the power parameter in the data at the current time point to 0;
步骤204,判断当前时间点的数据中的风速参数是否小于额定风速;如果是,则执行步骤205;否则执行步骤206;Step 204: Determine whether the wind speed parameter in the data at the current time point is less than the rated wind speed; if yes, go to step 205; otherwise, go to step 206;
步骤205,将当前时间点的数据中的功率参数删除;Step 205: Delete the power parameter in the data at the current time point;
步骤206,检查当前时间点的数据中的风机状态码是否为限定功率码;如果是,则执行步骤207;否则执行步骤208; Step 206, check whether the wind turbine status code in the data at the current time point is a limited power code; if yes, go to step 207; otherwise, go to step 208;
步骤207,将当前时间点的数据中的功率参数修改为限定功率;Step 207: Modify the power parameter in the data at the current time point to a limited power;
步骤208,将当前时间点的数据中的功率参数修改为满发功率。Step 208: Modify the power parameter in the data at the current time point to full power.
(3)利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数。(3) Use the energy conservation model to determine the abnormal temperature parameter in the processed system data, and delete the abnormal temperature parameter.
比如,对风机系统中的机舱、控制柜、齿轮箱等近似封闭空间建立能量守恒模型,当采集的实际数据不符合能量守恒模型时,删除相应数据。For example, an energy conservation model is established for the approximately enclosed space such as the nacelle, control cabinet, and gear box in the wind turbine system. When the actual data collected does not conform to the energy conservation model, the corresponding data is deleted.
需要说明的是,在实际应用中,上述利用不同模型对系统数据进行修正处理的顺序不做限定。It should be noted that, in practical applications, the above-mentioned sequence of correcting system data using different models is not limited.
本发明实施例提供的提高风电系统数据质量的方法,针对采集的风电系统数据,确定其中的错误数据及错误类型,首先根据不同的错误类型对其中的错误数据进行处理,得到处理后的系统数据;然后再针对处理后的系统数据,基于机理模型进行数据修正,得到修正后的系统数据。本发明实施例的方案不仅对数据采集及传送过程中导致的错误数据进行处理,而且根据风电系统数据的特点,基于机理模型对系统数据进行修正,从而使最终得到的系统数据具有更高的质量,有效地避免了低质量的源数据对后续系统运维管理等工作产生干扰,保证风电机组的正常运行。The method for improving the data quality of the wind power system provided by the embodiment of the present invention determines the error data and the error type in the collected wind power system data. First, the error data is processed according to the different error types to obtain the processed system data ; Then for the processed system data, data correction is made based on the mechanism model to obtain the corrected system data. The solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
如图3所示,是本发明实施例提高风电系统数据质量的方法的另一种流程图,包括以下步骤:As shown in Figure 3, it is another flow chart of the method for improving the data quality of the wind power system according to the embodiment of the present invention, which includes the following steps:
步骤301,确定采集的系统数据中的错误数据及其错误类型。Step 301: Determine the error data and the error type in the collected system data.
步骤302,根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据。Step 302: Process the error data in the system data according to the error type to obtain processed system data.
步骤303,基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。Step 303: Correct the processed system data based on the mechanism model to obtain corrected system data.
上述步骤301至步骤303与前面图1中的步骤101至步骤103相同,在此不再赘述。The above steps 301 to 303 are the same as the steps 101 to 103 in FIG. 1 and will not be repeated here.
步骤304,基于数学模型确定所述修正后的系统数据中的异常数据。Step 304: Determine abnormal data in the corrected system data based on a mathematical model.
具体地,可以基于以下任意一种或多种数学模型确定所述修正后的系统数据中的异常数据:聚类模型、残差模型、以及集群模型。Specifically, the abnormal data in the revised system data may be determined based on any one or more of the following mathematical models: a clustering model, a residual model, and a clustering model.
需要说明的是,如果基于以上两种或两种以上数学模型来确定异常数据,则可以将根据不同模型得到的异常数据的并集作为最终的异常数据,或者将不同模型得到的异常数据进行融合计算(比如加权计算),根据计算结果确定最终的异常数据。It should be noted that if the abnormal data is determined based on the above two or more mathematical models, the union of the abnormal data obtained according to different models can be used as the final abnormal data, or the abnormal data obtained from different models can be merged Calculation (such as weighted calculation), according to the calculation result to determine the final abnormal data.
所述聚类模型具体可以包括以下任意一种或多种:k-means模型,DBSCAN(Density-Based Spatial Clustering of Applications with Noise)模型;所述聚类模型的输入为所述修正后的系统数据,输出为不同时间点对应的数据类别及其数量。The clustering model may specifically include any one or more of the following: k-means model, DBSCAN (Density-Based Spatial Clustering of Applications with Noise) model; the input of the clustering model is the corrected system data , The output is the data category and quantity corresponding to different time points.
如果某类别数量小于设定值(比如3),则确定该类别数据为异常数据。If the number of a certain category is less than the set value (for example, 3), the category data is determined to be abnormal data.
所述残差模型具体可以包括以下任意一种或多种:线性回归、支持向量机、决策树、神经网络;所述残差模型的输入为所述修正后的系统数据,输出为不同时间点对应的残差。The residual model may specifically include any one or more of the following: linear regression, support vector machine, decision tree, neural network; the input of the residual model is the corrected system data, and the output is different time points The corresponding residual.
如果对应某时间点的残差超过设定阈值,则确定该时间点的所有数据为异常数据。If the residual error corresponding to a certain time point exceeds the set threshold, all data at that time point is determined to be abnormal data.
风力发电厂中,一个风场经常包括几十台风机,风机外部环境相似,因此同一风场不同风机的运行状态具有相似性。基于这一特点,本发明实施例中,将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数(比如温度、风速、转速、功率等)的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。In a wind power plant, a wind farm often includes dozens of wind turbines, and the external environment of the wind turbines is similar. Therefore, the operating states of different wind turbines in the same wind farm are similar. Based on this feature, in the embodiment of the present invention, the wind turbine system is taken as a cluster model, and it is judged that each column of data of all wind turbines contained in each model in the cluster model and the corresponding average parameters of the wind field (such as temperature, wind speed, rotation speed, power) Etc.) Whether the difference value exceeds the set threshold, if it exceeds the set threshold, it is determined that the data at the time point of the column of data is abnormal data.
步骤305,去除所述异常数据,得到优化后的系统数据。Step 305: Remove the abnormal data to obtain optimized system data.
本发明实施例提供的提高风电系统数据质量的方法,不仅对风电系统中数据采集及传送过程中导致的错误数据进行处理,而且根据风电系统数据的特点,基于机理模型对系统数据进行修正,基于一种或多种数学模型对修正后的系统数据进行异常数据检测,并去除检测到的异常数据,使系统数据得到进一步优化,可以更好地保证系统数据的质量。The method for improving the data quality of the wind power system provided by the embodiment of the present invention not only processes the erroneous data caused in the data collection and transmission process of the wind power system, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data. One or more mathematical models perform abnormal data detection on the corrected system data, and remove the detected abnormal data, so that the system data is further optimized, and the quality of the system data can be better guaranteed.
相应地,本发明实施例还提供一种提高风电系统数据质量的装置,如图4所示,是本发明实施提高风电系统数据质量的装置的一种结构框图。Correspondingly, the embodiment of the present invention also provides a device for improving the data quality of the wind power system. As shown in FIG. 4, it is a structural block diagram of the device for improving the data quality of the wind power system implemented by the present invention.
在该实施例中,所述装置包括以下各模块:In this embodiment, the device includes the following modules:
错误数据检测模块401,用于确定采集的系统数据中的错误数据及其错误类型;所述系统数据包括:一定时间段内对应各时间点的数据,每个 时间点的数据包括一个或多个对应不同传感器的数据段;The error data detection module 401 is used to determine the error data and the error type in the collected system data; the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more Corresponding to the data segment of different sensors;
数据处理模块402,用于根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据;The data processing module 402 is configured to process the error data in the system data according to the error type to obtain processed system data;
数据修正模块403,用于基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。The data correction module 403 is used to correct the processed system data based on the mechanism model to obtain the corrected system data.
在实际应用中,上述错误数据检测模块401可以利用现有技术来确定所述系统数据中的错误数据及其错误类型。其中,所述错误类型主要有以下几种:缺失错误、类型错误、数值错误、规则错误、重复错误。当然,根据实际应用需要,还可以有其他错误类型,对此本发明实施例不做限定。In practical applications, the above-mentioned error data detection module 401 can use existing technology to determine the error data and the error type in the system data. Among them, the error types mainly include the following types: missing errors, type errors, numerical errors, rule errors, and repeated errors. Of course, according to actual application requirements, there may be other error types, which are not limited in the embodiment of the present invention.
上述各错误类型的含义在前面已有详细说明,在此不再赘述。The meaning of the above error types has been explained in detail above, so I won't repeat them here.
另外,为了更准确、全面地确定其中的错误数据,上述错误数据检测模块401还可以按以下方式确定错误数据及其错误类型:In addition, in order to more accurately and comprehensively determine the error data therein, the above-mentioned error data detection module 401 can also determine the error data and its error type in the following manner:
横向检测:以时间点为单位,依次对各时间点的数据进行检测,标记所述数据中异常数据段的位置,并记录错误类型;Horizontal detection: using time point as a unit, sequentially detect data at each time point, mark the position of the abnormal data segment in the data, and record the error type;
纵向检测:以数据段为单位,依次对所述一定时间段内的各数据段进行检测,并记录异常数据段及错误类型。Longitudinal detection: Take the data segment as a unit, sequentially detect each data segment within the certain time period, and record the abnormal data segment and the error type.
上述横向检测及纵向检测的方式及过程在前面已有详细描述,在此不再赘述。The above-mentioned horizontal detection and vertical detection methods and processes have been described in detail above, and will not be repeated here.
针对不同错误类型的错误数据,所述数据处理模块402可以采用不同的处理方式。比如,对所述系统中的错误数据可以进行以下任意一种或多种处理:For the error data of different error types, the data processing module 402 can adopt different processing methods. For example, any one or more of the following processing can be performed on the error data in the system:
对错误类型为缺失错误的数据段进行数据填补,比如如果错误类型为缺失错误的数据段是风速参数或功率参数,则根据风功率模型进行数据填补;否则,通过插值进行数据填补;Data filling is performed on the data segment whose error type is missing error. For example, if the data segment with the error type of missing error is a wind speed parameter or power parameter, data filling is performed according to the wind power model; otherwise, data filling is performed through interpolation;
对错误类型为数值错误的数据段进行数据类型转换或删除;Data type conversion or deletion of the data segment whose error type is numerical error;
对错误类型为规则错误的数据段进行删除;Delete the data segment whose error type is rule error;
如果对应不同传感器的所有数据段均持续重复超过设定时间,则删除所述设定时间内重复的数据;If all data segments corresponding to different sensors continue to repeat more than the set time, then delete the repeated data within the set time;
如果错误类型为重复错误的数据段为风速参数、或风向参数、或温度参数,则删除重复的数据段。If the data segment whose error type is repeated error is a wind speed parameter, or wind direction parameter, or temperature parameter, the repeated data segment is deleted.
根据风电系统采集的数据特点,上述数据修正模块403可以利用多种各机理模型对所述系统数据中的一些数据段进行修正,比如,所述数据修正模块403的一种具体结构如图5所示,包括以下各单元:According to the characteristics of the data collected by the wind power system, the data correction module 403 can use various mechanism models to correct some data segments in the system data. For example, a specific structure of the data correction module 403 is shown in Figure 5. Show, including the following units:
风速参数修正单元433,用于利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数;The wind speed parameter correction unit 433 is configured to determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter;
功率参数修正单元431,用于利用风功率模型对所述处理后的系统数据中的功率参数进行修正;The power parameter correction unit 431 is configured to correct the power parameters in the processed system data by using the wind power model;
温度参数修正单元432,用于利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数。The temperature parameter correction unit 432 is configured to determine an abnormal temperature parameter in the processed system data by using an energy conservation model, and delete the abnormal temperature parameter.
环境风模型其中,所述功率参数修正单元包括:检查单元和修正子单元,其中:The environmental wind model, wherein the power parameter correction unit includes: an inspection unit and a correction subunit, wherein:
所述检查子单元用于依次检查每个时间点的数据是否符合风功率模型;The checking subunit is used to check whether the data at each time point conforms to the wind power model in turn;
所述修正子单元用于在所述检查子单元确定当前时间点的数据不符合风功率模型后,根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数。具体地,在当前时间点的数据中的风速参数小于切入风速时,将当前时间点的数据中的功率参数修改为0;在当前时间点的数据中的风速参数大于切入风速并且小于额定风速时,将当前时间点的数据中的功率参数删除;在当前时间点的数据中的风速参数大于额定风速时,检查当前时间点的数据中的风机状态码是否为限定功率码;如果是,则将当前时间点的数据中的功率参数修改为限定功率;否则,将当前时间点的数据中的功率参数修改为满发功率。具体修正过程可参考前面图2所示实施例。The correction subunit is used for correcting the power in the data at the current time point according to the wind speed parameters and the wind turbine status code parameters in the data at the current time point after the checking subunit determines that the data at the current time point does not conform to the wind power model parameter. Specifically, when the wind speed parameter in the data at the current time point is less than the cut-in wind speed, the power parameter in the data at the current time point is modified to 0; when the wind speed parameter in the data at the current time point is greater than the cut-in wind speed and less than the rated wind speed , Delete the power parameter in the data at the current time point; when the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the status code of the wind turbine in the data at the current time point is a limited power code; if so, change The power parameter in the data at the current time point is modified to the limited power; otherwise, the power parameter in the data at the current time point is modified to the full power. The specific correction process can refer to the embodiment shown in Figure 2 above.
本发明实施例提供的提高风电系统数据质量的装置,针对采集的风电系统数据,确定其中的错误数据及错误类型,首先根据不同的错误类型对其中的错误数据进行处理,得到处理后的系统数据;然后再针对处理后的 系统数据,基于机理模型进行数据修正,得到修正后的系统数据。本发明实施例的方案不仅对数据采集及传送过程中导致的错误数据进行处理,而且根据风电系统数据的特点,基于机理模型对系统数据进行修正,从而使最终得到的系统数据具有更高的质量,有效地避免了低质量的源数据对后续系统运维管理等工作产生干扰,保证风电机组的正常运行。The device for improving the data quality of the wind power system provided by the embodiment of the present invention determines the error data and the error type in the collected wind power system data, and first processes the error data according to the different error types to obtain the processed system data ; Then for the processed system data, data correction is made based on the mechanism model to obtain the corrected system data. The solution of the embodiment of the present invention not only processes the erroneous data caused in the process of data collection and transmission, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data, so that the finally obtained system data has higher quality , Effectively avoiding low-quality source data from interfering with subsequent system operation and maintenance management, and ensuring the normal operation of wind turbines.
如图6所示,是本发明实施例提高风电系统数据质量的装置的另一种结构框图。As shown in FIG. 6, it is another structural block diagram of an apparatus for improving data quality of a wind power system according to an embodiment of the present invention.
与图4所示实施例不同的是,在该实施例中,所述装置还包括以下各模块:Different from the embodiment shown in FIG. 4, in this embodiment, the device further includes the following modules:
异常数据检测模块404,用于基于数学模型确定所述修正后的系统数据中的异常数据;An abnormal data detection module 404, configured to determine abnormal data in the corrected system data based on a mathematical model;
异常数据清理模块405,用于去除所述异常数据,得到优化后的系统数据。The abnormal data cleaning module 405 is used to remove the abnormal data to obtain optimized system data.
在实际应用中,所述异常数据检测模块404可以基于一种或多种数学模型确定所述修正后的系统数据中的异常数据,比如,所述异常数据检测模块404的一种具体结构可以包括但不限于以下任意一种或多种模块:In practical applications, the abnormal data detection module 404 may determine the abnormal data in the corrected system data based on one or more mathematical models. For example, a specific structure of the abnormal data detection module 404 may include But not limited to any one or more of the following modules:
第一检测模块,用于基于聚类模型检测所述修正后的系统数据中的异常数据;The first detection module is configured to detect abnormal data in the corrected system data based on the clustering model;
第二检测模块,用于基于残差模型检测所述修正后的系统数据中的异常数据;The second detection module is configured to detect abnormal data in the corrected system data based on the residual model;
第三检测模块,用于基于集群模型检测所述修正后的系统数据中的异常数据。The third detection module is used to detect abnormal data in the corrected system data based on the cluster model.
其中,所述聚类模型可以包括但不限于以下任意一种或多种:k-means模型,DBSCAN模型;所述聚类模型的输入为所述修正后的系统数据,输出为不同时间点对应的数据类别及其数量。The clustering model may include, but is not limited to, any one or more of the following: k-means model, DBSCAN model; the input of the clustering model is the corrected system data, and the output is corresponding to different time points The data category and its quantity.
所述残差模型可以包括但不限于以下任意一种或多种:线性回归,支持向量机,决策树,神经网络;所述残差模型的输入为所述修正后的系统数据,输出为不同时间点对应的残差。The residual model may include, but is not limited to, any one or more of the following: linear regression, support vector machine, decision tree, neural network; the input of the residual model is the corrected system data, and the output is different The residual error corresponding to the time point.
所述第三检测模块具体用于将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。The third detection module is specifically used to use the wind turbine system as a cluster model to determine whether the difference between the data of all the wind turbines contained in each model in the cluster model and the corresponding average parameter of the wind farm exceeds a set threshold, and if it exceeds Set the threshold to determine that the data at the time point of the column of data is abnormal data.
需要说明的是,对于上述提高风电系统数据质量的装置各实施例而言,由于各模块、单元的功能实现与相应的方法中类似,因此对所述对话生成装置各实施例描述得比较简单,相关之处可参见方法实施例的相应部分说明。It should be noted that, for each embodiment of the device for improving the data quality of the wind power system, since the function implementation of each module and unit is similar to that of the corresponding method, the description of each embodiment of the dialog generating device is relatively simple. For related details, please refer to the description of the corresponding part of the method embodiment.
本发明实施例提供的提高风电系统数据质量的装置,不仅对风电系统中数据采集及传送过程中导致的错误数据进行处理,而且根据风电系统数据的特点,基于机理模型对系统数据进行修正,基于一种或多种数学模型对修正后的系统数据进行异常数据检测,并去除检测到的异常数据,使系统数据得到进一步优化,可以更好地保证系统数据的质量。The device for improving the data quality of the wind power system provided by the embodiment of the present invention not only processes the erroneous data caused in the data collection and transmission process of the wind power system, but also corrects the system data based on the mechanism model according to the characteristics of the wind power system data. One or more mathematical models perform abnormal data detection on the corrected system data, and remove the detected abnormal data, so that the system data is further optimized, and the quality of the system data can be better guaranteed.
需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It should be noted that the terms "first" and "second" in the specification and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, and not necessarily used to describe a specific sequence or sequence. It should be understood that the data used in this way can be interchanged under appropriate circumstances so that the embodiments of the present invention described herein can be implemented in an order other than those illustrated or described herein. In addition, the terms "including" and "having" and any variations of them are intended to cover non-exclusive inclusions. For example, a process, method, system, product or device that includes a series of steps or units is not necessarily limited to the clearly listed Those steps or units may include other steps or units that are not clearly listed or are inherent to these processes, methods, products, or equipment.
本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。而且,以上所描述的系统实施例仅仅是示意性的,其中作为分离部件说明的模块和单元可以是或者也可以不是物理上分开的,即可以位于一个网络单元上,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。The various embodiments in this specification are described in a progressive manner, and the same or similar parts between the various embodiments can be referred to each other, and each embodiment focuses on the differences from other embodiments. Moreover, the system embodiments described above are only illustrative, and the modules and units described as separate components may or may not be physically separated, that is, they may be located on one network unit, or may be distributed to multiple On the network unit. Some or all of the modules may be selected according to actual needs to achieve the objectives of the solutions of the embodiments. Those of ordinary skill in the art can understand and implement it without creative work.
本领域普通技术人员可以理解实现上述方法实施方式中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,所述的程序可以存储于计算机可读取存储介质中,这里所称的存储介质,如:ROM/RAM、磁碟、光盘等。A person of ordinary skill in the art can understand that all or part of the steps in the above-mentioned method embodiments can be implemented by a program instructing relevant hardware. The program can be stored in a computer-readable storage medium, which is referred to as storage herein. Medium, such as: ROM/RAM, magnetic disk, optical disk, etc.
相应地,本发明实施例还提供一种用于提高风电系统数据质量的方法的装置,该装置是一种电子设备,比如,可以是移动终端、计算机、平板设备、医疗设备、健身设备、个人数字助理等。所述电子设备可以包括一个或多个处理器、存储器;其中,所述存储器用于存储计算机可执行指令,所述处理器用于执行所述计算机可执行指令,以实现前面各实施例所述的方法。Correspondingly, an embodiment of the present invention also provides a device for improving the data quality of a wind power system. The device is an electronic device, such as a mobile terminal, a computer, a tablet device, a medical device, a fitness device, or a personal computer. Digital assistants, etc. The electronic device may include one or more processors and memories; wherein the memory is used to store computer executable instructions, and the processor is used to execute the computer executable instructions to implement the foregoing method.
以上对本发明实施例进行了详细介绍,本文中应用了具体实施方式对本发明进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及装置,其仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围,本说明书内容不应理解为对本发明的限制。因此,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The embodiments of the present invention are described in detail above, and specific implementations are used to illustrate the present invention. The descriptions of the above embodiments are only used to help understand the methods and devices of the present invention, and they are only part of the embodiments of the present invention. Not all examples. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention, and the contents of this specification should not be construed as limiting the present invention. Therefore, any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (18)

  1. 一种提高风电系统数据质量的方法,其特征在于,所述方法包括:A method for improving data quality of a wind power system, characterized in that the method includes:
    确定采集的系统数据中的错误数据及其错误类型;所述系统数据包括:一定时间段内对应各时间点的数据,每个时间点的数据包括一个或多个对应不同传感器的数据段;Determine the error data and error types in the collected system data; the system data includes: data corresponding to each time point in a certain time period, and the data at each time point includes one or more data segments corresponding to different sensors;
    根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据;Processing the error data in the system data according to the error type to obtain processed system data;
    基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。Based on the mechanism model, the processed system data is corrected to obtain the corrected system data.
  2. 根据权利要求1所述的方法,其特征在于,所述错误类型包括以下任意一种或多种:缺失错误、类型错误、数值错误、规则错误、重复错误;The method according to claim 1, wherein the error types include any one or more of the following: missing errors, type errors, numerical errors, rule errors, and repeated errors;
    所述根据所述错误类型对所述系统中的错误数据进行处理包括以下任意一种或多种:The processing of the error data in the system according to the error type includes any one or more of the following:
    对错误类型为缺失错误的数据段进行数据填补;Data filling is performed on the data segment whose error type is missing error;
    对错误类型为类型错误的数据段进行数据类型转换或删除;Data type conversion or deletion of the data segment whose error type is type error;
    对错误类型为数值错误的数据段进行删除;Delete the data segment whose error type is numerical error;
    对错误类型为规则错误的数据段进行数据转换或删除;Data conversion or deletion of data segments whose error types are rule errors;
    对于错误类型为重复错误的数据段,如果对应不同传感器的所有数据段均持续重复超过设定时间,则删除所述设定时间内重复的数据;如果错误类型为重复错误的数据段为风速参数、或风向参数、或温度参数,则删除重复的数据段。For the data segment whose error type is repeated error, if all data segments corresponding to different sensors continue to repeat for more than the set time, the repeated data within the set time will be deleted; if the error type is repeated error data segment is the wind speed parameter , Or wind direction parameter, or temperature parameter, delete the repeated data segment.
  3. 根据权利要求2所述的方法,其特征在于,所述对错误类型为缺失错误的数据段进行数据填补包括:The method according to claim 2, wherein said performing data padding on the data segment whose error type is missing error comprises:
    如果错误类型为缺失错误的数据段是风速参数或功率参数,则根据风功率模型进行数据填补;If the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
    否则,通过插值进行数据填补。Otherwise, the data is filled by interpolation.
  4. 根据权利要求1所述的方法,其特征在于,所述基于机理模型对所述处理后的系统数据进行修正包括:The method according to claim 1, wherein the correcting the processed system data based on the mechanism model comprises:
    利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数;Determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter;
    利用风功率模型对所述处理后的系统数据中的功率参数进行修正;Correcting the power parameters in the processed system data by using the wind power model;
    利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数。The energy conservation model is used to determine the abnormal temperature parameter in the processed system data, and delete the abnormal temperature parameter.
  5. 根据权利要求4所述的方法,其特征在于,所述利用风功率模型对所述处理后的系统数据中的功率参数进行修正包括:The method according to claim 4, wherein said correcting the power parameter in the processed system data by using a wind power model comprises:
    依次检查每个时间点的数据是否符合风功率模型;Check in turn whether the data at each time point conforms to the wind power model;
    如果当前时间点的数据不符合风功率模型,则根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数。If the data at the current time point does not conform to the wind power model, the power parameters in the current time point data are corrected according to the wind speed parameters and the wind turbine status code parameters in the current time point data.
  6. 根据权利要求5所述的方法,其特征在于,所述根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数包括:The method according to claim 5, wherein the correcting the power parameter in the data at the current time point according to the wind speed parameter and the wind turbine status code parameter in the data at the current time point comprises:
    如果当前时间点的数据中的风速参数小于切入风速,则将当前时间点的数据中的功率参数修改为0;If the wind speed parameter in the data at the current time point is less than the cut-in wind speed, modify the power parameter in the data at the current time point to 0;
    如果当前时间点的数据中的风速参数大于切入风速并且小于额定风速,则将当前时间点的数据中的功率参数删除;If the wind speed parameter in the data at the current time point is greater than the cut-in wind speed and less than the rated wind speed, delete the power parameter in the data at the current time point;
    如果当前时间点的数据中的风速参数大于额定风速,则检查当前时间点的数据中的风机状态码是否为限定功率码;If the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the wind turbine status code in the data at the current time point is a limited power code;
    如果是,则将当前时间点的数据中的功率参数修改为限定功率;If so, modify the power parameter in the data at the current time point to the limited power;
    否则,将当前时间点的数据中的功率参数修改为满发功率。Otherwise, modify the power parameter in the data at the current time point to full power.
  7. 根据权利要求1至6任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1 to 6, wherein the method further comprises:
    基于数学模型确定所述修正后的系统数据中的异常数据;Determining abnormal data in the revised system data based on a mathematical model;
    去除所述异常数据,得到优化后的系统数据。Remove the abnormal data to obtain optimized system data.
  8. 根据权利要求7所述的方法,其特征在于,所述基于数学模型确定所述修正后的系统数据中的异常数据包括:The method according to claim 7, wherein the determining abnormal data in the corrected system data based on a mathematical model comprises:
    基于以下任意一种或多种数学模型确定所述修正后的系统数据中的异 常数据:聚类模型、残差模型、以及集群模型。The abnormal data in the revised system data is determined based on any one or more of the following mathematical models: clustering model, residual model, and clustering model.
  9. 根据权利要求8所述的方法,其特征在于,基于集群系统确定所述修正后的系统中的异常数据包括:The method according to claim 8, wherein the determining the abnormal data in the corrected system based on the cluster system comprises:
    将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。Regarding the wind turbine system as a cluster model, determine whether the difference between each column of data of all wind turbines contained in each model in the cluster model and the corresponding average parameter of the wind farm exceeds the set threshold. If it exceeds the set threshold, determine the column of data The data at the time point is abnormal data.
  10. 一种提高风电系统数据质量的装置,其特征在于,所述装置包括:A device for improving data quality of a wind power system, characterized in that the device comprises:
    错误数据检测模块,用于确定采集的系统数据中的错误数据及其错误类型;所述系统数据包括:一定时间段内对应各时间点的数据,每个时间点的数据包括一个或多个对应不同传感器的数据段;The error data detection module is used to determine the error data and the error type in the collected system data; the system data includes: data corresponding to each time point within a certain period of time, and each time point data includes one or more corresponding Data segments of different sensors;
    数据处理模块,用于根据所述错误类型对所述系统数据中的错误数据进行处理,得到处理后的系统数据;A data processing module, configured to process the error data in the system data according to the error type to obtain processed system data;
    数据修正模块,用于基于机理模型对所述处理后的系统数据进行修正,得到修正后的系统数据。The data correction module is used to correct the processed system data based on the mechanism model to obtain the corrected system data.
  11. 根据权利要求10所述的装置,其特征在于,所述错误类型包括以下任意一种或多种:缺失错误、类型错误、数值错误、规则错误、重复错误;The device according to claim 10, wherein the error type includes any one or more of the following: missing error, type error, numerical error, rule error, and repeated error;
    所述数据处理模块,具体用于对所述系统中的错误数据进行以下任意一种或多种处理:The data processing module is specifically configured to perform any one or more of the following processing on the error data in the system:
    对错误类型为缺失错误的数据段进行数据填补;Data filling is performed on the data segment whose error type is missing error;
    对错误类型为类型错误的数据段进行数据类型转换或删除;Data type conversion or deletion of the data segment whose error type is type error;
    对错误类型为数值错误的数据段进行删除;Delete the data segment whose error type is numerical error;
    对错误类型为规则错误的数据段进行数据转换或删除;Data conversion or deletion of data segments whose error types are rule errors;
    如果对应不同传感器的所有数据段均持续重复超过设定时间,则删除所述设定时间内重复的数据;If all data segments corresponding to different sensors continue to repeat more than the set time, then delete the repeated data within the set time;
    如果错误类型为重复错误的数据段为风速参数、或风向参数、或温度参数,则删除重复的数据段。If the data segment whose error type is repeated error is a wind speed parameter, or wind direction parameter, or temperature parameter, the repeated data segment is deleted.
  12. 根据权利要求11所述的装置,其特征在于,所述数据处理模块按 以下方式对错误类型为缺失错误的数据段进行数据填补:The device according to claim 11, wherein the data processing module performs data padding on data segments whose error types are missing errors in the following manner:
    如果错误类型为缺失错误的数据段是风速参数或功率参数,则根据风功率模型进行数据填补;If the data segment whose error type is missing error is a wind speed parameter or a power parameter, the data shall be filled according to the wind power model;
    否则,通过插值进行数据填补。Otherwise, the data is filled by interpolation.
  13. 根据权利要求10所述的装置,其特征在于,所述数据修正模块包括:The device according to claim 10, wherein the data correction module comprises:
    风速参数修正单元,用于利用环境风模型确定所述处理后的系统数据中异常的风速参数,并删除所述异常的风速参数。The wind speed parameter correction unit is used to determine the abnormal wind speed parameter in the processed system data by using the environmental wind model, and delete the abnormal wind speed parameter.
    功率参数修正单元,用于利用风功率模型对所述处理后的系统数据中的功率参数进行修正;A power parameter correction unit, configured to use a wind power model to correct the power parameters in the processed system data;
    温度参数修正单元,用于利用能量守恒模型确定所述处理后的系统数据中异常的温度参数,并删除所述异常的温度参数;The temperature parameter correction unit is used to determine the abnormal temperature parameter in the processed system data by using the energy conservation model, and delete the abnormal temperature parameter;
  14. 根据权利要求13所述的装置,其特征在于,所述功率参数修正单元包括:The device according to claim 13, wherein the power parameter correction unit comprises:
    检查子单元,用于依次检查每个时间点的数据是否符合风功率模型;The check subunit is used to check whether the data at each time point conforms to the wind power model;
    修正子单元,用于在所述检查子单元确定当前时间点的数据不符合风功率模型后,根据当前时间点的数据中的风速参数及风机状态码参数修正当前时间点的数据中的功率参数。The correction subunit is used to correct the power parameters in the data at the current time point according to the wind speed parameters and the wind turbine status code parameters in the data at the current time point after the checking subunit determines that the data at the current time point does not conform to the wind power model .
  15. 根据权利要求14所述的装置,其特征在于,The device of claim 14, wherein:
    所述修正子单元,具体用于在当前时间点的数据中的风速参数小于切入风速时,将当前时间点的数据中的功率参数修改为0;在当前时间点的数据中的风速参数大于切入风速并且小于额定风速时,将当前时间点的数据中的功率参数删除;在当前时间点的数据中的风速参数大于额定风速时,检查当前时间点的数据中的风机状态码是否为限定功率码;如果是,则将当前时间点的数据中的功率参数修改为限定功率;否则,将当前时间点的数据中的功率参数修改为满发功率。The correction subunit is specifically used to modify the power parameter in the data at the current time point to 0 when the wind speed parameter in the data at the current time point is less than the cut-in wind speed; the wind speed parameter in the data at the current time point is greater than the cut-in wind speed When the wind speed is less than the rated wind speed, delete the power parameter in the data at the current time point; when the wind speed parameter in the data at the current time point is greater than the rated wind speed, check whether the fan status code in the data at the current time point is a limited power code ; If yes, modify the power parameter in the data at the current time point to the limited power; otherwise, modify the power parameter in the data at the current time point to full power.
  16. 根据权利要求10至15任一项所述的装置,其特征在于,所述装置还包括:The device according to any one of claims 10 to 15, wherein the device further comprises:
    异常数据检测模块,用于基于数学模型确定所述修正后的系统数据中的异常数据;An abnormal data detection module for determining abnormal data in the corrected system data based on a mathematical model;
    异常数据清理模块,用于去除所述异常数据,得到优化后的系统数据。The abnormal data cleaning module is used to remove the abnormal data to obtain optimized system data.
  17. 根据权利要求16所述的装置,其特征在于,所述异常数据检测模块包括以下任意一种或多种模块:The device according to claim 16, wherein the abnormal data detection module comprises any one or more of the following modules:
    第一检测模块,用于基于聚类模型检测所述修正后的系统数据中的异常数据;The first detection module is configured to detect abnormal data in the corrected system data based on the clustering model;
    第二检测模块,用于基于残差模型检测所述修正后的系统数据中的异常数据;The second detection module is configured to detect abnormal data in the corrected system data based on the residual model;
    第三检测模块,用于基于集群模型检测所述修正后的系统数据中的异常数据。The third detection module is used to detect abnormal data in the corrected system data based on the cluster model.
  18. 根据权利要求17所述的装置,其特征在于,The device according to claim 17, wherein:
    所述第三检测模块,具体用于将风机系统作为一个集群模型,判断集群模型中各模型中包含的所有风机的各列数据与风场相应的平均参数的差值是否超过设定阈值,如果超过设定阈值,则确定该列数据所属时间点的数据为异常数据。The third detection module is specifically used to use the wind turbine system as a cluster model to determine whether the difference between each column of data of all wind turbines included in each model in the cluster model and the corresponding average parameter of the wind field exceeds a set threshold, If the threshold is exceeded, it is determined that the data at the time point of the column of data is abnormal data.
PCT/CN2020/082958 2019-08-14 2020-04-02 Method and apparatus for improving wind power system data quality WO2021027294A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910750791.2A CN110489852A (en) 2019-08-14 2019-08-14 Improve the method and device of the wind power system quality of data
CN201910750791.2 2019-08-14

Publications (1)

Publication Number Publication Date
WO2021027294A1 true WO2021027294A1 (en) 2021-02-18

Family

ID=68549935

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/CN2019/106357 WO2021027011A1 (en) 2019-08-14 2019-09-18 Method and apparatus for improving data quality of wind power system
PCT/CN2020/082958 WO2021027294A1 (en) 2019-08-14 2020-04-02 Method and apparatus for improving wind power system data quality

Family Applications Before (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/106357 WO2021027011A1 (en) 2019-08-14 2019-09-18 Method and apparatus for improving data quality of wind power system

Country Status (2)

Country Link
CN (1) CN110489852A (en)
WO (2) WO2021027011A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489852A (en) * 2019-08-14 2019-11-22 北京天泽智云科技有限公司 Improve the method and device of the wind power system quality of data
CN111323081A (en) * 2020-03-20 2020-06-23 江南造船(集团)有限责任公司 Ship test voyage performance index correction method, correction system, storage medium and electronic terminal
CN114358616A (en) * 2022-01-07 2022-04-15 华润电力技术研究院有限公司 Fan state monitoring method and device and readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103631681A (en) * 2013-12-10 2014-03-12 国家电网公司 Method for online restoring abnormal data of wind power plant
US9354968B2 (en) * 2011-09-30 2016-05-31 Johnson Controls Technology Company Systems and methods for data quality control and cleansing
CN108549689A (en) * 2018-04-12 2018-09-18 华北电力大学 A kind of running of wind generating set data cleaning method
CN110489852A (en) * 2019-08-14 2019-11-22 北京天泽智云科技有限公司 Improve the method and device of the wind power system quality of data

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160179599A1 (en) * 2012-10-11 2016-06-23 University Of Southern California Data processing framework for data cleansing
CN105389634A (en) * 2015-12-01 2016-03-09 广东智造能源科技研究有限公司 Combined short-term wind power prediction system and method
CN107330183B (en) * 2017-06-29 2020-04-17 华北电力大学 Wind power utilization rate calculation method based on operation data
CN107291927B (en) * 2017-06-29 2019-11-05 同济大学 A kind of air speed data cleaning method for Wind turbines wind speed correlation analysis
CN107392304A (en) * 2017-08-04 2017-11-24 中国电力科学研究院 A kind of Wind turbines disorder data recognition method and device
CN113419899A (en) * 2017-08-07 2021-09-21 上海寒武纪信息科技有限公司 Error recovery method and device and processor
US10728282B2 (en) * 2018-01-19 2020-07-28 General Electric Company Dynamic concurrent learning method to neutralize cyber attacks and faults for industrial asset monitoring nodes
CN108646261B (en) * 2018-02-14 2021-03-26 上饶市中科院云计算中心大数据研究院 Data processing method and device for vehicle GPS data set
CN108509752B (en) * 2018-05-02 2021-09-03 东南大学 Superheater mechanism model dynamic parameter setting method combined with field data
CN109740175B (en) * 2018-11-18 2020-12-08 浙江大学 Outlier discrimination method for power curve data of wind turbine generator
CN109783486B (en) * 2019-01-17 2020-11-24 华北电力大学 Data cleaning method and device and server

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9354968B2 (en) * 2011-09-30 2016-05-31 Johnson Controls Technology Company Systems and methods for data quality control and cleansing
CN103631681A (en) * 2013-12-10 2014-03-12 国家电网公司 Method for online restoring abnormal data of wind power plant
CN108549689A (en) * 2018-04-12 2018-09-18 华北电力大学 A kind of running of wind generating set data cleaning method
CN110489852A (en) * 2019-08-14 2019-11-22 北京天泽智云科技有限公司 Improve the method and device of the wind power system quality of data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG HAN-CHAO, KUANG HONG-HAI, WANG JIAN-HUI, ZHU GUO-PING: "Application of Error Correction Algorithm in Short-term Power Forecasting of Wind Power", JOURNAL OF NEW INDUSTRIALIZATION, INSTITUTE OF ELECTRONIC SCIENCE AND TECHNOLOGY INFORMATION, MINISTRY OF INDUSTRY AND INFORMATION TECHNOLOGY, CN, vol. 9, no. 2, 1 January 2019 (2019-01-01), CN, pages 1 - 9, XP055781032, ISSN: 2095-6649, DOI: 10.19335/j.cnki.2095-6649.2019.02.001 *

Also Published As

Publication number Publication date
WO2021027011A1 (en) 2021-02-18
CN110489852A (en) 2019-11-22

Similar Documents

Publication Publication Date Title
KR102455338B1 (en) Defect detection method and apparatus, model training method and apparatus, and device and storage medium
WO2021027294A1 (en) Method and apparatus for improving wind power system data quality
US20220300857A1 (en) System and method for validating unsupervised machine learning models
US11669083B2 (en) System and method for proactive repair of sub optimal operation of a machine
US11933695B2 (en) System and method for detecting anomalies in sensory data of industrial machines located within a predetermined proximity
CN112555084B (en) Real-time state early warning method and device, electronic equipment and storage medium
US11733688B2 (en) System and method for recognizing and forecasting anomalous sensory behavioral patterns of a machine
CN116810825B (en) Wafer conveying mechanical arm abnormality monitoring method and system in vacuum environment
EP4038462A1 (en) System and method for fusing multiple analytics of a wind turbine for improved efficiency
CN115453356B (en) Power equipment operation state monitoring and analyzing method, system, terminal and medium
JP6033140B2 (en) Abnormality detection method and abnormality detection device
CN115360704A (en) Offshore wind power output prediction method, device, equipment and storage medium
US20220341393A1 (en) System and method for fusing multiple analytics of a wind turbine for improved efficiency
CN113051700A (en) Equipment reliability monitoring method and device
CN112162528A (en) Fault diagnosis method, device, equipment and storage medium of numerical control machine tool
Dinh et al. Implementation of Digital Twin-Assisted Condition Monitoring and Fault Diagnosis for Wind Turbines
CN116090324A (en) TensorFlow-based wind turbine generator temperature rise fault early warning method
CN115169736A (en) Power grid disaster prediction method and device, storage medium and electronic equipment
CN117633589A (en) Method and system for monitoring moving ring
CN117494053A (en) Transformer multi-state-quantity data anomaly detection method, medium and system
CN118114204A (en) Wind direction prediction method, system, electronic device and non-transitory machine-readable medium
CN113052320A (en) Equipment safety monitoring method and device
WO2024039510A1 (en) Incremental causal discovery and root cause localization for online system fault diagnosis
CN118094264A (en) Intelligent power capacitor partial discharge detection method and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20851588

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20851588

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 20851588

Country of ref document: EP

Kind code of ref document: A1