WO2024077753A1 - Wind turbine fault prediction method based on multi-source heterogeneous data - Google Patents

Wind turbine fault prediction method based on multi-source heterogeneous data Download PDF

Info

Publication number
WO2024077753A1
WO2024077753A1 PCT/CN2022/137737 CN2022137737W WO2024077753A1 WO 2024077753 A1 WO2024077753 A1 WO 2024077753A1 CN 2022137737 W CN2022137737 W CN 2022137737W WO 2024077753 A1 WO2024077753 A1 WO 2024077753A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
wind turbine
target
sample
historical data
Prior art date
Application number
PCT/CN2022/137737
Other languages
French (fr)
Chinese (zh)
Inventor
杨之乐
安钊
郭媛君
胡天宇
吴承科
Original Assignee
深圳先进技术研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳先进技术研究院 filed Critical 深圳先进技术研究院
Publication of WO2024077753A1 publication Critical patent/WO2024077753A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02EREDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
    • Y02E10/00Energy generation through renewable energy sources
    • Y02E10/70Wind energy
    • Y02E10/72Wind turbines with rotation axis in wind direction

Definitions

  • the present invention relates to the field of new energy technology, and in particular to a wind turbine fault prediction method based on multi-source heterogeneous data.
  • the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, aiming to solve the problem of low efficiency of fault detection of wind turbines requiring manual inspection in the prior art.
  • a first aspect of the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, the method comprising:
  • Acquire historical data of a target wind turbine wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a time period of a preset duration before a current moment;
  • the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
  • the wind turbine fault prediction method based on multi-source heterogeneous data wherein the acquiring of historical data of the target wind turbine comprises:
  • the nose position of the target wind turbine is acquired, and deformation data of each blade of the target wind turbine is acquired according to the nose position and a reflection signal of a light signal sent to the target wind turbine.
  • the wind turbine fault prediction method based on multi-source heterogeneous data wherein the step of acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflection signal of the light signal sent to the target wind turbine, comprises:
  • the deformation data of each blade of the target wind turbine is acquired according to the difference between the nose position and the nose reference position, and the reflected signal of the light signal sent to the target wind turbine and the reference reflected signal data.
  • the wind turbine fault prediction method based on multi-source heterogeneous data, wherein, after obtaining the deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflection signal of the light signal sent to the target wind turbine, comprises:
  • the method further comprises:
  • the fault prediction result of the target wind turbine is updated into the digital twin model of the target wind turbine.
  • the wind turbine fault prediction method based on multi-source heterogeneous data wherein the neural network includes a feature extraction module and a prediction module, and the training process of the neural network is:
  • the feature extraction module extracts the features of the target sample historical data and the target sample expanded data respectively to obtain a first feature and a second feature;
  • the parameters of the sample expansion module, the feature extraction module and the prediction module are updated according to the batch training loss, and the step of selecting part of the sample historical data from each of the sample historical data to form a target training batch is re-executed until the parameters converge.
  • the wind turbine fault prediction method based on multi-source heterogeneous data, wherein the batch training loss is obtained according to the first feature, the second feature, the first prediction result, the second prediction result and the fault label corresponding to each sample historical data in the target training batch, including:
  • sample losses corresponding to each sample historical data in the training batch are summed to obtain the second batch loss
  • the updating of the parameters of the sample expansion module, the feature extraction module and the prediction module according to the batch training loss includes:
  • the parameters of the sample expansion module, the feature extraction module, the prediction module and the discriminator are updated according to the batch training loss.
  • the wind turbine fault prediction method based on multi-source heterogeneous data, wherein the features of the target sample historical data and the target sample expanded data are respectively extracted by the feature extraction module to obtain the first feature and the second feature, including:
  • the first dimension reduction data and the second dimension reduction data are respectively input into the feature extraction module to obtain the first feature and the second feature output by the feature extraction module.
  • a second aspect of the present invention provides a wind turbine fault prediction device based on multi-source heterogeneous data, comprising:
  • a data acquisition module is used to acquire historical data of a target wind turbine, the historical data of the target wind turbine at least including deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, the preset time period being a period of preset duration before a current moment;
  • a prediction module the prediction module is used to input the historical data of the target wind turbine into a trained neural network, and obtain a fault prediction result of the target wind turbine output by the neural network;
  • the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
  • a terminal comprising a processor and a computer-readable storage medium communicatively connected to the processor, wherein the computer-readable storage medium is suitable for storing a plurality of instructions, and the processor is suitable for calling the instructions in the computer-readable storage medium to execute the steps of implementing any one of the above-mentioned methods for predicting wind turbine faults based on multi-source heterogeneous data.
  • a fourth aspect of the present invention provides a computer-readable storage medium storing one or more programs, which can be executed by one or more processors to implement the steps of any of the above-mentioned methods for predicting wind turbine faults based on multi-source heterogeneous data.
  • the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, which predicts wind turbine faults through a neural network.
  • the real data is expanded and processed, so that the amount of data in the training data set of the neural network is larger, thereby improving the accuracy of fault prediction of the trained neural network.
  • FIG1 is a flow chart of an embodiment of a method for predicting wind turbine faults based on multi-source heterogeneous data provided by the present invention
  • FIG2 is a structural principle diagram of an embodiment of a wind turbine fault prediction device based on multi-source heterogeneous data provided by the present invention
  • FIG3 is a schematic diagram showing the principles of an embodiment of a terminal provided by the present invention.
  • the wind turbine fault prediction method based on multi-source heterogeneous data provided by the present invention can be applied to a terminal with computing capabilities.
  • the terminal can execute the wind turbine fault prediction method based on multi-source heterogeneous data provided by the present invention to perform power grid peak-shaving scheduling.
  • the terminal can be but is not limited to various computers, mobile terminals, smart home appliances, wearable devices, etc.
  • the steps include:
  • the preset time length may be one week, one month, etc., and the preset time length may be determined through experiments, that is, different time lengths are used to conduct experiments using the method provided in this embodiment to obtain a more accurate fault prediction result as the preset time length.
  • the vibration data in the historical data of the target wind turbine may be obtained by a sensor installed on the target wind turbine.
  • the blade deformation data in the historical data of the target wind turbine may be obtained by optical signal detection.
  • the historical data of the target wind turbine may be obtained by:
  • the nose position of the target wind turbine is acquired, and deformation data of each blade of the target wind turbine is acquired according to the nose position and a reflection signal of a light signal sent to the target wind turbine.
  • an optical signal generator and an optical signal detector may be arranged around the target wind turbine.
  • the number of the optical signal generator and the optical signal detector may be multiple so that the detection range covers the entire blade of the target wind turbine.
  • the optical signal generator sends an optical signal to the target wind turbine and is reflected and received by the optical signal detector.
  • the optical signal detector receives the reflected signal and analyzes it to obtain the deformation data of the blade. Specifically, the reflected signal pre-set when the blade of the wind turbine is not deformed can be used as a reference, and the deformation data is determined according to the difference between the actually received reflected signal and the reflected signal when it is not deformed.
  • the head of the wind turbine will rotate accordingly to keep the blade in the right wind. Therefore, in this embodiment, when obtaining the deformation data of the blade, it is also necessary to combine the real-time head position of the wind turbine.
  • the step of acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflected signal of the light signal sent to the target wind turbine includes:
  • the deformation data of each blade of the target wind turbine is acquired according to the difference between the nose position and the nose reference position, and the reflection signal of the light signal sent to the target wind turbine and the standard reflection signal data.
  • the nose reference position refers to the pre-set position of the nose of the target wind turbine relative to the optical signal generator and the optical signal detector.
  • the positions of the optical signal generator and the optical signal detector remain unchanged, while the nose position of the target wind turbine will change with different wind directions.
  • the reflected signal data received by the optical signal detector after the optical signal emitted by the optical signal generator is reflected by the blades of the target wind turbine when the nose of the target wind turbine is at the nose reference position is obtained in advance as the reference reflected signal data.
  • the reflected signal of the optical signal sent to the blades of the target wind turbine when the blades of the target wind turbine are not deformed can be obtained.
  • the deformation data of the blades of the target wind turbine can be determined.
  • the method After acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflected signal of the light signal sent to the target wind turbine, the method includes:
  • the three-dimensional model in the digital twin model of the target wind turbine is updated based on the current blade deformation data of the target wind turbine.
  • a digital twin model of the target wind turbine is set up, and the digital twin model includes a three-dimensional model of the target wind turbine. After obtaining the current blade deformation data of the target wind turbine, the three-dimensional model of the target wind turbine is updated according to the blade deformation data, so that maintenance personnel can intuitively obtain the deformation condition of the blades through the three-dimensional model.
  • the method provided in this embodiment further includes the steps of:
  • S200 Inputting historical data of the target wind turbine into a trained neural network, and obtaining a fault prediction result of the target wind turbine output by the neural network.
  • the method provided in this embodiment uses sample expansion to expand the data set.
  • the neural network is trained based on the training data set, and the training data set includes sample historical data and sample expansion data.
  • the sample historical data is the historical data of the wind turbine that is actually collected, and the sample expansion data is the data generated by performing sample expansion processing on the sample historical data.
  • the neural network includes a feature extraction module and a prediction module, and the training process of the neural network is:
  • a label is a fault category label
  • a label is a fault category label
  • the training process of the adversarial network is not ideal, and the fault prediction network trained with the expanded data generated by the adversarial network after training is not high in the final prediction accuracy.
  • a new training method for a data expansion network is proposed, and the data expansion network and the fault prediction network are jointly trained.
  • the label of a single data is not restricted, but a part of all the real labeled data is selected.
  • the sample historical data is the real historical data of the wind turbine, and the sample historical data has corresponding labels, that is, the actual fault type of the wind turbine and the corresponding real historical data when the fault type occurs are collected to obtain the sample historical data and the corresponding fault label label.
  • the selected part of the sample historical data is used as the target training batch, and each of the sample historical data in the target training batch is input into the sample expansion module respectively.
  • the batch training loss of the target training batch is obtained, and then the parameters of each network module are updated based on the batch training loss, and then part of the sample historical data is randomly removed as the new target training batch, and this is iterated multiple times until the parameters converge.
  • the expanded data is not labeled. Instead, the outputs of multiple network modules are used to calculate multiple losses to update the parameters of the sample expansion module, so that the sample expansion module can learn the essence of the feature space where data of different fault types are located.
  • obtaining the batch training loss according to the first feature, the second feature, the first prediction result, the second prediction result and the fault label corresponding to each sample historical data in the target training batch includes:
  • sample losses corresponding to each sample historical data in the training batch are summed to obtain the second batch loss
  • the batch training loss is obtained according to the first batch loss and the second batch loss.
  • multiple losses are set to constrain the optimization direction of the network module parameters.
  • the first loss can enable the feature extraction module and the prediction module to learn the intrinsic relationship between the historical data and faults of the wind turbine
  • the second loss can enable the feature extraction module and the sample expansion module to learn the feature space of the real data of the wind turbine for predicting faults, so that the generated expanded data can be consistent with the real data in terms of the features used to predict faults.
  • the module parameters are also updated by constraining the distance between the probability distribution of the fault prediction results of the real data and the probability distribution of the fault prediction results of the expanded data.
  • the first probability distribution is obtained according to the corresponding first prediction results
  • the corresponding second features are input into the prediction module
  • the second probability distribution is obtained based on the multiple second prediction results output by the prediction module. If the expanded data generated by the sample expansion module has the same feature space as the real data, and the prediction module has fully learned the intrinsic connection between the feature space and the fault category, then the first probability distribution and the second probability distribution should be consistent. Therefore, calculating the first batch distribution loss based on the difference between the first probability distribution and the second probability distribution for updating the module parameters can effectively improve the accuracy of the model.
  • the corresponding first loss and second loss can be obtained, the first loss and the second loss are summed to obtain a sample loss corresponding to the sample historical data, the sample losses corresponding to each sample historical data in the target training batch are summed to obtain the second batch loss, the first batch loss and the second batch loss are summed to obtain the batch training loss corresponding to the target training batch, and the parameters of the sample expansion module, the feature extraction module, the prediction module and the discriminator are updated based on the batch training loss.
  • each time the sample historical data is selected to form the target training batch it is selected randomly, that is, the data combination in the target training batch used to update the module parameters each time is different, so that the probability distribution is also different each time, which can have a similar effect as expanding the training data.
  • the historical data of the target wind turbine is also subjected to dimensionality reduction processing.
  • the features of the target sample historical data and the target sample expanded data are respectively extracted by the feature extraction module to obtain the first feature and the second feature, including:
  • the first dimension reduction data and the second dimension reduction data are respectively input into the feature extraction module to obtain the first feature and the second feature output by the feature extraction module.
  • the fault prediction result of the target wind turbine generator can be input into the trained neural network, and the neural network outputs the fault prediction result of the target wind turbine generator.
  • the method includes:
  • the fault prediction result of the target wind turbine is updated into the digital twin model of the target wind turbine.
  • Displaying the fault prediction result in the digital twin model of the target wind turbine may be based on the components corresponding to the fault prediction result, and displaying the fault prediction result on the three-dimensional model of the corresponding components, which is more intuitive.
  • this embodiment provides a wind turbine fault prediction method based on multi-source heterogeneous data, which predicts wind turbine faults through a neural network, and in order to address the problem of insufficient real label data of existing wind turbines, expands the real data, so that the amount of data in the training data set of the neural network is larger, thereby improving the accuracy of fault prediction of the trained neural network, eliminating the need for manual inspection of wind turbines to troubleshoot faults, and improving the fault detection efficiency of wind turbines.
  • steps in the flowchart may include multiple sub-steps or multiple stages, and these sub-steps or stages are not necessarily executed at the same time, but can be executed at different times, and the execution order of these sub-steps or stages is not necessarily to be carried out in sequence, but can be executed in turn or alternately with other steps or at least a part of the sub-steps or stages of other steps.
  • Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM) or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
  • the present invention further provides a wind turbine fault prediction device based on multi-source heterogeneous data.
  • the wind turbine fault prediction device based on multi-source heterogeneous data includes:
  • a data acquisition module the data acquisition module is used to acquire historical data of a target wind turbine, the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, the preset time period is a period of preset duration before the current moment, as described in the first embodiment;
  • a prediction module the prediction module is used to input the historical data of the target wind turbine into a trained neural network, and obtain a fault prediction result of the target wind turbine output by the neural network, as specifically described in the first embodiment;
  • the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data, as specifically described in Example 1.
  • the present invention also provides a terminal, as shown in Figure 3, the terminal includes a processor 10 and a memory 20.
  • Figure 3 only shows some components of the terminal, but it should be understood that it is not required to implement all the components shown, and more or fewer components can be implemented instead.
  • the memory 20 may be an internal storage unit of the terminal, such as a hard disk or memory of the terminal. In other embodiments, the memory 20 may also be an external storage device of the terminal, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash card (Flash Card), etc. equipped on the terminal. Further, the memory 20 may also include both an internal storage unit of the terminal and an external storage device. The memory 20 is used to store application software and various types of data installed on the terminal. The memory 20 may also be used to temporarily store data that has been output or is to be output.
  • a plug-in hard disk such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash card (Flash Card), etc. equipped on the terminal.
  • the memory 20 may also include both an internal storage unit of the terminal and an external storage device.
  • the memory 20 is used to store
  • a wind turbine fault prediction program 30 based on multi-source heterogeneous data is stored on the memory 20, and the wind turbine fault prediction program 30 based on multi-source heterogeneous data can be executed by the processor 10, thereby realizing the wind turbine fault prediction method based on multi-source heterogeneous data in the present application.
  • the processor 10 may be a central processing unit (CPU), a microprocessor or other chip, used to run the program code or process data stored in the memory 20, such as executing the wind turbine fault prediction method based on multi-source heterogeneous data.
  • CPU central processing unit
  • microprocessor or other chip
  • the processor 10 executes the wind turbine fault prediction program 30 based on multi-source heterogeneous data in the memory 20, the following steps are implemented:
  • Acquire historical data of a target wind turbine wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a time period of a preset duration before a current moment;
  • the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
  • the present invention also provides a computer-readable storage medium, which stores one or more programs, and the one or more programs can be executed by one or more processors to implement the steps of the wind turbine fault prediction method based on multi-source heterogeneous data as described above.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • Wind Motors (AREA)

Abstract

Disclosed in the present invention is a wind turbine fault prediction method based on multi-source heterogeneous data. The method comprises: acquiring historical data of a target wind turbine, wherein the historical data of the target wind turbine at least comprises deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, and the preset time period is a preset duration before the current moment; and inputting the historical data of the target wind turbine into a trained neural network, and acquiring a fault prediction result of the target wind turbine outputted by the neural network, wherein the neural network is trained on the basis of a training data set, the training data set comprises sample historical data and sample expansion data, the sample historical data is actually collected historical data of the wind turbine, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data. The present invention can improve the fault detection efficiency of the wind turbine.

Description

一种基于多源异构数据的风力发电机故障预测方法A wind turbine fault prediction method based on multi-source heterogeneous data 技术领域Technical Field
本发明涉及新能源技术领域,特别涉及一种基于多源异构数据的风力发电机故障预测方法。The present invention relates to the field of new energy technology, and in particular to a wind turbine fault prediction method based on multi-source heterogeneous data.
背景技术Background technique
现有的风力发电机需要人工定期检查来排查风力发电机的故障,但是人工检查需要风力发电机停机进行检修,不仅不方便还会浪费清洁能源。Existing wind turbines require regular manual inspections to troubleshoot wind turbine faults, but manual inspections require the wind turbines to be shut down for maintenance, which is not only inconvenient but also wastes clean energy.
因此,现有技术还有待改进和提高。Therefore, the prior art still needs to be improved and enhanced.
技术问题technical problem
针对现有技术的上述缺陷,本发明提供一种基于多源异构数据的风力发电机故障预测方法,旨在解决现有技术中需要人工检查风力发电机进行故障检测效率低的问题。In view of the above-mentioned defects of the prior art, the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, aiming to solve the problem of low efficiency of fault detection of wind turbines requiring manual inspection in the prior art.
技术解决方案Technical Solutions
为了解决上述技术问题,本发明所采用的技术方案如下:In order to solve the above technical problems, the technical solution adopted by the present invention is as follows:
本发明的第一方面,提供一种基于多源异构数据的风力发电机故障预测方法,所述方法包括:A first aspect of the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, the method comprising:
获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段;Acquire historical data of a target wind turbine, wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a time period of a preset duration before a current moment;
将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果;Inputting historical data of the target wind turbine into a trained neural network, and obtaining a fault prediction result of the target wind turbine output by the neural network;
其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
所述的基于多源异构数据的风力发电机故障预测方法,其中,所述获取目标风力发电机的历史数据,包括:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein the acquiring of historical data of the target wind turbine comprises:
获取所述目标风力发电机的机头位置,根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据。The nose position of the target wind turbine is acquired, and deformation data of each blade of the target wind turbine is acquired according to the nose position and a reflection signal of a light signal sent to the target wind turbine.
所述的基于多源异构数据的风力发电机故障预测方法,其中,所述根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据,包括:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein the step of acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflection signal of the light signal sent to the target wind turbine, comprises:
获取所述目标风力发电机的机头基准位置以及所述机头基准位置对应的基准反射信号数据;Acquire a nose reference position of the target wind turbine and reference reflection signal data corresponding to the nose reference position;
根据所述机头位置和所述机头基准位置之间的差异、以及向所述目标风力发电机发出的光信号的反射信号与所述基准反射信号数据获取所述目标风力发电机的各个叶片的形变数据。The deformation data of each blade of the target wind turbine is acquired according to the difference between the nose position and the nose reference position, and the reflected signal of the light signal sent to the target wind turbine and the reference reflected signal data.
所述的基于多源异构数据的风力发电机故障预测方法,其中,根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据之后,包括:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein, after obtaining the deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflection signal of the light signal sent to the target wind turbine, comprises:
根据所述目标风力发电机当前的叶片形变数据,基于所述目标风力发电机当前的叶片形变数据更新所述目标风力发电机的数字孪生模型中的三维模型;According to the current blade deformation data of the target wind turbine, updating the three-dimensional model in the digital twin model of the target wind turbine based on the current blade deformation data of the target wind turbine;
所述获取所述神经网络输出的所述目标风力发电机的故障预测结果之后,包括:After obtaining the fault prediction result of the target wind turbine generator output by the neural network, the method further comprises:
将所述目标风力发电机的故障预测结果更新至所述目标风力发电机的数字孪生模型中。The fault prediction result of the target wind turbine is updated into the digital twin model of the target wind turbine.
所述的基于多源异构数据的风力发电机故障预测方法,其中,所述神经网络包括特征提取模块和预测模块,所述神经网络的训练过程为:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein the neural network includes a feature extraction module and a prediction module, and the training process of the neural network is:
在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次,对于所述目标训练批次中的目标样本历史数据,执行如下步骤:Select a portion of the sample history data from each of the sample history data to form a target training batch, and perform the following steps for the target sample history data in the target training batch:
将所述目标样本历史数据输入至样本扩充模块,生成目标样本扩充数据;Inputting the target sample historical data into a sample expansion module to generate target sample expansion data;
通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征;The feature extraction module extracts the features of the target sample historical data and the target sample expanded data respectively to obtain a first feature and a second feature;
将所述第一特征和所述第二特征分别输入至所述预测模块,获取所述预测模块输出的第一预测结果和第二预测结果;Inputting the first feature and the second feature into the prediction module respectively, and obtaining a first prediction result and a second prediction result output by the prediction module;
根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失;Obtaining a batch training loss according to the first feature, the second feature, the first prediction result, the second prediction result, and the fault label corresponding to each sample historical data in the target training batch;
根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块和所述预测模块的参数,并重新执行所述在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次的步骤,直至参数收敛。The parameters of the sample expansion module, the feature extraction module and the prediction module are updated according to the batch training loss, and the step of selecting part of the sample historical data from each of the sample historical data to form a target training batch is re-executed until the parameters converge.
所述的基于多源异构数据的风力发电机故障预测方法,其中,所述根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失,包括:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein the batch training loss is obtained according to the first feature, the second feature, the first prediction result, the second prediction result and the fault label corresponding to each sample historical data in the target training batch, including:
根据所述目标样本历史数据对应的故障标注标签和所述第一预测结果得到第一损失;Obtaining a first loss according to the fault label corresponding to the target sample historical data and the first prediction result;
将所述第一特征和所述第二特征分别输入至判别器,获取所述判别器输出的判别结果,根据所述判别结果得到第二损失;Inputting the first feature and the second feature into a discriminator respectively, obtaining a discrimination result output by the discriminator, and obtaining a second loss according to the discrimination result;
根据所述第一损失和所述第二损失得到所述目标样本历史数据对应的样本损失;Obtaining a sample loss corresponding to the target sample historical data according to the first loss and the second loss;
获取所述训练批次中每个样本历史数据对应的所述第一故障预测结果的概率分布作为第一概率分布,获取所述训练批次中每个样本历史数据对应的所述第二故障预测结果的概率分布作为第二概率分布;Obtaining the probability distribution of the first fault prediction result corresponding to each sample historical data in the training batch as a first probability distribution, and obtaining the probability distribution of the second fault prediction result corresponding to each sample historical data in the training batch as a second probability distribution;
根据所述第一概率分布和所述第二概率分布得到第一批次分损失;Obtaining a first batch loss according to the first probability distribution and the second probability distribution;
对所述训练批次中的每个样本历史数据分别对应的样本损失进行求和,得到第二批次分损失;The sample losses corresponding to each sample historical data in the training batch are summed to obtain the second batch loss;
根据所述第一批次分损失和所述第二批次分损失得到所述批次训练损失;Obtaining the batch training loss according to the first batch loss and the second batch loss;
所述根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块和所述预测模块的参数,包括:The updating of the parameters of the sample expansion module, the feature extraction module and the prediction module according to the batch training loss includes:
根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块、所述预测模块和所述判别器的参数。The parameters of the sample expansion module, the feature extraction module, the prediction module and the discriminator are updated according to the batch training loss.
所述的基于多源异构数据的风力发电机故障预测方法,其中,所述通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征,包括:The wind turbine fault prediction method based on multi-source heterogeneous data, wherein the features of the target sample historical data and the target sample expanded data are respectively extracted by the feature extraction module to obtain the first feature and the second feature, including:
分别对所述目标样本历史数据和所述目标样本扩充数据进行降维处理,得到第一降维数据和第二降维数据;Respectively performing dimensionality reduction processing on the target sample historical data and the target sample expanded data to obtain first dimensionality reduction data and second dimensionality reduction data;
分别将所述第一降维数据和所述第二降维数据输入至所述特征提取模块,获取所述特征提取模块输出的所述第一特征和所述第二特征。The first dimension reduction data and the second dimension reduction data are respectively input into the feature extraction module to obtain the first feature and the second feature output by the feature extraction module.
本发明的第二方面,提供一种基于多源异构数据的风力发电机故障预测装置,包括:A second aspect of the present invention provides a wind turbine fault prediction device based on multi-source heterogeneous data, comprising:
数据获取模块,所述数据获取模块用于获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段;A data acquisition module, the data acquisition module is used to acquire historical data of a target wind turbine, the historical data of the target wind turbine at least including deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, the preset time period being a period of preset duration before a current moment;
预测模块,所述预测模块用于将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果;A prediction module, the prediction module is used to input the historical data of the target wind turbine into a trained neural network, and obtain a fault prediction result of the target wind turbine output by the neural network;
其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
本发明的第三方面,提供一种终端,所述终端包括处理器、与处理器通信连接的计算机可读存储介质,所述计算机可读存储介质适于存储多条指令,所述处理器适于调用所述计算机可读存储介质中的指令,以执行实现上述任一项所述的基于多源异构数据的风力发电机故障预测方法的步骤。According to a third aspect of the present invention, a terminal is provided, comprising a processor and a computer-readable storage medium communicatively connected to the processor, wherein the computer-readable storage medium is suitable for storing a plurality of instructions, and the processor is suitable for calling the instructions in the computer-readable storage medium to execute the steps of implementing any one of the above-mentioned methods for predicting wind turbine faults based on multi-source heterogeneous data.
本发明的第四方面,提供一种计算机可读存储介质,所述计算机可读存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现上述任一项所述的基于多源异构数据的风力发电机故障预测方法的步骤。A fourth aspect of the present invention provides a computer-readable storage medium storing one or more programs, which can be executed by one or more processors to implement the steps of any of the above-mentioned methods for predicting wind turbine faults based on multi-source heterogeneous data.
有益效果Beneficial Effects
与现有技术相比,本发明提供了一种基于多源异构数据的风力发电机故障预测方法,通过神经网络预测风力风电机的故障,并且针对现有的风力发电机的真实标签数据不足的问题,对真实数据进行扩充处理,使得神经网络的训练数据集中的数据量更大,提升训练得到的神经网络对故障预测的准确性,不需要采用人工对风力发电机进行检查来排查故障,提升了风力发电机的故障检测效率。Compared with the prior art, the present invention provides a wind turbine fault prediction method based on multi-source heterogeneous data, which predicts wind turbine faults through a neural network. In order to address the problem of insufficient real label data of existing wind turbines, the real data is expanded and processed, so that the amount of data in the training data set of the neural network is larger, thereby improving the accuracy of fault prediction of the trained neural network. There is no need to manually inspect the wind turbine to troubleshoot the fault, thereby improving the fault detection efficiency of the wind turbine.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为本发明提供的基于多源异构数据的风力发电机故障预测方法的实施例的流程图;FIG1 is a flow chart of an embodiment of a method for predicting wind turbine faults based on multi-source heterogeneous data provided by the present invention;
图2为本发明提供的基于多源异构数据的风力发电机故障预测装置的实施例的结构原理图;FIG2 is a structural principle diagram of an embodiment of a wind turbine fault prediction device based on multi-source heterogeneous data provided by the present invention;
图3为本发明提供的终端的实施例的原理示意图。FIG3 is a schematic diagram showing the principles of an embodiment of a terminal provided by the present invention.
本发明的实施方式Embodiments of the present invention
为使本发明的目的、技术方案及效果更加清楚、明确,以下参照附图并举实施例对本发明进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。In order to make the purpose, technical solution and effect of the present invention clearer and more specific, the present invention is further described in detail with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are only used to explain the present invention and are not used to limit the present invention.
本发明提供的基于多源异构数据的风力发电机故障预测方法,可以应用于具有计算能力的终端中,终端可以执行本发明提供的基于多源异构数据的风力发电机故障预测方法进行电网调峰调度,终端可以但不限于是各种计算机、移动终端、智能家电、可穿戴式设备等。The wind turbine fault prediction method based on multi-source heterogeneous data provided by the present invention can be applied to a terminal with computing capabilities. The terminal can execute the wind turbine fault prediction method based on multi-source heterogeneous data provided by the present invention to perform power grid peak-shaving scheduling. The terminal can be but is not limited to various computers, mobile terminals, smart home appliances, wearable devices, etc.
实施例一Embodiment 1
如图1所示,所述基于多源异构数据的风力发电机故障预测方法的一个实施例中,包括步骤:As shown in FIG1 , in one embodiment of the wind turbine fault prediction method based on multi-source heterogeneous data, the steps include:
S100、获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段。S100, acquiring historical data of a target wind turbine, wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a period of preset duration before a current moment.
所述预设时长可以为一周、一个月等,所述预设时长可以通过实验确定,即采用不同的时长经过本实施例提供的方法进行实验,获取故障预测结果更准确的作为所述预设时长。The preset time length may be one week, one month, etc., and the preset time length may be determined through experiments, that is, different time lengths are used to conduct experiments using the method provided in this embodiment to obtain a more accurate fault prediction result as the preset time length.
所述目标风力发电机的历史数据中的振动数据,可以是通过安装在所述目标风力发电机上的传感器获取。所述目标风力发电机的历史数据中的叶片形变数据,可以是通过采用光信号探测获取。具体地,所述获取目标风力发电机的历史数据,包括:The vibration data in the historical data of the target wind turbine may be obtained by a sensor installed on the target wind turbine. The blade deformation data in the historical data of the target wind turbine may be obtained by optical signal detection. Specifically, the historical data of the target wind turbine may be obtained by:
获取所述目标风力发电机的机头位置,根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据。The nose position of the target wind turbine is acquired, and deformation data of each blade of the target wind turbine is acquired according to the nose position and a reflection signal of a light signal sent to the target wind turbine.
在本实施例中,可以在所述目标风力发电机的周围设置光信号发生器和光信号探测器,所述光信号发生器和所述光信号探测器的数量可以为多个以使得检测范围覆盖所述目标风力发电机的整个叶片,所述光信号发生器发出光信号到达所述目标风力发电机被反射后被所述光信号探测器接收,所述光信号探测器接收到反射信号并进行分析来获取叶片的形变数据。具体地,可以是预先设置在风力发电机的叶片无形变时的反射信号作为参考,根据实际接收到的反射信号与无形变时的反射信号之间的差异来确定形变数据。而在风力发电机运行的过程中,随着风向的改变,风力发电机的机头会相应转动以使得叶片保持正风,因此,在本实施例中,在获取叶片的形变数据时,还需要结合风力发电机的实时机头位置。In this embodiment, an optical signal generator and an optical signal detector may be arranged around the target wind turbine. The number of the optical signal generator and the optical signal detector may be multiple so that the detection range covers the entire blade of the target wind turbine. The optical signal generator sends an optical signal to the target wind turbine and is reflected and received by the optical signal detector. The optical signal detector receives the reflected signal and analyzes it to obtain the deformation data of the blade. Specifically, the reflected signal pre-set when the blade of the wind turbine is not deformed can be used as a reference, and the deformation data is determined according to the difference between the actually received reflected signal and the reflected signal when it is not deformed. During the operation of the wind turbine, as the wind direction changes, the head of the wind turbine will rotate accordingly to keep the blade in the right wind. Therefore, in this embodiment, when obtaining the deformation data of the blade, it is also necessary to combine the real-time head position of the wind turbine.
所述根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据,包括:The step of acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflected signal of the light signal sent to the target wind turbine includes:
获取所述目标风力发电机的机头基准位置以及所述机头基准位置对应的基准反射信号数据;Acquire a nose reference position of the target wind turbine and reference reflection signal data corresponding to the nose reference position;
根据所述机头位置和所述机头基准位置之间的差异、以及向所述目标风力发电机发出的光信号的反射信号与所述标准反射信号数据获取所述目标风力发电机的各个叶片的形变数据。The deformation data of each blade of the target wind turbine is acquired according to the difference between the nose position and the nose reference position, and the reflection signal of the light signal sent to the target wind turbine and the standard reflection signal data.
具体地,所述机头基准位置是指预先设置的所述目标风力发电机的机头相对于所述光信号发生器和所述光信号探测器的位置,所述光信号发生器和所述光信号探测器的位置保持不变,而所述目标风力发电机的机头位置会随着风向的不同而变化。预先获取所述目标风力发电机的机头在所述机头基准位置时所述光信号发生器发出的光信号被所述目标风力发电机的叶片反射后由所述光信号探测器接收到的反射信号数据作为所述基准反射信号数据,根据所述机头位置和所述机头基准位置之间的差异以及所述基准发射信号,可以得到所述目标风力发电机的叶片无形变情况下在所述机头位置时向所述目标风力发电机的叶片发送的光信号的反射信号,与实际接收到的反射信号进行比较,就可以确定所述目标风力发电机的叶片的形变数据。Specifically, the nose reference position refers to the pre-set position of the nose of the target wind turbine relative to the optical signal generator and the optical signal detector. The positions of the optical signal generator and the optical signal detector remain unchanged, while the nose position of the target wind turbine will change with different wind directions. The reflected signal data received by the optical signal detector after the optical signal emitted by the optical signal generator is reflected by the blades of the target wind turbine when the nose of the target wind turbine is at the nose reference position is obtained in advance as the reference reflected signal data. According to the difference between the nose position and the nose reference position and the reference transmitted signal, the reflected signal of the optical signal sent to the blades of the target wind turbine when the blades of the target wind turbine are not deformed can be obtained. By comparing with the actually received reflected signal, the deformation data of the blades of the target wind turbine can be determined.
根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据之后,包括:After acquiring deformation data of each blade of the target wind turbine according to the position of the wind turbine head and the reflected signal of the light signal sent to the target wind turbine, the method includes:
根据所述目标风力发电机当前的叶片形变数据,基于所述目标风力发电机当前的叶片形变数据更新所述目标风力发电机的数字孪生模型中的三维模型。According to the current blade deformation data of the target wind turbine, the three-dimensional model in the digital twin model of the target wind turbine is updated based on the current blade deformation data of the target wind turbine.
为了方便风力发电机的健康管理,在本实施例中,设置所述目标风力发电机的数字孪生模型,所述数字孪生模型中包括所述目标风力发电机的三维模型,在获取到所述目标风力发电机当前的叶片形变数据后,根据叶片形变数据更新所述目标风力发电机的三维模型,使得维护人员可以通过三维模型直观地获取到叶片的形变情况。In order to facilitate the health management of wind turbines, in this embodiment, a digital twin model of the target wind turbine is set up, and the digital twin model includes a three-dimensional model of the target wind turbine. After obtaining the current blade deformation data of the target wind turbine, the three-dimensional model of the target wind turbine is updated according to the blade deformation data, so that maintenance personnel can intuitively obtain the deformation condition of the blades through the three-dimensional model.
请再次参阅图1,本实施例提供的方法,还包括步骤:Please refer to FIG. 1 again. The method provided in this embodiment further includes the steps of:
S200、将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果。S200: Inputting historical data of the target wind turbine into a trained neural network, and obtaining a fault prediction result of the target wind turbine output by the neural network.
现有技术中,还没有成熟的关于风力发电机的数据库,风力发电机的真实数据量较少,而神经网络模型的训练需要大量的数据,采用少量的数据训练神经网络模型会导致模型的训练结果不理想,影响故障预测的准确性。针对这一问题,本实施例提供的方法,采用样本扩充的方式来扩充数据集,具体地,本实施例中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。In the prior art, there is no mature database on wind turbines. The amount of real data on wind turbines is relatively small, while the training of neural network models requires a large amount of data. Using a small amount of data to train the neural network model will lead to unsatisfactory training results of the model, affecting the accuracy of fault prediction. To address this problem, the method provided in this embodiment uses sample expansion to expand the data set. Specifically, in this embodiment, the neural network is trained based on the training data set, and the training data set includes sample historical data and sample expansion data. The sample historical data is the historical data of the wind turbine that is actually collected, and the sample expansion data is the data generated by performing sample expansion processing on the sample historical data.
所述神经网络包括特征提取模块和预测模块,所述神经网络的训练过程为:The neural network includes a feature extraction module and a prediction module, and the training process of the neural network is:
S001、在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次,对于所述目标训练批次中的目标样本历史数据,执行如下步骤:S001. Selecting a portion of the sample history data from each of the sample history data to form a target training batch, and executing the following steps for the target sample history data in the target training batch:
S002、将所述目标样本历史数据输入至样本扩充模块,生成目标样本扩充数据;S002, inputting the target sample historical data into a sample expansion module to generate target sample expansion data;
S003、通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征;S003, extracting features of the target sample historical data and the target sample expanded data respectively through the feature extraction module to obtain a first feature and a second feature;
S004、将所述第一特征和所述第二特征分别输入至所述预测模块,获取所述预测模块输出的第一预测结果和第二预测结果;S004, inputting the first feature and the second feature into the prediction module respectively, and obtaining a first prediction result and a second prediction result output by the prediction module;
S005、根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失;S005, obtaining a batch training loss according to the first feature, the second feature, the first prediction result, the second prediction result, and the fault label corresponding to each sample historical data in the target training batch;
S006、根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块和所述预测模块的参数,并重新执行所述在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次的步骤,直至参数收敛。S006. Update the parameters of the sample expansion module, the feature extraction module and the prediction module according to the batch training loss, and re-execute the step of selecting part of the sample historical data from each of the sample historical data to form a target training batch until the parameters converge.
在现有的数据扩充方案中,大部分都是采用对抗网络根据标签分别进行样本扩充的,并且采用对抗网络单独训练,例如具有A标签(以本申请中的故障预测任务为例,则A标签为故障类别标签)的数据,通过对抗网络的训练,生成也具有A标签的数据,以实现数据扩充。但是,对于数据量较少的情况下,例如故障类别标签有多种,而所有的风力发电机的真实带标签数据量就不多,拆分到各个故障类别数据量就更少了,这样对抗网络的训练过程也并不理想,用训练完成后的对抗网络生成的扩充数据去训练的故障预测网络最后预测准确性也不高。在本实施例中,提出了一种新的数据扩充网络的训练方式,对数据扩充网络与故障预测网络进行联合训练。In the existing data expansion schemes, most of them use adversarial networks to expand samples according to labels, and use adversarial networks to train separately. For example, data with A label (taking the fault prediction task in this application as an example, A label is a fault category label) is generated through adversarial network training to achieve data expansion. However, in the case of a small amount of data, for example, there are multiple fault category labels, and the actual amount of labeled data of all wind turbines is not much, and the amount of data split into each fault category is even less. In this way, the training process of the adversarial network is not ideal, and the fault prediction network trained with the expanded data generated by the adversarial network after training is not high in the final prediction accuracy. In this embodiment, a new training method for a data expansion network is proposed, and the data expansion network and the fault prediction network are jointly trained.
具体地,在本实施例中,在进行数据扩充时,并不针对单个的数据限制其标签,而是在所有的真实带标签数据中选取部分。所述样本历史数据为风力发电机的真实历史数据,所述样本历史数据都有对应的标签,即采集风力发电机真实发生的故障类型,以及发生该故障类型时对应的真实历史数据,得到所述样本历史数据和对应的故障标注标签。将选取的部分所述样本历史数据作为目标训练批次,对于所述目标训练批次中的每个所述样本历史数据,都分别输入至样本扩充模块,通过上述的步骤S002-S005,得到所述目标训练批次的批次训练损失,再基于所述批次训练损失更新各个网络模块的参数,再重新随机寻去部分所述样本历史数据作为新的所述目标训练批次,这样多次迭代,直到参数收敛。在这个过程中,并不对扩充得到的数据标注标签,而是通过多个网络模块的输出来计算多个方面的损失更新所述样本扩充模块的参数,使得所述样本扩充模块能够学习到不同的故障类型的数据所在的特征空间的本质,相对于现有的对抗网络的损失的单一性,多种类型损失对模块参数更新的约束,可以弥补少数据量的缺陷,防止网络模型的参数优化结果陷入局部最优导致的准确性降低。Specifically, in this embodiment, when performing data expansion, the label of a single data is not restricted, but a part of all the real labeled data is selected. The sample historical data is the real historical data of the wind turbine, and the sample historical data has corresponding labels, that is, the actual fault type of the wind turbine and the corresponding real historical data when the fault type occurs are collected to obtain the sample historical data and the corresponding fault label label. The selected part of the sample historical data is used as the target training batch, and each of the sample historical data in the target training batch is input into the sample expansion module respectively. Through the above steps S002-S005, the batch training loss of the target training batch is obtained, and then the parameters of each network module are updated based on the batch training loss, and then part of the sample historical data is randomly removed as the new target training batch, and this is iterated multiple times until the parameters converge. In this process, the expanded data is not labeled. Instead, the outputs of multiple network modules are used to calculate multiple losses to update the parameters of the sample expansion module, so that the sample expansion module can learn the essence of the feature space where data of different fault types are located. Compared with the singleness of the loss of the existing adversarial network, the constraints of multiple types of losses on the update of module parameters can make up for the defect of small data volume and prevent the parameter optimization results of the network model from falling into the local optimum, resulting in reduced accuracy.
具体地,所述根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失,包括:Specifically, obtaining the batch training loss according to the first feature, the second feature, the first prediction result, the second prediction result and the fault label corresponding to each sample historical data in the target training batch includes:
根据所述目标样本历史数据对应的故障标注标签和所述第一预测结果得到第一损失;Obtaining a first loss according to the fault label corresponding to the target sample historical data and the first prediction result;
将所述第一特征和所述第二特征分别输入至判别器,获取所述判别器输出的判别结果,根据所述判别结果得到第二损失;Inputting the first feature and the second feature into a discriminator respectively, obtaining a discrimination result output by the discriminator, and obtaining a second loss according to the discrimination result;
根据所述第一损失和所述第二损失得到所述目标样本历史数据对应的样本损失;Obtaining a sample loss corresponding to the target sample historical data according to the first loss and the second loss;
获取所述训练批次中每个样本历史数据对应的所述第一故障预测结果的概率分布作为第一概率分布,获取所述训练批次中每个样本历史数据对应的所述第二故障预测结果的概率分布作为第二概率分布;Obtaining the probability distribution of the first fault prediction result corresponding to each sample historical data in the training batch as a first probability distribution, and obtaining the probability distribution of the second fault prediction result corresponding to each sample historical data in the training batch as a second probability distribution;
根据所述第一概率分布和所述第二概率分布得到第一批次分损失;Obtaining a first batch loss according to the first probability distribution and the second probability distribution;
对所述训练批次中的每个样本历史数据分别对应的样本损失进行求和,得到第二批次分损失;The sample losses corresponding to each sample historical data in the training batch are summed to obtain the second batch loss;
根据所述第一批次分损失和所述第二批次分损失得到所述批次训练损失。The batch training loss is obtained according to the first batch loss and the second batch loss.
在本实施例中,设置多种损失来对网络模块参数的优化方向进行约束。所述第一损失可以使得所述特征提取模块和所述预测模块学习到风力发电机的历史数据和故障之间的内在关联,所述第二损失可以使得所述特征提取模块和所述样本扩充模块学习到风力发电机的真实数据在用于预测故障上的特征的特征空间,使得生成的扩充数据能够在用于预测故障的特征上与真实数据一致。而由于样本扩充数据不带有标签,为了将生成的样本扩充数据用于训练所述预测模块,进一步地提升所述预测模块的故障预测能力,在本实施例中,还通过约束真实数据的故障预测结果的概率分布和扩充数据的故障预测结果的概率分布之间的距离来更新模块参数。In this embodiment, multiple losses are set to constrain the optimization direction of the network module parameters. The first loss can enable the feature extraction module and the prediction module to learn the intrinsic relationship between the historical data and faults of the wind turbine, and the second loss can enable the feature extraction module and the sample expansion module to learn the feature space of the real data of the wind turbine for predicting faults, so that the generated expanded data can be consistent with the real data in terms of the features used to predict faults. Since the sample expansion data does not carry labels, in order to use the generated sample expansion data to train the prediction module and further improve the fault prediction ability of the prediction module, in this embodiment, the module parameters are also updated by constraining the distance between the probability distribution of the fault prediction results of the real data and the probability distribution of the fault prediction results of the expanded data.
具体地,对于在训练过程中每个所述训练批次中的所有所述样本历史数据,根据其对应的各个所述第一预测结果获取所述第一概率分布,而对于每个所述训练批次中生成的所有的所述扩充样本数据,将其对应的所述第二特征输入至所述预测模块,基于所述预测模块输出的多个所述第二预测结果获取所述第二概率分布,如果所述样本扩充模块生成的扩充数据具有和真实数据同样的特征空间,且所述预测模块也充分学习到了该特征空间和故障类别的内在联系,那么所述第一概率分布和所述第二概率分布应该是一致的,因此,基于所述第一概率分布和所述第二概率分布的差异计算所述第一批次分损失用于更新模块参数,可以有效提升模型的精度。Specifically, for all the sample historical data in each of the training batches during the training process, the first probability distribution is obtained according to the corresponding first prediction results, and for all the expanded sample data generated in each of the training batches, the corresponding second features are input into the prediction module, and the second probability distribution is obtained based on the multiple second prediction results output by the prediction module. If the expanded data generated by the sample expansion module has the same feature space as the real data, and the prediction module has fully learned the intrinsic connection between the feature space and the fault category, then the first probability distribution and the second probability distribution should be consistent. Therefore, calculating the first batch distribution loss based on the difference between the first probability distribution and the second probability distribution for updating the module parameters can effectively improve the accuracy of the model.
从前面的说明可以看出,对于所述目标训练批次中的每个所述样本历史数据,都可以得到对应的所述第一损失和所述第二损失,对所述第一损失和所述第二损失进行求和,得到一个所述样本历史数据对应的样本损失,对所述目标训练批次中每个所述样本历史数据对应所述样本损失求和,得到第二批次分损失,对所述第一批次分损失和所述第二批次分损失进行求和得到所述目标训练批次对应的所述批次训练损失,基于所述批次训练损失更新所述样本扩充模块、所述特征提取模块、所述预测模块和所述判别器的参数。From the foregoing description, it can be seen that for each sample historical data in the target training batch, the corresponding first loss and second loss can be obtained, the first loss and the second loss are summed to obtain a sample loss corresponding to the sample historical data, the sample losses corresponding to each sample historical data in the target training batch are summed to obtain the second batch loss, the first batch loss and the second batch loss are summed to obtain the batch training loss corresponding to the target training batch, and the parameters of the sample expansion module, the feature extraction module, the prediction module and the discriminator are updated based on the batch training loss.
进一步地,在本实施例中,每次选取所述样本历史数据组成所述目标训练批次时,是随机选取,也就是说,每次更新模块参数时所用的所述目标训练批次中的数据组合都是不一样的,这样每次的概率分布也不一样,可以起到类似扩充训练数据的效果。Furthermore, in this embodiment, each time the sample historical data is selected to form the target training batch, it is selected randomly, that is, the data combination in the target training batch used to update the module parameters each time is different, so that the probability distribution is also different each time, which can have a similar effect as expanding the training data.
进一步地,由于所述历史数据的数据量较大,维数较大,为了降低所述神经网络的计算量,在本实施例中,在将所述目标风力发电机的历史数据输入至已训练的神经网络之前,还对所述目标风力发电机的历史数据进行降维处理,同样地,所述通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征,包括:Furthermore, since the amount of data of the historical data is large and the dimension is large, in order to reduce the amount of calculation of the neural network, in this embodiment, before the historical data of the target wind turbine is input into the trained neural network, the historical data of the target wind turbine is also subjected to dimensionality reduction processing. Similarly, the features of the target sample historical data and the target sample expanded data are respectively extracted by the feature extraction module to obtain the first feature and the second feature, including:
分别对所述目标样本历史数据和所述目标样本扩充数据进行降维处理,得到第一降维数据和第二降维数据;Respectively performing dimensionality reduction processing on the target sample historical data and the target sample expanded data to obtain first dimensionality reduction data and second dimensionality reduction data;
分别将所述第一降维数据和所述第二降维数据输入至所述特征提取模块,获取所述特征提取模块输出的所述第一特征和所述第二特征。The first dimension reduction data and the second dimension reduction data are respectively input into the feature extraction module to obtain the first feature and the second feature output by the feature extraction module.
在所述神经网络训练完成后,可以将所述目标风力发电机的故障预测结果输入至训练完成后的所述神经网络中,所述神经网络输出所述目标风力发电机的故障预测结果,为了进一步地方便维护人员获取所述故障预测结果,所述获取所述神经网络输出的所述目标风力发电机的故障预测结果之后,包括:After the neural network training is completed, the fault prediction result of the target wind turbine generator can be input into the trained neural network, and the neural network outputs the fault prediction result of the target wind turbine generator. In order to further facilitate maintenance personnel to obtain the fault prediction result, after obtaining the fault prediction result of the target wind turbine generator output by the neural network, the method includes:
将所述目标风力发电机的故障预测结果更新至所述目标风力发电机的数字孪生模型中。The fault prediction result of the target wind turbine is updated into the digital twin model of the target wind turbine.
在所述目标风力发电机的数字孪生模型中显示所述故障预测结果,可以是基于所述故障预测结果对应的零部件,将所述故障预测结果显示在对应的零部件的三维模型上,更加直观。Displaying the fault prediction result in the digital twin model of the target wind turbine may be based on the components corresponding to the fault prediction result, and displaying the fault prediction result on the three-dimensional model of the corresponding components, which is more intuitive.
综上所述,本实施例提供一种基于多源异构数据的风力发电机故障预测方法,通过神经网络预测风力风电机的故障,并且针对现有的风力发电机的真实标签数据不足的问题,对真实数据进行扩充处理,使得神经网络的训练数据集中的数据量更大,提升训练得到的神经网络对故障预测的准确性,不需要采用人工对风力发电机进行检查来排查故障,提升了风力发电机的故障检测效率。In summary, this embodiment provides a wind turbine fault prediction method based on multi-source heterogeneous data, which predicts wind turbine faults through a neural network, and in order to address the problem of insufficient real label data of existing wind turbines, expands the real data, so that the amount of data in the training data set of the neural network is larger, thereby improving the accuracy of fault prediction of the trained neural network, eliminating the need for manual inspection of wind turbines to troubleshoot faults, and improving the fault detection efficiency of wind turbines.
应该理解的是,虽然本发明说明书附图中给出的的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,流程图中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that, although the steps in the flowcharts given in the accompanying drawings of the present specification are displayed in sequence according to the indications of the arrows, these steps are not necessarily executed in sequence in the order indicated by the arrows. Unless there is a clear description in this article, there is no strict order restriction on the execution of these steps, and these steps can be executed in other orders. Moreover, at least a part of the steps in the flowchart may include multiple sub-steps or multiple stages, and these sub-steps or stages are not necessarily executed at the same time, but can be executed at different times, and the execution order of these sub-steps or stages is not necessarily to be carried out in sequence, but can be executed in turn or alternately with other steps or at least a part of the sub-steps or stages of other steps.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取计算机可读存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本发明所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink) DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those skilled in the art can understand that all or part of the processes in the above-mentioned embodiments can be completed by instructing the relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable computer-readable storage medium. When the computer program is executed, it can include the processes of the embodiments of the above-mentioned methods. Among them, any reference to memory, storage, database or other media used in the embodiments provided by the present invention can include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM) or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. As an illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
实施例二Embodiment 2
基于上述实施例,本发明还相应提供了一种基于多源异构数据的风力发电机故障预测装置,如图2所示,所述基于多源异构数据的风力发电机故障预测装置包括:Based on the above embodiments, the present invention further provides a wind turbine fault prediction device based on multi-source heterogeneous data. As shown in FIG2 , the wind turbine fault prediction device based on multi-source heterogeneous data includes:
数据获取模块,所述数据获取模块用于获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段,具体如实施例一中所述;A data acquisition module, the data acquisition module is used to acquire historical data of a target wind turbine, the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, the preset time period is a period of preset duration before the current moment, as described in the first embodiment;
预测模块,所述预测模块用于将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果,具体如实施例一中所述;A prediction module, the prediction module is used to input the historical data of the target wind turbine into a trained neural network, and obtain a fault prediction result of the target wind turbine output by the neural network, as specifically described in the first embodiment;
其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据,具体如实施例一中所述。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data, as specifically described in Example 1.
实施例三Embodiment 3
基于上述实施例,本发明还相应提供了一种终端,如图3所示,所述终端包括处理器10以及存储器20。图3仅示出了终端的部分组件,但是应理解的是,并不要求实施所有示出的组件,可以替代的实施更多或者更少的组件。Based on the above embodiments, the present invention also provides a terminal, as shown in Figure 3, the terminal includes a processor 10 and a memory 20. Figure 3 only shows some components of the terminal, but it should be understood that it is not required to implement all the components shown, and more or fewer components can be implemented instead.
所述存储器20在一些实施例中可以是所述终端的内部存储单元,例如终端的硬盘或内存。所述存储器20在另一些实施例中也可以是所述终端的外部存储设备,例如所述终端上配备的插接式硬盘,智能存储卡(Smart Media Card, SMC),安全数字(Secure Digital, SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器20还可以既包括所述终端的内部存储单元也包括外部存储设备。所述存储器20用于存储安装于所述终端的应用软件及各类数据。所述存储器20还可以用于暂时地存储已经输出或者将要输出的数据。在一实施例中,存储器20上存储有基于多源异构数据的风力发电机故障预测程序30,该基于多源异构数据的风力发电机故障预测程序30可被处理器10所执行,从而实现本申请中基于多源异构数据的风力发电机故障预测方法。In some embodiments, the memory 20 may be an internal storage unit of the terminal, such as a hard disk or memory of the terminal. In other embodiments, the memory 20 may also be an external storage device of the terminal, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash card (Flash Card), etc. equipped on the terminal. Further, the memory 20 may also include both an internal storage unit of the terminal and an external storage device. The memory 20 is used to store application software and various types of data installed on the terminal. The memory 20 may also be used to temporarily store data that has been output or is to be output. In one embodiment, a wind turbine fault prediction program 30 based on multi-source heterogeneous data is stored on the memory 20, and the wind turbine fault prediction program 30 based on multi-source heterogeneous data can be executed by the processor 10, thereby realizing the wind turbine fault prediction method based on multi-source heterogeneous data in the present application.
所述处理器10在一些实施例中可以是一中央处理器(Central Processing Unit, CPU),微处理器或其他芯片,用于运行所述存储器20中存储的程序代码或处理数据,例如执行所述基于多源异构数据的风力发电机故障预测方法等。In some embodiments, the processor 10 may be a central processing unit (CPU), a microprocessor or other chip, used to run the program code or process data stored in the memory 20, such as executing the wind turbine fault prediction method based on multi-source heterogeneous data.
在一实施例中,当处理器10执行所述存储器20中基于多源异构数据的风力发电机故障预测程序30时实现以下步骤:In one embodiment, when the processor 10 executes the wind turbine fault prediction program 30 based on multi-source heterogeneous data in the memory 20, the following steps are implemented:
获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段;Acquire historical data of a target wind turbine, wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a time period of a preset duration before a current moment;
将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果;Inputting historical data of the target wind turbine into a trained neural network, and obtaining a fault prediction result of the target wind turbine output by the neural network;
其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
实施例四Embodiment 4
本发明还提供一种计算机可读存储介质,其中,存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现如上所述的基于多源异构数据的风力发电机故障预测方法的步骤。The present invention also provides a computer-readable storage medium, which stores one or more programs, and the one or more programs can be executed by one or more processors to implement the steps of the wind turbine fault prediction method based on multi-source heterogeneous data as described above.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit it. Although the present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that they can still modify the technical solutions described in the aforementioned embodiments, or make equivalent replacements for some of the technical features therein. However, these modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

  1. 一种基于多源异构数据的风力发电机故障预测方法,其特征在于,所述方法包括:A wind turbine fault prediction method based on multi-source heterogeneous data, characterized in that the method comprises:
    获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段;Acquire historical data of a target wind turbine, wherein the historical data of the target wind turbine at least includes deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, wherein the preset time period is a time period of a preset duration before a current moment;
    将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果;Inputting historical data of the target wind turbine into a trained neural network, and obtaining a fault prediction result of the target wind turbine output by the neural network;
    其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
  2. 根据权利要求1所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,所述获取目标风力发电机的历史数据,包括:The method for predicting wind turbine faults based on multi-source heterogeneous data according to claim 1 is characterized in that the step of obtaining historical data of the target wind turbine comprises:
    获取所述目标风力发电机的机头位置,根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据。The nose position of the target wind turbine is acquired, and deformation data of each blade of the target wind turbine is acquired according to the nose position and a reflection signal of a light signal sent to the target wind turbine.
  3. 根据权利要求2所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,所述根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据,包括:The wind turbine fault prediction method based on multi-source heterogeneous data according to claim 2 is characterized in that the deformation data of each blade of the target wind turbine is obtained according to the head position and the reflection signal of the light signal sent to the target wind turbine, including:
    获取所述目标风力发电机的机头基准位置以及所述机头基准位置对应的基准反射信号数据;Acquire a nose reference position of the target wind turbine and reference reflection signal data corresponding to the nose reference position;
    根据所述机头位置和所述机头基准位置之间的差异、以及向所述目标风力发电机发出的光信号的反射信号与所述基准反射信号数据获取所述目标风力发电机的各个叶片的形变数据。The deformation data of each blade of the target wind turbine is acquired according to the difference between the nose position and the nose reference position, and the reflected signal of the light signal sent to the target wind turbine and the reference reflected signal data.
  4. 根据权利要求2所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,根据所述机头位置和向所述目标风力发电机发出的光信号的反射信号获取所述目标风力发电机的各个叶片的形变数据之后,包括:The wind turbine fault prediction method based on multi-source heterogeneous data according to claim 2 is characterized in that after obtaining the deformation data of each blade of the target wind turbine according to the head position and the reflection signal of the light signal sent to the target wind turbine, it includes:
    根据所述目标风力发电机当前的叶片形变数据,基于所述目标风力发电机当前的叶片形变数据更新所述目标风力发电机的数字孪生模型中的三维模型;According to the current blade deformation data of the target wind turbine, updating the three-dimensional model in the digital twin model of the target wind turbine based on the current blade deformation data of the target wind turbine;
    所述获取所述神经网络输出的所述目标风力发电机的故障预测结果之后,包括:After obtaining the fault prediction result of the target wind turbine generator output by the neural network, the method further comprises:
    将所述目标风力发电机的故障预测结果更新至所述目标风力发电机的数字孪生模型中。The fault prediction result of the target wind turbine is updated into the digital twin model of the target wind turbine.
  5. 根据权利要求1所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,所述神经网络包括特征提取模块和预测模块,所述神经网络的训练过程为:The wind turbine fault prediction method based on multi-source heterogeneous data according to claim 1 is characterized in that the neural network includes a feature extraction module and a prediction module, and the training process of the neural network is:
    在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次,对于所述目标训练批次中的目标样本历史数据,执行如下步骤:Select a portion of the sample history data from each of the sample history data to form a target training batch, and perform the following steps for the target sample history data in the target training batch:
    将所述目标样本历史数据输入至样本扩充模块,生成目标样本扩充数据;Inputting the target sample historical data into a sample expansion module to generate target sample expansion data;
    通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征;The feature extraction module extracts the features of the target sample historical data and the target sample expanded data respectively to obtain a first feature and a second feature;
    将所述第一特征和所述第二特征分别输入至所述预测模块,获取所述预测模块输出的第一预测结果和第二预测结果;Inputting the first feature and the second feature into the prediction module respectively, and obtaining a first prediction result and a second prediction result output by the prediction module;
    根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失;Obtaining a batch training loss according to the first feature, the second feature, the first prediction result, the second prediction result, and the fault label corresponding to each sample historical data in the target training batch;
    根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块和所述预测模块的参数,并重新执行所述在各个所述样本历史数据中选取部分所述样本历史数据组成目标训练批次的步骤,直至参数收敛。The parameters of the sample expansion module, the feature extraction module and the prediction module are updated according to the batch training loss, and the step of selecting part of the sample historical data from each of the sample historical data to form a target training batch is re-executed until the parameters converge.
  6. 根据权利要求5所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,所述根据所述目标训练批次中的每个样本历史数据对应的所述第一特征、所述第二特征、所述第一预测结果、所述第二预测结果以及所述目标样本历史数据对应的故障标注标签得到批次训练损失,包括:The wind turbine fault prediction method based on multi-source heterogeneous data according to claim 5 is characterized in that the batch training loss is obtained according to the first feature, the second feature, the first prediction result, the second prediction result and the fault label corresponding to the target sample historical data corresponding to each sample historical data in the target training batch, including:
    根据所述目标样本历史数据对应的故障标注标签和所述第一预测结果得到第一损失;Obtaining a first loss according to the fault label corresponding to the target sample historical data and the first prediction result;
    将所述第一特征和所述第二特征分别输入至判别器,获取所述判别器输出的判别结果,根据所述判别结果得到第二损失;Inputting the first feature and the second feature into a discriminator respectively, obtaining a discrimination result output by the discriminator, and obtaining a second loss according to the discrimination result;
    根据所述第一损失和所述第二损失得到所述目标样本历史数据对应的样本损失;Obtaining a sample loss corresponding to the target sample historical data according to the first loss and the second loss;
    获取所述训练批次中每个样本历史数据对应的所述第一故障预测结果的概率分布作为第一概率分布,获取所述训练批次中每个样本历史数据对应的所述第二故障预测结果的概率分布作为第二概率分布;Obtaining the probability distribution of the first fault prediction result corresponding to each sample historical data in the training batch as a first probability distribution, and obtaining the probability distribution of the second fault prediction result corresponding to each sample historical data in the training batch as a second probability distribution;
    根据所述第一概率分布和所述第二概率分布得到第一批次分损失;Obtaining a first batch loss according to the first probability distribution and the second probability distribution;
    对所述训练批次中的每个样本历史数据分别对应的样本损失进行求和,得到第二批次分损失;The sample losses corresponding to each sample historical data in the training batch are summed to obtain the second batch loss;
    根据所述第一批次分损失和所述第二批次分损失得到所述批次训练损失;Obtaining the batch training loss according to the first batch loss and the second batch loss;
    所述根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块和所述预测模块的参数,包括:The updating of the parameters of the sample expansion module, the feature extraction module and the prediction module according to the batch training loss includes:
    根据所述批次训练损失更新所述样本扩充模块、所述特征提取模块、所述预测模块和所述判别器的参数。The parameters of the sample expansion module, the feature extraction module, the prediction module and the discriminator are updated according to the batch training loss.
  7. 根据权利要求5所述的基于多源异构数据的风力发电机故障预测方法,其特征在于,所述通过所述特征提取模块分别提取所述目标样本历史数据和所述目标样本扩充数据的特征,得到第一特征和第二特征,包括:The method for predicting wind turbine faults based on multi-source heterogeneous data according to claim 5 is characterized in that the feature extraction module extracts the features of the target sample historical data and the target sample expanded data respectively to obtain the first feature and the second feature, including:
    分别对所述目标样本历史数据和所述目标样本扩充数据进行降维处理,得到第一降维数据和第二降维数据;Respectively performing dimensionality reduction processing on the target sample historical data and the target sample expanded data to obtain first dimensionality reduction data and second dimensionality reduction data;
    分别将所述第一降维数据和所述第二降维数据输入至所述特征提取模块,获取所述特征提取模块输出的所述第一特征和所述第二特征。The first dimension reduction data and the second dimension reduction data are respectively input into the feature extraction module to obtain the first feature and the second feature output by the feature extraction module.
  8. 一种基于多源异构数据的风力发电机故障预测装置,其特征在于,包括:A wind turbine fault prediction device based on multi-source heterogeneous data, characterized by comprising:
    数据获取模块,所述数据获取模块用于获取目标风力发电机的历史数据,所述目标风力发电机的历史数据至少包括所述目标风力发电机在预设时段内的各个叶片的形变数据以及所述目标风力发电机的振动数据,所述预设时段为当前时刻前预设时长的时段;A data acquisition module, the data acquisition module is used to acquire historical data of a target wind turbine, the historical data of the target wind turbine at least including deformation data of each blade of the target wind turbine within a preset time period and vibration data of the target wind turbine, the preset time period being a period of preset duration before a current moment;
    预测模块,所述预测模块用于将所述目标风力发电机的历史数据输入至已训练的神经网络,获取所述神经网络输出的所述目标风力发电机的故障预测结果;A prediction module, the prediction module is used to input the historical data of the target wind turbine into a trained neural network, and obtain a fault prediction result of the target wind turbine output by the neural network;
    其中,所述神经网络是基于训练数据集训练完成的,所述训练数据集中包括样本历史数据和样本扩充数据,所述样本历史数据为真实采集的风力发电机的历史数据,所述样本扩充数据为对所述样本历史数据进行样本扩充处理生成的数据。Wherein, the neural network is trained based on a training data set, and the training data set includes sample historical data and sample expansion data, the sample historical data is historical data of wind turbines actually collected, and the sample expansion data is data generated by performing sample expansion processing on the sample historical data.
  9. 一种终端,其特征在于,所述终端包括:处理器、与处理器通信连接的计算机可读存储介质,所述计算机可读存储介质适于存储多条指令,所述处理器适于调用所述计算机可读存储介质中的指令,以执行实现上述权利要求1-7任一项所述的基于多源异构数据的风力发电机故障预测方法的步骤。A terminal, characterized in that the terminal includes: a processor, a computer-readable storage medium communicatively connected to the processor, the computer-readable storage medium is suitable for storing multiple instructions, and the processor is suitable for calling the instructions in the computer-readable storage medium to execute the steps of the wind turbine fault prediction method based on multi-source heterogeneous data as described in any one of claims 1 to 7.
  10. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现如权利要求1-7任一项所述的基于多源异构数据的风力发电机故障预测方法的步骤。A computer-readable storage medium, characterized in that the computer-readable storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement the steps of the wind turbine fault prediction method based on multi-source heterogeneous data as described in any one of claims 1-7.
PCT/CN2022/137737 2022-10-10 2022-12-08 Wind turbine fault prediction method based on multi-source heterogeneous data WO2024077753A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211231295.4A CN115293057B (en) 2022-10-10 2022-10-10 Wind driven generator fault prediction method based on multi-source heterogeneous data
CN202211231295.4 2022-10-10

Publications (1)

Publication Number Publication Date
WO2024077753A1 true WO2024077753A1 (en) 2024-04-18

Family

ID=83819425

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/137737 WO2024077753A1 (en) 2022-10-10 2022-12-08 Wind turbine fault prediction method based on multi-source heterogeneous data

Country Status (2)

Country Link
CN (1) CN115293057B (en)
WO (1) WO2024077753A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115293057B (en) * 2022-10-10 2022-12-20 深圳先进技术研究院 Wind driven generator fault prediction method based on multi-source heterogeneous data
CN116226676B (en) * 2023-05-08 2023-07-21 中科航迈数控软件(深圳)有限公司 Machine tool fault prediction model generation method suitable for extreme environment and related equipment
CN117006002B (en) * 2023-09-27 2024-02-09 广东海洋大学 Digital twinning-based offshore wind turbine monitoring method and system
CN117421244B (en) * 2023-11-17 2024-05-24 北京邮电大学 Multi-source cross-project software defect prediction method, device and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670595A (en) * 2019-03-04 2019-04-23 丽水学院 A kind of equipment fault diagnosis method
CN112733283A (en) * 2020-12-21 2021-04-30 北京华能新锐控制技术有限公司 Wind turbine generator component fault prediction method
WO2021107866A1 (en) * 2019-11-25 2021-06-03 Envision Digital International Pte. Ltd. Method and apparatus for detecting fault, method and apparatus for training model, and device and storage medium
CN114118251A (en) * 2021-11-22 2022-03-01 国网辽宁省电力有限公司沈阳供电公司 Fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network
CN115293057A (en) * 2022-10-10 2022-11-04 深圳先进技术研究院 Wind driven generator fault prediction method based on multi-source heterogeneous data

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11675641B2 (en) * 2018-07-02 2023-06-13 Nec Corporation Failure prediction
US10956632B2 (en) * 2018-12-27 2021-03-23 Utopus Insights, Inc. Scalable system and engine for forecasting wind turbine failure
US20200210824A1 (en) * 2018-12-28 2020-07-02 Utopus Insights, Inc. Scalable system and method for forecasting wind turbine failure with varying lead time windows
CN111275108A (en) * 2020-01-20 2020-06-12 国网山东省电力公司枣庄供电公司 Method for performing sample expansion on partial discharge data based on generation countermeasure network
CN111858926A (en) * 2020-06-08 2020-10-30 北京市基础设施投资有限公司(原北京地铁集团有限责任公司) Pipe gallery equipment fault prediction method and system based on multi-element heterogeneous data
CN112115024B (en) * 2020-09-03 2023-07-18 上海上讯信息技术股份有限公司 Training method and device for fault prediction neural network model
CN113935443B (en) * 2021-12-20 2022-04-22 广东电网有限责任公司广州供电局 Heterogeneous equipment multi-domain joint fault prediction method, device, terminal and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109670595A (en) * 2019-03-04 2019-04-23 丽水学院 A kind of equipment fault diagnosis method
WO2021107866A1 (en) * 2019-11-25 2021-06-03 Envision Digital International Pte. Ltd. Method and apparatus for detecting fault, method and apparatus for training model, and device and storage medium
CN112733283A (en) * 2020-12-21 2021-04-30 北京华能新锐控制技术有限公司 Wind turbine generator component fault prediction method
CN114118251A (en) * 2021-11-22 2022-03-01 国网辽宁省电力有限公司沈阳供电公司 Fault diagnosis and early warning method based on multi-source data fusion and convolution twin neural network
CN115293057A (en) * 2022-10-10 2022-11-04 深圳先进技术研究院 Wind driven generator fault prediction method based on multi-source heterogeneous data

Also Published As

Publication number Publication date
CN115293057A (en) 2022-11-04
CN115293057B (en) 2022-12-20

Similar Documents

Publication Publication Date Title
WO2024077753A1 (en) Wind turbine fault prediction method based on multi-source heterogeneous data
CN111210024B (en) Model training method, device, computer equipment and storage medium
JP2021064370A (en) Method and system for semi-supervised deep abnormality detection for large-scale industrial monitoring system based on time-series data utilizing digital twin simulation data
Lu et al. GAN-based data augmentation strategy for sensor anomaly detection in industrial robots
Mourtzis et al. Intelligent predictive maintenance and remote monitoring framework for industrial equipment based on mixed reality
US20070100487A1 (en) Method and system for virtual metrology in semiconductor manufacturing
Kim et al. Abnormality diagnosis model for nuclear power plants using two-stage gated recurrent units
WO2021066867A1 (en) System and method for fusing multiple analytics of a wind turbine for improved efficiency
CN113962433A (en) Wind power prediction method and system fusing causal convolution and separable time convolution
EP4111373A1 (en) Robust artificial intelligence inference in edge computing devices
Javadi et al. An algorithm for practical power curve estimation of wind turbines
CN116627773B (en) Abnormality analysis method and system of production and marketing difference statistics platform system
US11907865B2 (en) Determination of security-constrained optimal power flow
CN115712874A (en) Thermal energy power system fault diagnosis method and device based on time series characteristics
CN114358172A (en) Nuclear reactor fault classification method, apparatus, computer device, and storage medium
CN116108989B (en) Wind power ultra-short-term power prediction method, system, storage medium and device
CN113887820A (en) Method and device for predicting fault of electric power spot business system, computer equipment and storage medium
CN114065460A (en) Model processing method, storage medium and electronic device in thermal power generation system
CN112364716A (en) Nuclear power equipment abnormal information detection method and device and computer equipment
US12001180B2 (en) Condition-based method for malfunction prediction
CN116722548B (en) Photovoltaic power generation prediction method based on time sequence model and related equipment
US20210382447A1 (en) Condition-Based Method for Malfunction Prediction
Biard et al. Reliability Assessment of an Electrical Network with Digital Twins
EP4361742A1 (en) Prediction system, information processing device, and information processing program
WO2024039510A1 (en) Incremental causal discovery and root cause localization for online system fault diagnosis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22961932

Country of ref document: EP

Kind code of ref document: A1