CN115329880A - Meteorological feature extraction method and device, computer equipment and storage medium - Google Patents
Meteorological feature extraction method and device, computer equipment and storage medium Download PDFInfo
- Publication number
- CN115329880A CN115329880A CN202210997566.0A CN202210997566A CN115329880A CN 115329880 A CN115329880 A CN 115329880A CN 202210997566 A CN202210997566 A CN 202210997566A CN 115329880 A CN115329880 A CN 115329880A
- Authority
- CN
- China
- Prior art keywords
- meteorological
- feature set
- feature
- features
- meteorological feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 54
- 238000010248 power generation Methods 0.000 claims abstract description 134
- 238000012216 screening Methods 0.000 claims abstract description 85
- 238000000034 method Methods 0.000 claims abstract description 74
- 238000007637 random forest analysis Methods 0.000 claims abstract description 51
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 47
- 238000004590 computer program Methods 0.000 claims abstract description 40
- 238000010219 correlation analysis Methods 0.000 claims abstract description 37
- 230000008030 elimination Effects 0.000 claims abstract description 37
- 238000003379 elimination reaction Methods 0.000 claims abstract description 37
- 238000003066 decision tree Methods 0.000 claims description 59
- 238000012360 testing method Methods 0.000 claims description 42
- 238000012549 training Methods 0.000 claims description 24
- 238000005070 sampling Methods 0.000 claims description 11
- 238000002790 cross-validation Methods 0.000 claims description 10
- 238000001914 filtration Methods 0.000 claims 1
- 238000012545 processing Methods 0.000 abstract description 5
- 239000000284 extract Substances 0.000 abstract description 4
- 238000010586 diagram Methods 0.000 description 8
- 238000010187 selection method Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000005855 radiation Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 229910021389 graphene Inorganic materials 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
Landscapes
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Economics (AREA)
- Public Health (AREA)
- Water Supply & Treatment (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Supply And Distribution Of Alternating Current (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The application relates to a method, a device, computer equipment, a storage medium and a computer program product for extracting meteorological features influencing generated power in a new energy power generation system. The method comprises the steps of obtaining an initial meteorological feature set; screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set; based on correlation analysis, meteorological feature screening is carried out on the first meteorological feature set to obtain a second meteorological feature set; and screening the meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set. Whole scheme carries out the primary extraction to meteorological feature according to random forest algorithm, on the basis of extracting the characteristic for the first time, carries out correlation analysis to meteorological feature, extracts once more according to correlation analysis result, and then carries out the recursion to remaining characteristic and eliminates, through screening many times, will influence the biggest characteristic screening processing to new forms of energy generated power, and then obtain more accurate meteorological feature.
Description
Technical Field
The present application relates to the field of new energy technologies, and in particular, to a method, an apparatus, a computer device, a storage medium, and a computer program product for extracting meteorological features affecting generated power in a new energy power generation system.
Background
The new energy generally refers to renewable energy developed and utilized on the basis of new technology. With the limited nature of conventional energy and the increasing prominence of environmental issues, new energy characterized by environmental protection and regeneration is more and more emphasized. The output of new energy is influenced by meteorological factors, and has strong randomness and volatility. The large-scale new energy grid connection brings challenges to safe and stable operation of an electric power system, and power prediction is one of key measures for solving the new energy grid connection.
In the traditional power prediction process, power prediction is mainly carried out by selecting meteorological features. However, due to various meteorological characteristic factors influencing the output of the new energy, the meteorological characteristics influencing the output of the new energy power generation system under different climatic conditions are different. And therefore, accurate characteristics affecting the generated power cannot be extracted.
Disclosure of Invention
In view of the above, it is necessary to provide a meteorological feature extraction method, apparatus, computer device, computer readable storage medium, and computer program product capable of obtaining meteorological features that accurately affect new energy generated power.
In a first aspect, the application provides a meteorological feature extraction method for influencing generated power in a new energy power generation system. The method comprises the following steps:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
In one embodiment, screening meteorological features in an initial meteorological feature set by using a random forest algorithm to obtain a first meteorological feature set includes: acquiring power generation sample data of the new energy power generation system; according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, according to power generation sample data, scoring meteorological features in the initial meteorological feature set by using a random forest algorithm, and obtaining feature scores includes: randomly sampling power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain the feature score includes: calculating the prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the meteorological feature screening the first meteorological feature set based on the correlation analysis between the meteorological features and the generated power, and the obtaining the second meteorological feature set comprises: calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the step of screening the meteorological features in the second meteorological feature set by using a recursive feature elimination method to obtain the target meteorological feature set includes: performing meteorological feature extraction on the second meteorological feature by adopting a recursive feature elimination method; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
In a second aspect, the application further provides a meteorological feature extraction device for influencing the generated power in the new energy power generation system. The device includes:
the acquisition module is used for acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
the first extraction module is used for screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
the second extraction module is used for screening meteorological features of the first meteorological feature set based on correlation analysis between the meteorological features and the generated power to obtain a second meteorological feature set;
the third extraction module is used for screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
In one embodiment, the first extraction module is further configured to obtain power generation sample data of the new energy power generation system; according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, the first extraction module is further configured to perform random sampling on the power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, the first extraction module is further configured to calculate a prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the second extraction module is further used for calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the third extraction module is further configured to perform meteorological feature extraction on the second meteorological feature by using a recursive feature elimination method; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
In a third aspect, the present application also provides a computer device. The computer device comprises a memory storing a computer program and a processor implementing the following steps when executing the computer program:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
In a fourth aspect, the present application further provides a computer-readable storage medium. The computer-readable storage medium, on which a computer program is stored which, when executed by a processor, carries out the steps of:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is gradually increased.
In a fifth aspect, the present application further provides a computer program product. The computer program product comprising a computer program which when executed by a processor performs the steps of:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
The method, the device, the computer equipment, the storage medium and the computer program product for extracting the meteorological features influencing the generated power in the new energy power generation system obtain the initial meteorological features influencing the generated power in the new energy power generation system to obtain an initial meteorological feature set; screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set; based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set; screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set; and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step. The whole scheme acquires a comprehensive meteorological feature set, and then carries out preliminary extraction on meteorological features according to a random forest algorithm, and then on the basis of the features extracted for the first time, carries out correlation analysis on the meteorological features, extracts again according to correlation analysis results, and then carries out recursion elimination on the remaining features, and through screening many times, the feature screening processing with the largest influence on the new energy power generation power is carried out, and then more accurate meteorological features are obtained.
Drawings
FIG. 1 is a diagram illustrating an exemplary application of a method for extracting meteorological features affecting power generation in a new energy power generation system;
FIG. 2 is a schematic flow chart illustrating a method for extracting meteorological features influencing power generation in a new energy power generation system according to an embodiment;
FIG. 3 is a schematic flow chart of a first weather feature extraction process in one embodiment;
FIG. 4 is a schematic representation of the correlation between the pre-second meteorological feature extraction features and the power generated in one embodiment;
FIG. 5 is a schematic representation of the correlation between the extracted second meteorological feature and the generated power in one embodiment;
FIG. 6 is a schematic diagram of a linear relationship between meteorological features after extraction of a second meteorological feature, in one embodiment;
FIG. 7 is a graphical illustration of the relationship between the number of features and the prediction accuracy in one embodiment;
FIG. 8 is a schematic diagram illustrating the accuracy of the meteorological features extracted by the present application before and after power generation power prediction in one embodiment;
FIG. 9 is a complete diagram of the meteorological feature extraction method in one embodiment;
FIG. 10 is a block diagram showing a meteorological feature extraction unit for a new energy power generation system according to an embodiment, the meteorological feature extraction unit influencing power generation;
FIG. 11 is a diagram of the internal structure of a computer device in one embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more clearly understood, the present application is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
At present, most of new energy power generation power prediction methods consider the influence of selection of an algorithm model on prediction accuracy, and correlation rules among meteorological features cannot be fully mined. Due to the fact that meteorological characteristic factors of new energy output are various, for new energy power generation systems (namely new energy stations) under different climatic conditions, meteorological characteristics influencing output are different, and a set of characteristic selection method which is flow-based, transplantable and wide in application range is lacked. Therefore, the potential law of the meteorological features is fully utilized, the optimal training features are selected, and the method plays an important role in improving the power prediction accuracy of the new energy.
According to the method, firstly, a meteorological feature factor combination which has the largest influence on the power change of the new energy is screened out based on the influence power of a random forest algorithm, on the basis, the correlation between meteorological features and power generation power is researched by applying statistical correlation analysis, the meteorological features are further screened out, and finally, a recursive feature elimination method is used for determining a feature set which is finally screened out for prediction. The method is used for selecting meteorological features during power prediction, and the new energy power prediction precision can be improved.
The method for extracting meteorological features influencing generated power in the new energy power generation system provided by the embodiment of the application can be applied to an application environment shown in fig. 1. Wherein, the user 102 communicates with the terminal 104 through the network, and the terminal 104 communicates with the new energy power generation system 106 through the network. The data storage system may store power generation sample data in the new energy power generation system 106. The data storage system may be integrated on the new energy power generation system 106, or may be placed on the cloud or other network server. A user 102 initiates a meteorological feature analysis request to a terminal 104, the terminal 104 receives the meteorological feature analysis request, an initial meteorological feature set is obtained from a new energy power generation system 106 according to the meteorological feature analysis request, and a random forest algorithm is adopted to screen meteorological features in the initial meteorological feature set to obtain a first meteorological feature set; based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set; screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set; and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step. The terminal 104 may be, but not limited to, various physical servers, virtual hosts, personal computers, notebook computers, smart phones, tablet computers, internet of things devices, and portable wearable devices, where the internet of things devices may be smart speakers, smart televisions, smart air conditioners, smart car-mounted devices, and the like. The portable wearable device can be a smart watch, a smart bracelet, a head-mounted device, and the like. The new energy power generation system 106 may be implemented by a stand-alone server or a server cluster composed of a plurality of servers.
In an embodiment, as shown in fig. 2, a method for extracting meteorological features affecting generated power in a new energy power generation system is provided, and this embodiment is exemplified by applying the method to a terminal, and includes the following steps:
step 202, obtaining initial meteorological features influencing the generated power in the new energy power generation system to obtain an initial meteorological feature set.
The new energy power generation system comprises new energy power generation systems of different power generation types, such as a photovoltaic power generation station, a wind power station and the like. The initial meteorological features refer to all meteorological features influencing the new energy power generation power, and a set formed by all meteorological features in the initial meteorological features is an initial meteorological feature set. The initial meteorological feature set includes Wind Speed (WS), wind Direction (WD), temperature (TEM), air Density (Density), air Pressure (PRS), total radiation (SR), direct radiation (SWDDIF), scattered radiation (SWDDIR), high cloud number (TCC), total cloud number (HCC), and the like.
Specifically, the terminal obtains a meteorological feature analysis request of a user, analyzes the meteorological feature analysis request, determines meteorological feature types such as a wind power station, a photovoltaic station and the like, obtains initial meteorological features which affect the power generation power in the new energy power generation system and correspond to the meteorological feature types, and takes a set formed by the initial meteorological features as an initial meteorological feature set. The initial meteorological features can be obtained by performing correlation analysis on all meteorological features and the new energy power generation power by the terminal according to all meteorological features and extracting the initial meteorological features. The initial meteorological features can also be meteorological features which are summarized according to scientific research by new energy experts and influence the generated power of the new energy.
And 204, screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set.
The first meteorological feature set refers to a meteorological feature set which is obtained by screening meteorological features through a random forest algorithm and has a large influence on the power generation power.
Specifically, the terminal adopts a random forest algorithm, a data classifier comprising a plurality of decision trees is constructed through the random forest algorithm, meteorological features which have large influence on generating power are extracted through the data classifier and the relation between historical meteorological data and historical generating power, and a first meteorological feature set is obtained.
And step 206, based on the correlation analysis between the meteorological features and the generated power, performing meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set.
Specifically, the terminal extracts the historical data of the meteorological features contained in the first meteorological feature set from the historical meteorological data, performs correlation analysis between the historical data of the first meteorological features and the historical generated power, eliminates the meteorological features lower than a preset correlation threshold value in the correlation analysis result, and combines the meteorological features remaining in the first meteorological feature set after the elimination to obtain a second meteorological feature set.
And 208, screening the meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set.
In the screening process of the terminal for several times, the meteorological features which have large influence on the generated power are screened out every time, so that the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is gradually increased.
Specifically, after the terminal obtains a second meteorological feature set, a decision tree model is recursively built, features with the strongest correlation with the power generation power are selected according to correlation coefficients of meteorological features in the second meteorological feature set, the decision tree model is built again according to the remaining meteorological features in the second meteorological feature set, features with the strongest correlation with the power generation power are selected according to correlation coefficients of the remaining meteorological features in the second meteorological feature set until meteorological feature positions of preset feature data are selected, the screening is stopped, and the meteorological features obtained through screening are used as a target meteorological feature set.
In the meteorological feature extraction method for influencing the generated power in the new energy power generation system, the initial meteorological features influencing the generated power in the new energy power generation system are obtained to obtain an initial meteorological feature set; screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set; performing meteorological feature screening on the first meteorological feature set based on correlation analysis between meteorological features and generated power to obtain a second meteorological feature set; screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set; and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step. The whole scheme acquires comprehensive meteorological feature set, and then carries out preliminary extraction to meteorological features according to random forest algorithm, and then on the basis of the feature of extracting for the first time, carries out correlation analysis to meteorological features, extracts once more according to correlation analysis result, and then carries out recursion elimination to remaining feature, through screening many times, will influence the biggest feature screening processing to new forms of energy generated power, and then obtain more accurate meteorological features.
In an alternative embodiment, as shown in fig. 3, screening meteorological features in the initial meteorological feature set by using a random forest algorithm to obtain a first meteorological feature set includes:
and 302, acquiring power generation sample data of the new energy power generation system.
The power generation sample data comprises meteorological sample data and power generation sample data. Generating sample data is obtained by collecting historical meteorological data and historical generating power data of new energy to be researched in a period of time.
Specifically, the terminal obtains a meteorological feature analysis request of a user, analyzes the meteorological feature analysis request to obtain a meteorological feature type and a sample storage path, and obtains power generation sample data of preset days and initial meteorological features influencing power generation power in the new energy power generation system according to the sample storage path and the meteorological feature type.
And step 304, grading the meteorological features in the initial meteorological feature set by adopting a random forest algorithm according to the power generation sample data to obtain feature grades.
Specifically, the terminal constructs a plurality of decision trees by adopting a random forest algorithm according to part of sample data in the power generation sample data, calculates the prediction accuracy of the constructed decision trees according to other residual sample data, and calculates according to the prediction accuracy of the decision trees to obtain the feature scores of a plurality of meteorological features.
And step 306, removing the features with the feature scores lower than a preset score threshold value from the initial meteorological feature set according to the feature scores to obtain a first meteorological feature set.
Specifically, the terminal compares the characteristic score of each meteorological feature in the initial meteorological feature set with a preset score threshold value in sequence, the meteorological features with the characteristic scores lower than the preset score threshold value are removed from the initial meteorological feature set, and the meteorological features left in the initial meteorological feature set after removal are features important for predicting the power generation power in a decision tree to form a first meteorological feature set.
In an optional embodiment, scoring the meteorological features in the initial meteorological feature set by using a random forest algorithm according to power generation sample data, and obtaining the feature score includes: randomly sampling power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
Specifically, the terminal carries out repeated random sampling on the power generation sample data, and when the total amount of data obtained by sampling reaches preset training sample data, the sampling is stopped, so that training sample data are obtained; and taking the data which is not sampled in the power generation sample data as the test sample data. And constructing a plurality of decision trees according to the meteorological sample data and the generating power sample data in the training sample data. According to the meteorological sample data and the generating power sample data in the test sample data, the prediction accuracy rates of the decision trees are verified, the importance degree of meteorological features in the decision trees is calculated according to the prediction accuracy rates of the decision trees, and the feature score of each meteorological feature is obtained.
In an optional embodiment, calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data, and obtaining the feature score includes: calculating the prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
Specifically, the terminal verifies the prediction error rates of the decision trees according to meteorological sample data and power generation power sample data in the test sample data; the terminal randomly adds noise interference to the characteristic X of all samples from the test sample data, namely randomly changes the value of the sample at the characteristic X, and calculates the noise prediction error rate (namely the error of the data outside the bag) of the decision tree; and subtracting the prediction error rate according to the noise prediction error rate, and dividing the accumulated subtraction result of the plurality of decision trees by the number of the decision trees to obtain the importance degree of the characteristic X. Randomly changing the value of each feature in the initial meteorological feature set, determining the importance degree of the feature according to the prediction error rate and the noise prediction error rate of the decision tree of each feature, and obtaining the feature score corresponding to the meteorological feature. Removing meteorological features with characteristic scores lower than a preset score threshold value from the initial meteorological feature set, wherein the meteorological features left in the initial meteorological feature set after removal are features important for predicting power generation power in a decision tree, and forming a first meteorological feature set
Further, the feature score randomly changes the value of a certain meteorological feature based on the error of the data outside the bag, and if the error of the data outside the bag greatly rises, the meteorological feature has a great influence on the classification result of the sample, and the importance degree is high. The feature score is calculated as follows:
the method comprises the steps that firstly, training sample data are obtained through repeated sampling of power generation sample data, a decision tree is built according to the training sample data, and data which are not sampled are used for evaluating the performance of a random forest and calculating the prediction error rate of a model, namely the error of data outside a bag;
secondly, calculating the prediction error rate of the model for each decision tree in the random forest, and recording the prediction error rate as errOOB1;
and thirdly, randomly adding noise interference to the characteristic X of all the samples of the data outside the bag (randomly changing the value of the sample at the characteristic X), and calculating the error of the data outside the bag, wherein the error is recorded as errOOB2.
Fourth, assuming there are N trees in the random forest, then the importance for feature X = ∑ (err 00B2-errOOB 1)/N.
Features below a preset scoring threshold are discarded. The preset score threshold is used as a variable parameter of the feature selection method, and can be manually adjusted, and the default value of the random forest score threshold set in the embodiment is 0.05.
In an optional embodiment, the performing meteorological feature screening on the first meteorological feature set based on correlation analysis between meteorological features and generated power to obtain the second meteorological feature set comprises: calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
Specifically, after the terminal obtains a first meteorological feature set, a Pearson correlation coefficient between each meteorological feature in the first meteorological feature set and generated power is calculated to obtain a characteristic correlation coefficient; and removing meteorological features of which the correlation coefficients are lower than a preset correlation threshold value from the first meteorological feature set, and combining the remaining features to obtain a second meteorological feature set.
Further, based on correlation analysis, the degree of linear relation between different meteorological features and power is researched through a statistical correlation coefficient r. Comparing all the characteristics with the absolute value of the correlation coefficient of the power, and discarding the characteristics of which the absolute value of the correlation coefficient is lower than a critical value (threshold value). The threshold value is used as a variable parameter of the feature selection method and can be manually adjusted, generally, the absolute value of the correlation coefficient is 0-0.09, no correlation is considered, 0.3-weak, 0.1-0.3 is weak correlation, 0.3-0.5 is medium correlation, and 0.5-1.0 is strong correlation. According to the method, the default value of the critical value of the photovoltaic correlation coefficient is set to be 0.2, and the default value of the critical value of the wind power correlation coefficient is set to be 0.45. The correlation coefficient r is calculated as follows:
as shown in fig. 4, before the meteorological features are screened by the correlation analysis, the correlation coefficient between each meteorological feature and power is calculated. Before screening, the meteorological features have serious linear correlation, and the performance of a prediction model is easily influenced.
As shown in fig. 5, the correlation coefficient between each meteorological feature and power is obtained after the meteorological features are screened by the correlation analysis. As shown in fig. 6, correlation between meteorological features is basically not linear correlation between meteorological features after screening.
In an optional embodiment, the step of screening the meteorological features in the second meteorological feature set by using a recursive feature elimination method to obtain the target meteorological feature set includes: adopting a recursive feature elimination method to extract meteorological features of the second meteorological feature; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
Specifically, after the terminal obtains the second meteorological feature set, the decision tree model is recursively constructed, the best feature is selected according to the correlation coefficient through cross validation, and then the process is repeated on the remaining features until all the features are traversed. The features are sorted in the recursion process, k features which are ranked at most are reserved, k values are saved as feature sets and used as parameters of the feature selection method, manual adjustment can be achieved, the default number of the set features is 4, and adjustment can be achieved according to different new energy power generation types. As shown in fig. 7, when the number of training features is determined to be 4 by using recursive elimination, the model prediction accuracy is the highest. And finally, the terminal takes the target meteorological feature set as the output of the feature engineering in a list form, and can also output and store the target meteorological feature set as a configuration file in a csv format, so that the future training or research is facilitated.
As shown in fig. 8, after the meteorological characteristic method provided by the present application is adopted for screening, the new energy generated power prediction accuracy is significantly improved.
According to the method, firstly, a meteorological feature factor combination which has the largest influence on the power change of the new energy is screened out based on the random forest algorithm, on the basis, the correlation analysis of statistics is applied, the correlation among meteorological features is researched, the meteorological features are further screened out, finally, a cross verification method and a recursive feature elimination method are used, the meteorological features are selected when the power is predicted, and the prediction accuracy of the power of the new energy can be improved. According to the method, 7 new energy power generation systems in Yunnan, guizhou and the like are selected, meteorological characteristics are selected, and through characteristic engineering, compared with new energy power prediction accuracy rates without characteristic engineering, the accuracy rate of short-term prediction (predicting 72 hours in the future) is improved by 0.22% at the minimum, 7.09% at the maximum and 2.08% on average; the accuracy of ultra-short-term prediction (4 hours in the future) is improved by 0.14 percent of the minimum value, 11.16 percent of the maximum value and 2.72 percent of the average value.
In order to easily understand the technical solution provided by the embodiment of the present application, as shown in fig. 9, a meteorological feature extraction process for influencing the generated power in a complete new energy power generation system is used to briefly describe the meteorological feature extraction method for influencing the generated power in the new energy power generation system provided by the embodiment of the present application:
(1) The path of the incoming data set, the number of days of data loaded, the meteorological feature type (wind/photovoltaic), random forests and correlation coefficient thresholds. And screening out a meteorological characteristic factor combination which has the largest influence on the power change of the new energy based on a random forest algorithm.
(2) And (3) researching the correlation between the meteorological features and the generating power by applying statistical correlation analysis, and further screening the meteorological features according to the correlation between the meteorological features and the generating power.
(3) And (3) recursively constructing a decision tree model by using a cross-validation method and a recursive feature elimination method, selecting the best feature according to the correlation coefficient, and repeating the process on the remaining features until all the features are traversed to obtain a target meteorological feature set.
(4) And exporting the feature set obtained by screening the feature engineering, and outputting the feature set into a configuration file in a csv format.
It should be understood that, although the steps in the flowcharts related to the embodiments as described above are sequentially displayed as indicated by arrows, the steps are not necessarily performed sequentially as indicated by the arrows. The steps are not limited to being performed in the exact order illustrated and, unless explicitly stated herein, may be performed in other orders. Moreover, at least a part of the steps in the flowcharts related to the embodiments described above may include multiple steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, and the execution order of the steps or stages is not necessarily sequential, but may be rotated or alternated with other steps or at least a part of the steps or stages in other steps.
Based on the same inventive concept, the embodiment of the application also provides a meteorological feature extraction device for extracting the meteorological feature influencing the generated power in the new energy power generation system, wherein the meteorological feature extraction device is used for realizing the meteorological feature extraction method for extracting the meteorological feature influencing the generated power in the new energy power generation system. The implementation scheme for solving the problem provided by the device is similar to the implementation scheme recorded in the method, so that specific limitations in the embodiment of the meteorological feature extraction device for influencing the generated power in one or more new energy power generation systems provided below can be referred to the limitations on the meteorological feature extraction method for influencing the generated power in the new energy power generation system, and details are not repeated here.
In one embodiment, as shown in fig. 10, there is provided a meteorological feature extraction apparatus for influencing power generation in a new energy power generation system, including: an acquisition module 1002, a first extraction module 1004, a second extraction module 1006, and a third extraction module 1008, wherein:
the obtaining module 1002 is configured to obtain an initial meteorological feature affecting the generated power in the new energy power generation system, and obtain an initial meteorological feature set.
The first extraction module 1004 is configured to filter meteorological features in the initial meteorological feature set by using a random forest algorithm to obtain a first meteorological feature set.
And a second extraction module 1006, configured to perform meteorological feature screening on the first meteorological feature set based on correlation analysis between meteorological features and generated power, to obtain a second meteorological feature set.
And a third extraction module 1008, configured to filter meteorological features in the second meteorological feature set by using a recursive feature elimination method to obtain a target meteorological feature set.
And the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
In one embodiment, the first extraction module 1004 is further configured to obtain power generation sample data of the new energy power generation system; according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, the first extraction module 1004 is further configured to randomly sample power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, the first extraction module 1004 is further configured to calculate a prediction error rate of the decision tree according to the test sample data; randomly adding noise to a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the second extraction module 1006 is further configured to calculate a correlation between each meteorological feature in the first set of meteorological features and generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the third extraction module 1008 is further configured to perform meteorological feature extraction on the second meteorological feature by using a recursive feature elimination method; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
All modules in the meteorological feature extraction device influencing the generated power in the new energy power generation system can be completely or partially realized through software, hardware and a combination of the software and the hardware. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 11. The computer device includes a processor, a memory, a communication interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operating system and the computer program to run on the non-volatile storage medium. The communication interface of the computer device is used for communicating with an external terminal in a wired or wireless manner, and the wireless manner can be realized through WIFI, a mobile cellular network, NFC (near field communication) or other technologies. The computer program is executed by a processor to realize a meteorological feature extraction method for influencing the generated power in the new energy power generation system. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 11 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is gradually increased.
In one embodiment, the processor when executing the computer program further performs the steps of: adopting a random forest algorithm, screening meteorological features in the initial meteorological feature set, and obtaining a first meteorological feature set comprises the following steps: acquiring power generation sample data of a new energy power generation system; according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, the processor, when executing the computer program, further performs the steps of: according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm, and obtaining the feature score comprises the following steps: randomly sampling power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, the processor, when executing the computer program, further performs the steps of: calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data, and obtaining the feature score comprises the following steps: calculating the prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the processor when executing the computer program further performs the steps of: based on the correlation analysis between the meteorological features and the generated power, the meteorological feature screening is carried out on the first meteorological feature set, and the obtaining of the second meteorological feature set comprises the following steps: calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the processor when executing the computer program further performs the steps of: adopting a recursive characteristic elimination method to screen meteorological characteristics in the second meteorological characteristic set, and obtaining a target meteorological characteristic set comprises the following steps: performing meteorological feature extraction on the second meteorological feature by adopting a recursive feature elimination method; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
performing meteorological feature screening on the first meteorological feature set based on correlation analysis between meteorological features and generated power to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is gradually increased.
In one embodiment, the computer program when executed by the processor further performs the steps of: adopting a random forest algorithm, screening meteorological features in the initial meteorological feature set, and obtaining a first meteorological feature set comprises the following steps: acquiring power generation sample data of a new energy power generation system; grading the meteorological features in the initial meteorological feature set by adopting a random forest algorithm according to the power generation sample data to obtain feature grades; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, the computer program when executed by the processor further performs the steps of: according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm, and obtaining the feature score comprises the following steps: randomly sampling power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, the computer program when executed by the processor further performs the steps of: calculating the importance degree of meteorological features in the initial meteorological feature set according to the decision tree and the test sample data, and obtaining the feature score comprises the following steps: calculating the prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the computer program when executed by the processor further performs the steps of: based on the correlation analysis between the meteorological features and the generated power, the meteorological feature screening is carried out on the first meteorological feature set, and the obtaining of the second meteorological feature set comprises the following steps: calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the computer program when executed by the processor further performs the steps of: and screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set, wherein the method comprises the following steps: adopting a recursive feature elimination method to extract meteorological features of the second meteorological feature; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
In one embodiment, a computer program product is provided, comprising a computer program which when executed by a processor performs the steps of:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on the correlation analysis between the meteorological features and the generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
and the influence degree of each average meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
In one embodiment, the computer program when executed by the processor further performs the steps of: adopting a random forest algorithm, screening meteorological features in the initial meteorological feature set, and obtaining a first meteorological feature set comprises the following steps: acquiring power generation sample data of a new energy power generation system; according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores; and according to the feature scores, removing features with the feature scores lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
In one embodiment, the computer program when executed by the processor further performs the steps of: according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm, and obtaining feature scores comprises the following steps: randomly sampling power generation sample data to obtain training sample data; obtaining test sample data according to the power generation sample data which is not sampled; constructing a decision tree according to training sample data; and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
In one embodiment, the computer program when executed by the processor further performs the steps of: calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data, and obtaining the feature score comprises the following steps: calculating the prediction error rate of the decision tree according to the test sample data; randomly adding noise into a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree; and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
In one embodiment, the computer program when executed by the processor further performs the steps of: based on the correlation analysis between the meteorological features and the generated power, the meteorological feature screening is carried out on the first meteorological feature set, and the obtaining of the second meteorological feature set comprises the following steps: calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power; and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
In one embodiment, the computer program when executed by the processor further performs the steps of: and screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set, wherein the method comprises the following steps: adopting a recursive feature elimination method to extract meteorological features of the second meteorological feature; and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, databases, or other media used in the embodiments provided herein can include at least one of non-volatile and volatile memory. The nonvolatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical Memory, high-density embedded nonvolatile Memory, resistive Random Access Memory (ReRAM), magnetic Random Access Memory (MRAM), ferroelectric Random Access Memory (FRAM), phase Change Memory (PCM), graphene Memory, and the like. Volatile Memory can include Random Access Memory (RAM), external cache Memory, and the like. By way of illustration and not limitation, RAM can take many forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), among others. The databases involved in the embodiments provided herein may include at least one of relational and non-relational databases. The non-relational database may include, but is not limited to, a block chain based distributed database, and the like. The processors referred to in the various embodiments provided herein may be, without limitation, general purpose processors, central processing units, graphics processors, digital signal processors, programmable logic devices, quantum computing-based data processing logic devices, or the like.
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present application. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present application shall be subject to the appended claims.
Claims (10)
1. A meteorological feature extraction method for influencing generated power in a new energy power generation system is characterized by comprising the following steps:
acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
based on correlation analysis between meteorological features and generated power, carrying out meteorological feature screening on the first meteorological feature set to obtain a second meteorological feature set;
screening meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
the influence degree of each meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
2. The method of claim 1, wherein the screening meteorological features in the initial meteorological feature set by using a random forest algorithm to obtain a first meteorological feature set comprises:
acquiring power generation sample data of the new energy power generation system;
according to the power generation sample data, scoring the meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain feature scores;
and according to the feature scores, removing the features of which the feature scores are lower than a preset score threshold value from the initial meteorological feature set to obtain a first meteorological feature set.
3. The method according to claim 2, wherein the scoring the meteorological features in the initial meteorological feature set by using a random forest algorithm according to the power generation sample data to obtain a feature score comprises:
randomly sampling the power generation sample data to obtain training sample data;
obtaining test sample data according to the power generation sample data which is not sampled;
constructing a decision tree according to the training sample data;
and calculating the importance degree of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain a feature score.
4. The method of claim 3, wherein calculating the importance of the meteorological features in the initial meteorological feature set according to the decision tree and the test sample data to obtain the feature score comprises:
calculating the prediction error rate of the decision tree according to the test sample data;
randomly adding noise to a single meteorological feature of the test sample data, and calculating the noise prediction error rate of the decision tree;
and determining the importance degree of the meteorological features in the meteorological feature set according to the prediction error rate and the noise prediction error rate to obtain a feature score.
5. The method of claim 1, wherein the meteorological feature screening the first meteorological feature set based on correlation analysis between meteorological features and generated power to obtain a second meteorological feature set comprises:
calculating the correlation between each meteorological feature in the first meteorological feature set and the generated power;
and performing meteorological feature screening on the first meteorological feature set according to the correlation between each meteorological feature and the generated power to obtain a second meteorological feature set.
6. The method of claim 1, wherein the filtering the meteorological features in the second meteorological feature set by using a recursive feature elimination method to obtain a target meteorological feature set comprises:
performing meteorological feature extraction on the second meteorological feature by adopting a recursive feature elimination method;
and performing cross validation on the extracted meteorological features to obtain a target meteorological feature set.
7. A meteorological feature extraction device for influencing power generation in a new energy power generation system, the device comprising:
the acquisition module is used for acquiring initial meteorological features influencing the power generation power in the new energy power generation system to obtain an initial meteorological feature set;
the first extraction module is used for screening meteorological features in the initial meteorological feature set by adopting a random forest algorithm to obtain a first meteorological feature set;
the second extraction module is used for screening meteorological features of the first meteorological feature set based on correlation analysis between the meteorological features and the generated power to obtain a second meteorological feature set;
the third extraction module is used for screening the meteorological features in the second meteorological feature set by adopting a recursive feature elimination method to obtain a target meteorological feature set;
the influence degree of each meteorological feature in the initial meteorological feature set, the first meteorological feature set, the second meteorological feature set and the target meteorological feature set on the generated power is increased step by step.
8. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor realizes the steps of the method of any one of claims 1 to 6 when executing the computer program.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 6.
10. A computer program product comprising a computer program, characterized in that the computer program realizes the steps of the method of any one of claims 1 to 6 when executed by a processor.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210997566.0A CN115329880A (en) | 2022-08-19 | 2022-08-19 | Meteorological feature extraction method and device, computer equipment and storage medium |
CN202310972728.XA CN117113230A (en) | 2022-08-19 | 2023-08-03 | New energy high-precision prediction-oriented machine learning feature selection method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210997566.0A CN115329880A (en) | 2022-08-19 | 2022-08-19 | Meteorological feature extraction method and device, computer equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115329880A true CN115329880A (en) | 2022-11-11 |
Family
ID=83925406
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210997566.0A Withdrawn CN115329880A (en) | 2022-08-19 | 2022-08-19 | Meteorological feature extraction method and device, computer equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115329880A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115759446A (en) * | 2022-11-25 | 2023-03-07 | 南方电网数字电网研究院有限公司 | Machine learning feature selection method for new energy high-precision prediction |
CN116341767A (en) * | 2023-05-29 | 2023-06-27 | 南方电网数字电网研究院有限公司 | New energy power prediction characteristic recursion screening method based on random forest |
-
2022
- 2022-08-19 CN CN202210997566.0A patent/CN115329880A/en not_active Withdrawn
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115759446A (en) * | 2022-11-25 | 2023-03-07 | 南方电网数字电网研究院有限公司 | Machine learning feature selection method for new energy high-precision prediction |
CN116341767A (en) * | 2023-05-29 | 2023-06-27 | 南方电网数字电网研究院有限公司 | New energy power prediction characteristic recursion screening method based on random forest |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115329880A (en) | Meteorological feature extraction method and device, computer equipment and storage medium | |
CN112382352A (en) | Method for quickly evaluating structural characteristics of metal organic framework material based on machine learning | |
CN108491226B (en) | Spark configuration parameter automatic tuning method based on cluster scaling | |
CN109904878B (en) | Multi-wind-field power generation time sequence simulation scene construction method | |
CN111861781A (en) | Feature optimization method and system in residential electricity consumption behavior clustering | |
CN114266421B (en) | New energy power prediction method based on composite meteorological feature construction and selection | |
CN113468796A (en) | Voltage missing data identification method based on improved random forest algorithm | |
CN113344438A (en) | Loan system, loan monitoring method, loan monitoring apparatus, and loan medium for monitoring loan behavior | |
CN115795329A (en) | Power utilization abnormal behavior analysis method and device based on big data grid | |
CN116739172A (en) | Method and device for ultra-short-term prediction of offshore wind power based on climbing identification | |
CN115759446A (en) | Machine learning feature selection method for new energy high-precision prediction | |
CN116304713A (en) | Wind power plant fault scene prediction model generation method and device and computer equipment | |
CN110610203A (en) | Electric energy quality disturbance classification method based on DWT and extreme learning machine | |
CN111612289B (en) | New energy multi-scene risk feature oriented power system risk assessment method | |
CN117113230A (en) | New energy high-precision prediction-oriented machine learning feature selection method | |
CN107491417A (en) | A kind of document structure tree method under topic model based on particular division | |
CN114876731A (en) | Method, system, equipment and medium for checking wind turbine generator in inefficient operation of wind farm | |
Ma | The Research of Stock Predictive Model based on the Combination of CART and DBSCAN | |
CN114118411A (en) | Training method of image recognition network, image recognition method and device | |
CN114021699A (en) | Gradient-based convolutional neural network pruning method and device | |
CN114139482A (en) | EDA circuit failure analysis method based on depth measurement learning | |
CN113408816A (en) | Power grid disaster situation evaluation method based on deep neural network | |
JP2021124949A (en) | Machine learning model compression system, pruning method, and program | |
CN113792749A (en) | Time series data abnormity detection method, device, equipment and storage medium | |
CN116451006B (en) | PMU data recovery method and system based on enhanced time sequence mode attention |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WW01 | Invention patent application withdrawn after publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20221111 |