CN115860173A - Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm - Google Patents
Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm Download PDFInfo
- Publication number
- CN115860173A CN115860173A CN202211294294.4A CN202211294294A CN115860173A CN 115860173 A CN115860173 A CN 115860173A CN 202211294294 A CN202211294294 A CN 202211294294A CN 115860173 A CN115860173 A CN 115860173A
- Authority
- CN
- China
- Prior art keywords
- carbon emission
- model
- algorithm
- prediction
- prediction model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 229910052799 carbon Inorganic materials 0.000 title claims abstract description 146
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 title claims abstract description 145
- 238000004422 calculation algorithm Methods 0.000 title claims abstract description 79
- 238000000034 method Methods 0.000 title claims abstract description 47
- 238000010276 construction Methods 0.000 title claims description 14
- 238000012549 training Methods 0.000 claims abstract description 28
- 238000012360 testing method Methods 0.000 claims abstract description 17
- 238000009826 distribution Methods 0.000 claims abstract description 9
- 238000007781 pre-processing Methods 0.000 claims abstract description 7
- 238000012545 processing Methods 0.000 claims description 10
- 230000008569 process Effects 0.000 claims description 9
- 238000005457 optimization Methods 0.000 claims description 8
- 238000002790 cross-validation Methods 0.000 claims description 7
- 238000003066 decision tree Methods 0.000 claims description 6
- 238000010606 normalization Methods 0.000 claims description 6
- 238000003860 storage Methods 0.000 claims description 6
- 230000002159 abnormal effect Effects 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 4
- 238000012856 packing Methods 0.000 claims description 3
- 238000013210 evaluation model Methods 0.000 claims description 2
- 230000010354 integration Effects 0.000 abstract description 3
- 238000005265 energy consumption Methods 0.000 description 12
- 230000009467 reduction Effects 0.000 description 8
- 230000000694 effects Effects 0.000 description 6
- 238000010304 firing Methods 0.000 description 5
- 239000000919 ceramic Substances 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 241000283153 Cetacea Species 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000004134 energy conservation Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 208000025174 PANDAS Diseases 0.000 description 1
- 208000021155 Paediatric autoimmune neuropsychiatric disorders associated with streptococcal infection Diseases 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 239000003245 coal Substances 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 239000002283 diesel fuel Substances 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000003345 natural gas Substances 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention relates to a method and a medium for constructing and predicting a carbon emission prediction model based on a Stacking algorithm, which comprises the following steps: acquiring power data and corresponding carbon emission data samples to form a data set; preprocessing a data set, and dividing the data set into a training set and a test set; analyzing the characteristics influencing carbon emission by using an XGboost algorithm to obtain target characteristics; constructing a carbon emission prediction model, wherein the carbon emission prediction model comprises a meta model and a plurality of base models; fusing the meta model and the plurality of base models by using a Stacking algorithm, and training the models based on the training set and the target characteristics; and adjusting the weight distribution of the prediction result input into the meta-model by each basic model according to the error proportion of the carbon emission prediction result output by each basic model, so as to obtain the optimized carbon emission prediction model. According to the invention, a plurality of algorithm models with the largest difference are selected as the base models of the Stacking integration model, and weight distribution is carried out according to the predicted error ratio, so that the advantages of different algorithms are fully utilized, and the prediction accuracy is improved.
Description
Technical Field
The invention belongs to the field of artificial intelligence application, particularly relates to the technical field of carbon emission prediction, and particularly relates to a construction and prediction method and medium of a carbon emission prediction model based on a Stacking algorithm.
Background
The problem of climate warming caused by carbon emission is a common challenge facing countries in the world, and the main targets of constructing a green low-carbon sustainable development system, controlling the total energy consumption, improving the energy utilization efficiency, improving the renewable energy consumption ratio and the like are provided. Therefore, energy consumption characteristic research of key energy utilization industries needs to be developed, carbon emission conditions and evolution trends of key industries and enterprises are mastered, relevant work such as carbon emission reduction potential mining and carbon emission reduction service is developed in a targeted manner, and support is provided for implementing energy conservation and carbon reduction and improving resource utilization efficiency of key industries and enterprises. Currently, the following problems are mainly faced in this part of the work:
1) The energy consumption and carbon emission data of enterprises are difficult to master accurately and comprehensively in real time. At present, only a few key energy-using enterprises are provided with energy consumption online monitoring platforms, but the operation and maintenance cost is high, the carbon emission management function is lacked, and the carbon emission of the enterprises cannot be predicted and early warned; in addition, a large number of emission control enterprises with scales are not built with energy consumption monitoring systems. For government regulatory departments and enterprises, it is difficult to accurately and comprehensively master energy consumption data and carbon emission data of the enterprises in real time, and relevant data sources are mainly based on annual carbon check reports, statistical yearbooks and the like.
2) Key energy-using enterprises lack the carbon asset management function and cannot fully exploit the potential of carbon emission reduction. At present, the cognition and knowledge storage of key energy-consuming enterprises on the aspects of carbon emission accounting, carbon asset management, carbon market trading and the like are still relatively lacked, the carbon emission reduction potential of the enterprises cannot be effectively exploited, and the emission reduction requirement cannot be met most economically and effectively.
Therefore, the construction of a scientific carbon emission model can predict the future carbon emission situation, and has important significance for energy conservation and emission reduction work and sustainable development in China.
With the development of artificial intelligence, people can construct an algorithm prediction model by analyzing factors influencing carbon emission, and the accuracy of model prediction is improved. For example, 20319Xin et al predicted the carbon emission in China from 2012 to 2020 using the gray model GM (1, 1), the emission reduction pressure in China is also very large according to the prediction result. The sanfotam employs an improved BP neural network model,the method is characterized in that the prediction research of China carbon emission peak value is performed under 8 development modes, and research shows that China can achieve the 2030-year carbon emission peak reaching target under the economic decline mode, the energy-saving mode and other modes. The carbon emission amount and the carbon emission intensity of the whale in 2019-2040 years are changed by the Wanke and the like by utilizing a whale optimization algorithm, and the future carbon emission trend of China is accurately reflected. The method is characterized in that the building of a Logistic model with increased carbon emission by Duqiang and the like verifies that the Logistic model is applied to CO emission in CO through comparison with original data on carbon dioxide emission in different provinces from 2011 to 2020 2 Accuracy and high reliability in emissions prediction problems. However, the existing common carbon emission method also has certain problems, and although the method can predict the carbon emission, a single model obviously lacks of comprehensive research on multiple models and multiple mechanisms.
In recent years, a carbon emission combined prediction method is also popular in China, for example, zhang Feng et al combines a traditional grey prediction model, a system cloud grey prediction model and a Verhulst model, the model combines the advantages of a single model, and the combined grey prediction model is used for predicting CO in industries such as construction industry between 2013 and 2017 in Shandong province 2 And (4) the emission amount shows that the prediction precision of the combined model is higher than that of a single model. Gena et al proposed a Prophet-LSTM-based time series fusion prediction model in 2019, and finally verified that model fusion has higher accuracy in time series prediction through comparison experiments. The prediction effect of the combined prediction methods is better than that of a single model, but the optimization space is still left in the accuracy of the prediction effect.
Disclosure of Invention
The invention aims to provide a construction and prediction method of a carbon emission prediction model based on a Stacking algorithm.
The scheme for solving the technical problems is as follows: a construction method of a carbon emission prediction model based on a packing algorithm comprises the following steps:
s1, acquiring electric power data and a carbon emission data sample corresponding to the electric power data to form a data set;
s2, preprocessing the data set, and dividing the preprocessed data set into a training set and a test set according to a preset proportion; the training set and test set preset ratio may be 2;
s3, analyzing the characteristics influencing carbon emission by using an XGboost algorithm, selecting the characteristics, and removing redundant characteristics to obtain target characteristics;
s4, constructing a carbon emission prediction model, wherein the constructed carbon emission prediction model comprises a meta-model and a plurality of base models;
s5, fusing the meta-model and the plurality of base models by using a Stacking algorithm, training the constructed carbon emission prediction model based on a training set and target characteristics, and performing parameter optimization on the meta-model and the hyper-parameters of each base model in a grid search mode;
and S6, based on the test set, adjusting the weight distribution of the prediction results input into the meta-model by each base model according to the error proportion of the carbon emission prediction results output by each base model, and obtaining the trained carbon emission prediction model.
Preferably, the above method step further comprises S7:
and S7, evaluating the accuracy of the carbon emission prediction model, inputting the test set for prediction, comparing corresponding carbon emission data samples, and analyzing the prediction result.
Preferably, in S2, the processing of the data set includes:
removing abnormal values in the data set and completing missing values in the data set; and carrying out normalization processing on the data set after the missing value is completed.
Because the power data is obtained through the correlation model, the obtained power data may have missing values and abnormal values in special or artificial situations. Therefore, when carbon emission prediction is carried out, abnormal values are removed from the power data
And completing the missing values. In the invention, two statistical information of the mean value and the highest frequency number are mainly used for filling missing values.
If the value of a feature of one dimension is several orders of magnitude greater than that of other dimensions, it will dominate the objective function of the machine learning model, making it unable to learn features of other dimensions. And the normalization processing can scale each dimension feature in the data set into an array with the mean value of 0, so that each dimension feature is scaled to the same order of magnitude, and the deviation is avoided when the machine learning model is trained. If the normalization processing is not carried out, the phenomenon that the features with larger orders of magnitude are biased occurs in the process of training the model.
The formula for the normalization process is as follows:
where x is the initial data, x' is the processed data, x min Denotes the minimum value, x, in the initial data max Representing the maximum value in the initial data.
Preferably, the S3 includes the following steps:
s31, inputting the characteristics influencing carbon emission into an XGboost algorithm to obtain the gain condition of a decision tree in the training process;
and S32, grading the input features according to the gain condition of the lifting decision tree by the XGboost algorithm according to the input features, and selecting the features with high grades as target features.
The XGboost algorithm can divide input features according to the gain condition of a tree in the training process, remove some redundant features with lower importance and enable prediction to be more accurate and efficient.
Preferably, the base model performs carbon emission prediction by selecting a plurality of algorithm models and inputting power data, and then compares prediction result errors of the algorithm models according to a difference metric method, and selects three algorithm models with the largest difference to determine; the meta model includes XGBoost.
Preferably, the base model is used for predicting carbon emission by inputting power data into GBDT, RF, SVM, KNN and LSTM algorithm models, and then the prediction result errors of the algorithm models are compared according to a difference metric method, and the three algorithm models with the largest difference are selected for determination.
Because the data observation angles of different algorithms are different from the principle structure of the algorithms, the advantages of the algorithms can be maximized when the algorithms with large difference are selected for prediction, and the prediction accuracy can be greatly improved.
Preferably, the S6 includes the steps of:
s61, inputting the prediction result of the base model into a meta model;
s62, calculating the error of the output result of the base model through cross validation, and performing weight distribution on the prediction result input into the meta-model by the base model according to the error proportion to obtain the constructed carbon emission prediction model.
The weighted output can increase the influence degree of the prediction value with high precision on the prediction result, and reduce the influence degree of the prediction value with low precision on the prediction result. The method not only improves the accuracy of the prediction model, but also improves the stability of the prediction model.
Preferably, in S7, the average absolute percentage error is used as an accuracy standard of the evaluation model.
The construction device of the carbon emission prediction model based on the Stacking algorithm comprises the following components:
the first acquisition module is used for acquiring the power data and carbon emission data samples corresponding to the power data to form a data set;
the preprocessing module is used for preprocessing the data set and dividing the preprocessed data set into a training set and a test set according to a preset proportion;
the characteristic selection module is used for analyzing characteristics influencing carbon emission through an XGboost algorithm, performing characteristic selection and removing redundant characteristics to obtain target characteristics;
the carbon emission prediction model construction module is used for constructing a carbon emission prediction model, and the constructed carbon emission prediction model comprises a meta model and a plurality of base models;
the carbon emission prediction model training module is used for fusing the meta-model and the plurality of base models through a Stacking algorithm, training the constructed carbon emission prediction model based on a training set and target characteristics, and performing parameter optimization on the meta-model and the hyper-parameters of each base model in a grid search mode;
and the carbon emission prediction model optimization module is used for adjusting the weight distribution of the prediction results input into the meta-model by each base model according to the error proportion of the carbon emission prediction results output by each base model based on the test set to obtain the trained carbon emission prediction model.
A method of carbon emission prediction, the method comprising:
acquiring power data to be predicted;
inputting the power data to be predicted into a carbon emission prediction model trained in advance, and predicting carbon emission based on the carbon emission prediction model to obtain a carbon emission prediction result;
the carbon emission prediction model is constructed by the construction method of the carbon emission prediction model based on the Stacking algorithm.
A carbon emissions prediction apparatus, the apparatus comprising:
the second acquisition module is used for acquiring the power data to be predicted;
and the prediction module is used for inputting the power data to be predicted into a carbon emission prediction model trained in advance, and predicting carbon emission based on the carbon emission prediction model to obtain a carbon emission prediction result.
Another object of the present invention is to provide a computer-readable storage medium, which stores a computer program, which when executed by a processor, implements the method for constructing a carbon emission prediction model based on a Stacking algorithm as described above.
An electronic device that can read the computer-readable storage medium as described above.
The invention has the beneficial effects that:
1. the XGboost algorithm is used for analyzing the importance of the features during feature engineering, and redundant features are effectively eliminated.
2. When the base model of the Stacking model is selected, the difference of each base model is evaluated by using a difference measurement method. On one hand, when the Stacking model uses a base model with strong learning capacity, a better prediction effect can be obtained. On the other hand, the algorithm with larger difference can fully utilize the advantages of different algorithms, so that each algorithm can make up for the deficiencies of each other. In contrast to selecting the base model randomly or based on manual experience, the invention can select the base model with larger difference to obtain better prediction effect.
The foregoing description is only an overview of the technical solutions of the present invention, and in order to make the technical solutions of the present invention more clearly understood and to implement them in accordance with the contents of the description, the following detailed description is given with reference to the preferred embodiments of the present invention and the accompanying drawings. The detailed description of the present invention is given in detail by the following examples and the accompanying drawings.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention and do not constitute a limitation of the invention. In the drawings:
fig. 1 is a flowchart of a method for constructing a carbon emission prediction model based on a Stacking algorithm in embodiment 1 of the present invention;
fig. 2 is a flowchart of selecting a base model of the Stacking algorithm in embodiment 1 of the present invention.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
The working principle of the invention is as follows: the invention mainly utilizes an improved Stacking integration algorithm, wherein the Stacking integration algorithm takes the prediction result of a base model as the input of a meta-model, and the output of the meta-model is integrated as the final prediction result through a combination strategy.
The Stacking algorithm generally has better prediction performance than a single model, but has certain defects, the original Stacking algorithm cannot guarantee that a proper base model can be selected, and an overfitting phenomenon is easy to occur. Meta-models may achieve worse experimental results than the base model if the model selection is not appropriate, or due to factors such as data type. The invention makes some improvements to the Stacking algorithm: compared with the base model selected according to manual experience in the original Stacking algorithm, the difference of the base models is evaluated by using a difference metric method, the base models with larger difference are combined together to improve the prediction precision, and the base models are weighted and distributed to the meta-model according to the result of the cross-validation error to reduce the prediction error. Although the training time is increased to a certain extent, the generalization performance of the model is improved. The improved Stacking algorithm overcomes the defects of the original algorithm and obtains better prediction effect on carbon emission.
Example 1
As shown in fig. 1, a ceramic industry is selected as a test point, and a carbon emission prediction model is established according to 25 obtained carbon emission data samples of ceramic enterprises in Fujian province and Hubei province, wherein the data are provided by enterprises, and the specific steps are as follows:
s1, obtaining electric power data of a ceramic enterprise and a carbon emission data sample corresponding to the electric power data to form a data set.
S2, processing the data set, and specifically comprising the following steps:
s21, screening and processing abnormal values in the data set by using a pandas tool;
s22, filling missing values in a data set by using a single variable feature filling method, mainly filling the missing values by using statistical information of numbers with the highest mean value and frequency, firstly calculating the occurrence frequency of each value in the features, filling the missing values of the features by using the number if the value with the higher occurrence frequency exists, and otherwise, filling the missing values by using the mean value;
s23, after filling up missing values, carrying out normalization processing on the data set, wherein the processing formula is as follows:
where x is the initial data, x' is the processed data, x min Denotes the minimum value, x, in the initial data max Representing the maximum value in the initial data.
And S24, dividing the processed data set of the power data into an 80% training set and a 20% testing set according to a proportion.
S3, analyzing the characteristics influencing carbon emission by using an XGboost algorithm, and selecting the characteristics, wherein the method specifically comprises the following steps;
s31, inputting the characteristics influencing carbon emission into an XGboost algorithm to obtain the gain condition of a decision tree in the training process;
and S32, grading the input features according to the gain condition of the lifting decision tree by the XGboost algorithm according to the input features, and selecting the features with high grades as target features.
The ceramic energy consumption sources are mainly electric power, coal, natural gas and diesel oil, wherein the energy consumption of the firing process is the largest, and accounts for more than 60% of the comprehensive energy consumption of a product unit, and the electric energy consumption of the raw material production process and the kiln firing process accounts for more than 85% of the total power consumption of an enterprise, and after the characteristics are selected, the factors influencing the largest energy consumption of the embodiment are as follows: the type of kiln used for firing, the firing temperature, the firing time, the type of fuel and the quality are also important factors.
S4, constructing a carbon emission prediction model, and specifically comprising the following steps:
s41, inputting power data in the GBDT, RF, SVM, KNN and LSTM algorithm models to predict carbon emission;
s42, calculating the prediction error of each model by adopting a Pearson coefficient so as to analyze the difference degree of different base models, wherein the specific selection process is shown in figure 2, and the difference degree analysis of each algorithm model is shown in Table 1. As can be seen from table 1, the correlation coefficients of SVM, KNN, and RF are low, and therefore the degree of model difference is also the greatest, and RF, KNN, and SVM are selected as the base models of the Stacking integrated model, and then XGBoost is selected as the meta model.
TABLE 1
GBDT | RF | SVM | KNN | LSTM | |
GBDT | 1 | 0.923 | 0.664 | 0.597 | 0.688 |
RF | 0.923 | 1 | 0.512 | 0.557 | 0.587 |
SVM | 0.664 | 0.512 | 1 | 0.537 | 0.684 |
KNN | 0.597 | 0.557 | 0.537 | 1 | 0.712 |
LSTM | 0.688 | 0.587 | 0.684 | 0.712 | 1 |
S5, fusing the base models and the meta-models by using a Stacking algorithm, screening effective data of a training set according to target characteristics, and training the integrated model on the screened training set; and optimizing the hyper-parameter sets of each base model and meta-model by using grid search, wherein the optimal parameters of each algorithm are shown in table 2.
TABLE 2
S6, optimizing the model, specifically comprising the following steps:
s61, inputting the prediction result of the base model into the meta model;
s62, calculating the error of the output result of the base model through cross validation, and performing weight distribution on the prediction result input into the meta model by the base model according to the error proportion, wherein the formula is as follows:
let a test set of a certain base model of the first layer model have a result of (y) 1 ,y 2 ,…,y n ) The calculation error is (x) 1 ,x 2 ,…,x n ) Then its weight calculation formula is
The output result is (e) 1 y 1 ,e 2 y 2 ,…,e n y n ) The cross validation errors for each algorithm are shown in table 3.
According to the cross-validation performance of each model, an SVM weight of 0.5, an RF weight of 0.3 and a KNN weight of 0.2 are set.
TABLE 3
Error | |
RF | 0.1269 |
SVM | 0.0824 |
KNN | 0.1916 |
And S7, evaluating the accuracy of the model, inputting the test set for prediction, comparing the carbon emission data samples, and selecting a Mean Absolute Percentage Error (MAPE) as a model evaluation index. MAPE is defined as follows:
wherein x (i) and y (i) represent actual and predicted values of carbon emissions, respectively, and n represents the number of sample points;
through the verification of the test set, the average result of MAPE adopting 5-fold cross validation of each model is shown in Table 4, the model is 1.59%, and the carbon emission prediction achieves a good effect.
TABLE 4
MAPE/% | |
RF | 2.187 |
SVM | 1.972 |
KNN | 2.784 |
Fusion model | 1.59 |
Example 2
A method of carbon emission prediction, the method comprising:
acquiring power data to be predicted;
inputting the power data to be predicted into a carbon emission prediction model trained in advance, and predicting carbon emission based on the carbon emission prediction model to obtain a carbon emission prediction result;
the carbon emission prediction model is constructed by the construction method of the carbon emission prediction model based on the Stacking algorithm in the embodiment 1.
Example 3
An apparatus for constructing a carbon emission prediction model based on a Stacking algorithm, the apparatus comprising:
the first acquisition module is used for acquiring the electric power data and carbon emission data samples corresponding to the electric power data to form a data set;
the preprocessing module is used for preprocessing the data set and dividing the preprocessed data set into a training set and a test set according to a preset proportion;
the characteristic selection module is used for analyzing characteristics influencing carbon emission through an XGboost algorithm, performing characteristic selection and removing redundant characteristics to obtain target characteristics;
the carbon emission prediction model construction module is used for constructing a carbon emission prediction model, and the constructed carbon emission prediction model comprises a meta model and a plurality of base models;
the carbon emission prediction model training module is used for fusing the meta-model and the plurality of base models through a Stacking algorithm, training the constructed carbon emission prediction model based on a training set and target characteristics, and performing parameter optimization on the meta-model and the hyper-parameters of each base model in a grid search mode;
and the carbon emission prediction model optimization module is used for adjusting the weight distribution of the prediction results input into the meta-model by each base model according to the error proportion of the carbon emission prediction results output by each base model based on the test set to obtain the trained carbon emission prediction model.
Example 4
A carbon emissions prediction apparatus, the apparatus comprising:
the second acquisition module is used for acquiring the power data to be predicted;
and the prediction module is used for inputting the power data to be predicted into a carbon emission prediction model trained in advance, and predicting carbon emission based on the carbon emission prediction model to obtain a carbon emission prediction result.
Example 5
A computer-readable storage medium storing a computer program which, when executed by a processor, implements a method of constructing a carbon emission prediction model based on a packing algorithm as in embodiment 1.
Example 6
An electronic device that can read a computer-readable storage medium as in embodiment 5.
The foregoing is merely a preferred embodiment of the invention and is not intended to limit the invention in any manner; the present invention may be readily implemented by those of ordinary skill in the art as illustrated in the accompanying drawings and described above; however, those skilled in the art should appreciate that they can readily use the disclosed conception and specific embodiments as a basis for designing or modifying other structures for carrying out the same purposes of the present invention without departing from the scope of the invention as defined by the appended claims; meanwhile, any changes, modifications, and evolutions of the equivalent changes of the above embodiments according to the actual techniques of the present invention are still within the protection scope of the technical solution of the present invention.
Claims (10)
1. A construction method of a carbon emission prediction model based on a Stacking algorithm is characterized by comprising the following steps:
s1, acquiring electric power data and a carbon emission data sample corresponding to the electric power data to form a data set;
s2, preprocessing the data set, and dividing the preprocessed data set into a training set and a test set according to a preset proportion;
s3, analyzing the characteristics influencing carbon emission by using an XGboost algorithm, selecting the characteristics, and removing redundant characteristics to obtain target characteristics;
s4, constructing a carbon emission prediction model, wherein the constructed carbon emission prediction model comprises a meta-model and a plurality of base models;
s5, fusing the meta-model and the plurality of base models by using a Stacking algorithm, training the constructed carbon emission prediction model based on a training set and target characteristics, and performing parameter optimization on the meta-model and the hyper-parameters of each base model in a grid search mode;
and S6, based on the test set, adjusting the weight distribution of the prediction results input into the meta-model by each base model according to the error proportion of the carbon emission prediction results output by each base model, and obtaining the trained carbon emission prediction model.
2. The method for constructing the carbon emission prediction model based on the Stacking algorithm according to claim 1, further comprising S7:
and S7, evaluating the accuracy of the carbon emission prediction model, inputting the test set for prediction, comparing corresponding carbon emission data samples, and analyzing the prediction result.
3. The method for constructing the carbon emission prediction model based on the Stacking algorithm according to claim 1, wherein in the step S2, the processing of the data set comprises:
removing abnormal values in the data set and completing missing values in the data set; and carrying out normalization processing on the data set after the missing value is completed.
4. The method for constructing a carbon emission prediction model based on the Stacking algorithm according to claim 1,
the S3 comprises the following steps:
s31, inputting the characteristics influencing carbon emission into an XGboost algorithm to obtain the gain condition of a decision tree in the training process;
and S32, grading the input features according to the gain condition of the lifting decision tree by the XGboost algorithm according to the input features, and selecting the features with high grades as target features.
5. The method for constructing the carbon emission prediction model based on the Stacking algorithm according to claim 1, wherein the base model is used for predicting carbon emission by selecting a plurality of algorithm model input power data, comparing prediction result errors of the algorithm models according to a difference metric method, and selecting three algorithm models with the largest difference for determination; the meta model includes XGBoost.
6. The method for constructing the carbon emission prediction model based on the Stacking algorithm as claimed in claim 5, wherein the base model is determined by inputting power data into GBDT, RF, SVM, KNN and LSTM algorithm models to predict carbon emission, comparing prediction result errors of the algorithm models according to a difference metric method, and selecting three algorithm models with the largest difference.
7. The method for constructing the carbon emission prediction model based on the Stacking algorithm according to claim 1, wherein the S6 comprises the following steps:
s61, inputting the prediction result of the base model into the meta model;
s62, calculating the error of the output result of the base model through cross validation, and performing weight distribution on the prediction result input into the meta-model by the base model according to the error ratio to obtain the constructed carbon emission prediction model.
8. The method for constructing the carbon emission prediction model based on the Stacking algorithm as claimed in claim 2, wherein in the step S7, the average absolute percentage error is adopted as an accuracy standard of the evaluation model.
9. A method of predicting carbon emissions, the method comprising:
acquiring power data to be predicted;
inputting the power data to be predicted into a carbon emission prediction model trained in advance, and predicting carbon emission based on the carbon emission prediction model to obtain a carbon emission prediction result;
the carbon emission prediction model is constructed by the construction method of the carbon emission prediction model based on the packing algorithm according to any one of claims 1 to 8.
10. A computer-readable storage medium, in which a computer program is stored, which computer program, when being executed by a processor, carries out a method of constructing a Stacking algorithm based carbon emissions prediction model according to any of the claims 1-8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211294294.4A CN115860173A (en) | 2022-10-21 | 2022-10-21 | Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211294294.4A CN115860173A (en) | 2022-10-21 | 2022-10-21 | Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115860173A true CN115860173A (en) | 2023-03-28 |
Family
ID=85661710
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211294294.4A Pending CN115860173A (en) | 2022-10-21 | 2022-10-21 | Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115860173A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116151692A (en) * | 2023-04-17 | 2023-05-23 | 南京赛宝工业技术研究院有限公司 | Carbon emission evaluation system and method for life cycle of multi-source solid waste regenerated building masonry |
CN116739867A (en) * | 2023-06-27 | 2023-09-12 | 南方电网能源发展研究院有限责任公司 | Method and device for measuring carbon emission of electric power system and computer equipment |
CN117574329A (en) * | 2024-01-15 | 2024-02-20 | 南京信息工程大学 | Nitrogen dioxide refined space distribution method based on ensemble learning |
CN117575175A (en) * | 2024-01-15 | 2024-02-20 | 国网浙江省电力有限公司 | Carbon emission evaluation method, device, electronic equipment and storage medium |
-
2022
- 2022-10-21 CN CN202211294294.4A patent/CN115860173A/en active Pending
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116151692A (en) * | 2023-04-17 | 2023-05-23 | 南京赛宝工业技术研究院有限公司 | Carbon emission evaluation system and method for life cycle of multi-source solid waste regenerated building masonry |
CN116739867A (en) * | 2023-06-27 | 2023-09-12 | 南方电网能源发展研究院有限责任公司 | Method and device for measuring carbon emission of electric power system and computer equipment |
CN117574329A (en) * | 2024-01-15 | 2024-02-20 | 南京信息工程大学 | Nitrogen dioxide refined space distribution method based on ensemble learning |
CN117575175A (en) * | 2024-01-15 | 2024-02-20 | 国网浙江省电力有限公司 | Carbon emission evaluation method, device, electronic equipment and storage medium |
CN117575175B (en) * | 2024-01-15 | 2024-03-29 | 国网浙江省电力有限公司 | Carbon emission evaluation method, device, electronic equipment and storage medium |
CN117574329B (en) * | 2024-01-15 | 2024-04-30 | 南京信息工程大学 | Nitrogen dioxide refined space distribution method based on ensemble learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115860173A (en) | Construction and prediction method and medium of carbon emission prediction model based on Stacking algorithm | |
CN110544011B (en) | Intelligent system combat effectiveness evaluation and optimization method | |
CN109919489B (en) | Enterprise asset management system and GA-BP-based enterprise equipment life prediction method | |
CN112364560B (en) | Intelligent prediction method for working hours of mine rock drilling equipment | |
CN113094988A (en) | Data-driven slurry circulating pump operation optimization method and system | |
CN105404926A (en) | Aluminum electrolytic production technology optimization method based on BP neural network and MBFO algorithm | |
CN109559154A (en) | Electricity needs total amount Study on Relative Factors system and method based on Johnson-Copula model | |
CN107918368A (en) | The dynamic prediction method and equipment of iron and steel enterprise's coal gas yield and consumption | |
CN112101761A (en) | Comprehensive evaluation method for construction effect of clean, low-carbon, safe and efficient energy system | |
CN115471098A (en) | Carbon emission acquisition method and device, computer equipment and storage medium | |
CN117709524A (en) | Bayesian optimization-based carbon emission prediction method and system for steel industry | |
CN114004525A (en) | Electric energy substitution benefit evaluation method facing user side based on fuzzy comprehensive evaluation method | |
CN113780686A (en) | Distributed power supply-oriented virtual power plant operation scheme optimization method | |
CN113762591A (en) | Short-term electric quantity prediction method and system based on GRU and multi-core SVM counterstudy | |
CN117575564A (en) | Extensible infrastructure network component maintenance and transformation decision evaluation method and system | |
CN117277312A (en) | Gray correlation analysis-based power load influence factor method and equipment | |
CN105426959A (en) | Aluminium electrolysis energy conservation and emission reduction method based on BP neural network and self-adaptive MBFO algorithm | |
CN111353707A (en) | Scientific and technological input performance evaluation method based on data envelope analysis and BP neural network | |
CN115879638B (en) | Carbon emission prediction method for oilfield transfer station system | |
CN116108963A (en) | Electric power carbon emission prediction method and equipment based on integrated learning module | |
CN115511230A (en) | Electric energy substitution potential analysis and prediction method | |
CN115660038A (en) | Multi-stage integrated short-term load prediction based on error factors and improved MOEA/D-SAS | |
CN115034618A (en) | Community comprehensive energy system benefit evaluation method based on fuzzy evaluation | |
CN112785022B (en) | Method and system for excavating electric energy substitution potential | |
Zhou et al. | Analyzing and Predicting CO 2 Emissions in China Based on the LMDI and GA-SVM Model. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |