CN111832839B - Energy consumption prediction method based on sufficient incremental learning - Google Patents


Info

Publication number: CN111832839B (application CN202010723405.3A)
Authority: CN (China)
Prior art keywords: data, value, incremental learning, energy consumption, similarity value
Legal status: Active
Other languages: Chinese (zh)
Other versions: CN111832839A
Inventors: 刘晶, 于兵, 董瑶, 赵佳, 金玉蓉
Current assignee: Hebei University of Technology
Original assignee: Hebei University of Technology
Application filed by Hebei University of Technology; priority to CN202010723405.3A
Publication of application CN111832839A; application granted; publication of granted patent CN111832839B

Classifications

    • G06Q10/04 — Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06N20/10 — Machine learning using kernel methods, e.g. support vector machines [SVM]
    • G06N3/084 — Backpropagation, e.g. using gradient descent
    • G06Q10/06393 — Score-carding, benchmarking or key performance indicator [KPI] analysis
    • G06Q50/26 — Government or public services
    • Y02P90/82 — Energy audits or management systems therefor


Abstract

The invention discloses an energy consumption prediction method based on sufficient incremental learning, which comprises the following: sorting the multidimensional parameters of the energy consumption data and extracting the models of the different hubs, then using the models to perform effective feature extraction on the samples with a weighted kernel principal component analysis method; grouping the data, calculating and judging similarity values in each round of incremental learning, performing multiple rounds of incremental learning according to the magnitude of the similarity values, and continuously and dynamically adjusting the judgment thresholds, data samples and weight values; and finally predicting the energy consumption with an error back-propagation neural network and evaluating the prediction result with the root mean square error (RMSE). Experiments verify the effectiveness of the prediction method: the final RMSE value is as low as 0.000112, the best result among the compared methods.

Description

Energy consumption prediction method based on sufficient incremental learning
Technical Field
The invention relates to the technical field of energy consumption prediction, in particular to an energy consumption prediction method based on sufficient incremental learning.
Background
Energy consumption and environmental pollution are severe challenges accompanying the sustained, high-speed development of China's economy; industrial energy consumption accounts for nearly 70% of China's total social energy consumption, so in the industrial field, improving energy efficiency and reducing energy consumption are key to accelerating the green transformation of industry. With the popularization of vehicles, vehicle accessories attract more and more public attention. To meet customers' demands for the personalized customization of vehicle accessories, and because the wheel hub, an important vehicle component, has a decorative appearance, personalized hubs are increasingly popular. However, energy consumption during hub production cannot yet be effectively controlled, and energy consumption directly affects production cost, so studying the energy consumption of hub products is of great significance.
In the industrial field, energy consumption is receiving growing attention. In the Industry 4.0 revolution, the integration of advanced technologies such as the Internet of Things, artificial intelligence and the industrial internet, together with data acquisition, data management and analytical modeling in the industrial production process, supports the prediction of energy demand. At present, prediction mainly relies on time-series methods, neural networks, grey prediction, extreme learning machines, support vector machines and similar techniques, with good research results. The article [Wangzuo et al., Building energy consumption prediction with a hybrid model constructed by an improved whale algorithm [J], Computer Measurement and Control, 2020, 28(02): 197-201+205] proposes a novel hybrid model for short-term prediction of building energy consumption; it processes the data with a complementary ensemble empirical mode decomposition method and a feedback neural network optimized by an improved whale optimization algorithm to obtain the final energy consumption prediction, achieving good predictive performance. The article [Zhousheason et al., Incremental ELM virtual machine energy consumption prediction based on the compressed momentum term [J], Acta Automatica Sinica, 2019, 45(07): 1290-1297], addressing the redundant nodes of prediction models based on the traditional incremental extreme learning machine, adds a compressed momentum term that feeds the network training error back to the hidden-layer output, bringing the prediction closer to the output sample, reducing redundant hidden-layer nodes and accelerating convergence.
The article [Liujia school et al., Airline energy consumption sequence prediction based on LSTM [J], Computer Applications and Software, 2019, 36(10): 60-65] proposes a long short-term memory network prediction model optimized by a grid search algorithm, improving the accuracy of energy consumption prediction. The article [Yangjingjun, Rapid prediction simulation of regional building electric power energy consumption [J], Computer Simulation, 2019, 36(04): 432-]. The article [Chen Jing Jie et al., Unsaturated airport energy consumption prediction based on a two-step decomposition method and SARIMA [J], Computer Applications and Software, 2019, 36(04): 46-50+78] proposes a combined prediction method based on a two-step decomposition method and a seasonal differential autoregressive moving average model; combining adaptive-noise and variational mode decomposition, it models and predicts the data with the seasonal model, greatly improving the accuracy of energy consumption prediction. The article [GuoJun, Energy consumption prediction of polycrystalline silicon production based on LSTM-Adaboost [J], Computer Applications and Software, 2018, 35(12): 71-75+117] provides an energy consumption prediction model for the polycrystalline silicon production process based on an LSTM-Adaboost recurrent neural network; it optimizes the LSTM objective function with regularization and introduces the Adaboost algorithm, combined with the PCA algorithm, to obtain the prediction model and realize effective energy consumption prediction.
The article [Dingfeihong et al., A short-term building energy consumption prediction model based on a genetic optimization decision tree [J], Computer Engineering, 2019, 45(06): 280-289+296] proposes a genetically optimized decision tree model; the subtree generation process of the decision tree is guided by optimizing the gradient with a genetic algorithm, yielding higher prediction precision. The article [Gao Xuejin et al., An LS-SVM subway station air conditioning system energy consumption prediction model based on ISOA [J], Computer and Modernization, 2018(10): 36-43] provides a seeker optimization algorithm, improved in both search step length and search direction, for optimizing the model parameters in the LS-SVM modeling process, improving prediction precision and speed. The article [Wangkun et al., LSSVM for airport energy consumption prediction based on EMD and fruit fly parameter optimization [J], Computer Era, 2017(04): 35-40] proposes a least squares support vector machine energy consumption prediction method combining empirical mode decomposition and fruit fly parameter optimization, improving prediction precision.
As the above literature shows, although dimension reduction and incremental learning are used in energy consumption prediction, the information in the data's own attributes is not fully utilized; and in the incremental learning process, only the problem of redundant nodes is solved. Selecting effective data from the growing data for efficient energy consumption prediction is a problem in urgent need of a solution. The invention improves the use of labels in the dimension-reduction process, using the different hub model attributes as label weights, thereby exploiting the data information efficiently. In the incremental learning process, the weights of effective data are increased while the weights of inefficient data are reduced; because the data are not fully utilized after a single screening, multi-layer screening incremental learning is performed cyclically, so that all effective data finally remain and more accurate energy consumption prediction is achieved.
Disclosure of Invention
The invention aims to:
the invention provides an energy consumption prediction method based on sufficient incremental learning, which is characterized by combining the types of hubs and using a weighted kernel principal component analysis method to extract effective characteristics of a sample, then calculating and judging similarity values, carrying out multi-round incremental learning according to different degrees of the similarity values, continuously and dynamically adjusting judgment threshold values, and finally using an error back propagation model to carry out energy consumption prediction. Compared with the original method, the method has the advantages that effective information is reserved to the maximum extent through characteristic extraction of data, the important degree relation included among energy consumption data is effectively solved through multiple rounds of incremental learning, dynamic adjustment of the energy consumption data is achieved, the effect of efficiently utilizing data information is achieved, and accurate energy consumption prediction is finally achieved.
The technical scheme adopted by the invention is as follows:
an energy consumption prediction method based on sufficient incremental learning comprises the following steps:
(1) sorting the multidimensional parameters of the energy consumption data, and extracting the models of different hubs;
(2) reducing the dimension of the sorted energy consumption data by adopting a weighted kernel principal component analysis method;
(3) dividing the data subjected to dimensionality reduction into training data and test data, grouping the training data and endowing the grouped training data with an initial weight value;
(4) performing the first round of incremental learning on the first group of training samples and their weight values, and calculating the average similarity value of the first group of training data;
(5) continuing to input the remaining data into the incremental learning model, comparing each with the first group of training data, and performing different operations according to the judged similarity values;
(6) when the similarity value between samples is greater than the average similarity value, increasing the weight values of the existing samples; when the similarity value between samples is smaller than the average similarity value, reducing the weight values of the existing samples; the samples are then stored together for the next round of incremental learning, and sufficient incremental learning is realized through multiple rounds;
(7) during this process, adjusting the threshold parameter, data samples and weight values of each round of incremental learning as the data vary;
(8) when the average similarity value of the data used in a round of incremental learning is smaller than 1/2 of the initial average similarity value, ending the incremental learning and deleting all data whose weight values are smaller than 1/2 of the initial weight value;
(9) inputting the obtained data into an error back propagation model for energy consumption prediction;
(10) and evaluating the energy consumption prediction result by adopting an evaluation index RMSE value.
The weighting of the weighted kernel principal component analysis method in step (2) is computed by treating the labels of samples with different hub models as weights and fusing the label information values into the kernel matrix to construct a weighted kernel matrix. The weight corresponding to the data in the weighted kernel matrix is

w_ij = min(u_i, u_j) / max(u_i, u_j)

(the original formula appears only as an image; this form is consistent with the properties stated in the detailed description: w_ij = 1 when u_i = u_j, and 0 < w_ij < 1 otherwise), where u_i denotes the weight corresponding to the ith sample.
In step (5), the Euclidean distance is used to calculate the similarity value of the multidimensional data. The similarity value is calculated as

sim(x, y) = 1 / (1 + dist(x, y))

(the original formula appears only as an image; this form matches the stated behavior), where dist(x, y) denotes the Euclidean distance between two different features, namely the new feature x and an existing feature y (the existing features comprise the initial features and the new features already incorporated), that is

dist(x, y) = sqrt( Σ_i (x_i − y_i)² )

where x_i and y_i denote the ith values of the new feature x and the existing feature y. The smaller the difference between x and y, the smaller the Euclidean distance and the larger the similarity value; in this way the similarity between a newly added feature and the existing features is measured.
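The distance and similarity computations above can be sketched in Python; since the patent's similarity formula is only an image, the common 1/(1 + dist) mapping is assumed here:

```python
import numpy as np

def euclidean_distance(x, y):
    """dist(x, y): Euclidean distance between a new feature x and an
    existing feature y."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    return float(np.sqrt(np.sum((x - y) ** 2)))

def similarity(x, y):
    """Similarity value: grows as the Euclidean distance shrinks.
    The 1/(1 + dist) form is an assumption consistent with the text."""
    return 1.0 / (1.0 + euclidean_distance(x, y))
```

Any strictly decreasing mapping of the distance would satisfy the stated property; this one is bounded in (0, 1], which makes the later average-similarity thresholds well behaved.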
The process of sufficient increment learning in the step (6) comprises the following steps:
in the whole process of sufficient incremental learning, firstly giving initial weight values to all data, secondly calculating similarity values among the first group of data, calculating to obtain an average similarity value, then respectively calculating the similarity values of the other newly added data and the first group of data, and increasing the weight values of the existing samples when the similarity values are larger than the existing average similarity value; and when the current average similarity value is smaller than the existing average similarity value, reducing the weight value of the existing sample and storing all data together for carrying out the increment learning of the next round. In the next round of incremental learning, firstly grouping the existing data, secondly calculating the similarity value between the first group of data, calculating to obtain an average similarity value, then respectively calculating the similarity values of the other newly added data and the first group of data, and increasing the weight value of the existing sample when the similarity value is greater than the average similarity value; when the average similarity value is smaller than the average similarity value, the weight value of the existing sample is reduced, all data are stored together for incremental learning in the next round, until 1/2 that the average similarity value of the data for incremental learning is smaller than the initial average similarity value at a certain time, the incremental learning is finished, at this time, the data with the ownership weight value smaller than the initial weight value 1/2 are deleted, and the whole process of sufficient incremental learning is completed.
To address the energy consumption of hubs in the production process, this patent provides an energy consumption prediction method based on sufficient incremental learning: a weighted kernel principal component analysis method extracts effective features from the samples; similarity values are calculated and judged; the weights of the initial and newly added data samples are dynamically adjusted by sufficient incremental learning according to the similarity values; and finally an error back-propagation model predicts the energy consumption, realizing efficient incremental learning and accurate energy consumption prediction.
Firstly, the theoretical basis of the method of the invention is as follows:
1. weighted kernel principal component analysis: although the kernel principal component analysis method can process the nonlinear problem, the information in the data label is ignored, so the invention adopts the weighted kernel principal component analysis method (WKPCA) to reduce the dimension of the energy consumption data, and retains effective information as much as possible, thereby improving the efficiency of energy consumption prediction and reducing the error of the energy consumption prediction value.
2. Sufficient incremental learning: initial weight values are assigned to all data; in each round, the average similarity value of the first group is computed, and the weight values of the existing samples are increased when a newly added datum's similarity value exceeds that average and reduced otherwise, after which all data are stored together for the next round; the process ends when the average similarity value of a round falls below 1/2 of the initial average similarity value, at which point all data whose weight values are smaller than 1/2 of the initial weight value are deleted.
3. BP neural network (BPNN): an error back-propagation algorithm model that realizes accurate energy consumption prediction through the training data.
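A minimal error back-propagation regressor can be sketched as follows. The hidden-layer size and learning rate are illustrative choices; the 3000-iteration default only echoes the iteration count mentioned for FIG. 5 and is not otherwise specified here:

```python
import numpy as np

def train_bpnn(X, y, hidden=8, lr=0.1, epochs=3000, seed=0):
    """Minimal one-hidden-layer error back-propagation network for
    regression: sigmoid hidden layer, linear output, full-batch
    gradient descent. Returns a prediction function."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.5, (d, hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, (hidden, 1)); b2 = np.zeros(1)
    for _ in range(epochs):
        h = 1.0 / (1.0 + np.exp(-(X @ W1 + b1)))  # forward: hidden activations
        pred = h @ W2 + b2
        err = pred - y.reshape(-1, 1)             # output-layer error
        gW2 = h.T @ err / n; gb2 = err.mean(0)    # backpropagate to output layer
        dh = (err @ W2.T) * h * (1.0 - h)         # backpropagate through sigmoid
        gW1 = X.T @ dh / n; gb1 = dh.mean(0)
        W1 -= lr * gW1; b1 -= lr * gb1            # gradient-descent update
        W2 -= lr * gW2; b2 -= lr * gb2
    def predict(Xq):
        hq = 1.0 / (1.0 + np.exp(-(np.asarray(Xq, float) @ W1 + b1)))
        return (hq @ W2 + b2).ravel()
    return predict
```

In the full method, the rows of X would be the weighted samples surviving sufficient incremental learning and y the corresponding energy consumption values.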
Secondly, designing an energy consumption prediction method based on sufficient incremental learning:
the energy consumption prediction method based on the sufficient incremental learning has the following working principle: performing feature extraction on the data after the multidimensional parameters are sorted by adopting a weighted kernel principal component analysis method, so that the data becomes more efficient, and meanwhile, the data processing efficiency is improved; performing multiple rounds of incremental learning on the extracted features, continuously learning new knowledge on the basis of keeping original data information, realizing sufficient incremental learning, and continuously and dynamically adjusting and judging a threshold value, a data sample and a weight value in the incremental learning process; and finally, inputting the data obtained by training into an error back propagation algorithm to accurately predict the energy consumption, wherein the operation of the method is shown in figure 1.
1. Energy consumption prediction method structure based on sufficient incremental learning
The weighted kernel principal component analysis method reduces the dimensionality of the data samples through feature extraction, further improving data-processing efficiency. Multiple rounds of incremental learning are performed on the resulting data, with the judgment threshold, data samples and weight values dynamically adjusted by calculating similarity values, and the resulting data with the largest information content are input into the error back-propagation algorithm for energy consumption prediction.
2. Specific implementation of the Algorithm
The steps of the energy consumption prediction method based on the sufficient incremental learning are described as follows, and the flow chart is shown in FIG. 2.
(1) Sorting the multidimensional parameters of the energy consumption data, and extracting the models of different hubs;
(2) reducing the dimension of the sorted energy consumption data by adopting a weighted kernel principal component analysis method;
(3) dividing the data subjected to dimensionality reduction into training data and test data, grouping the training data and endowing the grouped training data with an initial weight value;
(4) performing the first round of incremental learning on the first group of training samples and their weight values, and calculating the average similarity value of the first group of training data;
(5) continuing to input the remaining data into the incremental learning model, comparing each with the first group of training data, and performing different operations according to the judged similarity values;
(6) when the similarity value between samples is greater than the average similarity value, increasing the weight values of the existing samples; when the similarity value between samples is smaller than the average similarity value, reducing the weight values of the existing samples; the samples are then stored together for the next round of incremental learning, and sufficient incremental learning is realized through multiple rounds;
(7) during this process, adjusting the threshold parameter, data samples and weight values of each round of incremental learning as the data vary;
(8) when the average similarity value of the data used in a round of incremental learning is smaller than 1/2 of the initial average similarity value, ending the incremental learning and deleting all data whose weight values are smaller than 1/2 of the initial weight value;
(9) inputting the obtained data into a BPNN for energy consumption prediction;
(10) and evaluating the energy consumption prediction result by adopting an evaluation index Root Mean Square Error (RMSE) value.
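The evaluation in step (10) is the standard root mean square error; a small helper makes it concrete:

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error between measured and predicted energy consumption."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))
```

A lower RMSE means a better fit; the experiments below report 0.000112 for the full method on this metric.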
The beneficial effects produced by the above technical scheme are as follows:
the invention provides an energy consumption prediction method based on sufficient incremental learning, which is used for processing three processes of energy consumption data characteristic extraction, incremental learning and energy consumption prediction. The model firstly reserves effective data samples with label information by using a weighted kernel principal component analysis method, and then predicts the energy consumption after performing multiple rounds of incremental learning on the data by calculating the similarity value and according to the judgment condition, thereby realizing efficient incremental learning and accurate energy consumption prediction. Through experimental analysis, a weighted kernel principal component analysis method, sufficient incremental learning and an error back propagation model are combined together, the RMSE value of energy consumption prediction is 0.000112, and meanwhile, through experimental comparison, the method disclosed by the invention is superior to a long-short term memory network (LSTM), a Support Vector Regression (SVR) and an original error back propagation model (BP), and accurate energy consumption prediction can be realized.
Drawings
FIG. 1 is a block diagram of the present invention energy consumption prediction method based on full incremental learning;
FIG. 2 is a flow chart of a method for energy consumption prediction based on full incremental learning of the present invention;
FIG. 3 is a graph of training data set RMSE values and test data set RMSE values for a BP algorithm with an intermediate layer output dimension of 7 in the method of the present invention;
FIG. 4 is a graph of the training data set RMSE values and the test data set RMSE values for the BP algorithm for a middle layer output dimension of 4 in the method of the present invention;
FIG. 5 shows the RMSE values for the training data set and the RMSE values for the test data set for a selected number of iterations of 3000 in the method of the present invention;
Detailed Description
In order to better understand the technical solution of the present invention, the following detailed description is made with reference to the accompanying drawings and examples.
In the embodiment of the invention, an energy consumption prediction method based on sufficient incremental learning comprises the following steps:
(1) Taking the prediction of the power consumption of a single hub product as an example, 20 hub models are selected for the experiment. Because the amount of available data is small, part of the data is simulated with Python; the relevant data span January 2015 to December 2018. The multidimensional parameters of all energy consumption data are sorted, and the models of the different hubs are extracted;
(2) reducing the dimension of the sorted energy consumption data by adopting a weighted kernel principal component analysis method;
Although the kernel principal component analysis method can handle nonlinear problems, it ignores the information contained in the data labels. The invention therefore adopts the weighted kernel principal component analysis method (WKPCA) to reduce the dimension of the energy consumption data, retaining as much effective information as possible and thereby reducing the error of the predicted values. Assume a high-dimensional data set X = [x_1, …, x_i, …, x_N], where x_i represents the i-th sample of the data set X, N represents the number of samples, and c represents the number of sample classes. Let the weight vector of the samples be [u_1, …, u_i, …, u_N], where u_i denotes the weight corresponding to the i-th sample x_i. Let the label information values of the samples be {l_1, …, l_j, …, l_c}, where l_j denotes the j-th label information value. The weight of sample x_i is determined by its class information, i.e., when sample x_i belongs to the j-th class,

u_i = l_j
wherein, i is 1,2, …, N, j is 1,2, …, c. Since the Gaussian kernel function is adopted, the kernel function
Figure BDA0002600827230000091
Wherein, K (x)i,xj) The kernel matrix of i rows and j columns is represented, and sigma represents the width parameter of the function, and controls the radial action range of the function. Constructing a weighted kernel matrix according to the weight information of the sample
Figure BDA0002600827230000092
Figure BDA0002600827230000093
Wherein, Kw(xi,xj) A weighted kernel matrix, w, representing i rows and j columnsijIs the weight value of the ith row and the jth column in the weight matrix, and
Figure BDA0002600827230000094
constructing a weight matrix W ═ Wij}。
Thus obtaining Kw(xi,xj) The following were used:
Figure BDA0002600827230000095
according to the above formula, when ui=ujTime, sample xiAnd sample xjBelong to the same model, wijWhen the weighted kernel matrix is equal to 1, the original kernel matrix K (x)i,xj) (ii) a When u isi≠ujTime, sample xiAnd sample xjBelong to different models, in this case wijIs a positive number greater than 0, and the class information between samples is introduced into the weighted kernel matrix Kw(xi,xj) In (1).
The eigenvalues λ_1, …, λ_i, …, λ_N and eigenvectors α_1, …, α_i, …, α_N of K_w are computed by the Jacobi method, where λ_i denotes the i-th eigenvalue and α_i the i-th eigenvector, i = 1, 2, …, N. Next, the eigenvalues λ_i are sorted in descending order and the corresponding eigenvectors α_i are rearranged accordingly. The eigenvectors are then unit-orthogonalized by the Gram–Schmidt method to obtain orthogonal eigenvectors β_i, where β_i denotes the i-th orthogonal eigenvector. The cumulative contribution rates B_1, …, B_i, …, B_n of the eigenvalues are computed, where B_i denotes the i-th cumulative contribution rate, i = 1, 2, …, n. Given a contribution-rate threshold p, if B_t ≥ p, the t principal components β_1, …, β_i, …, β_t are extracted. Finally, the projection Y of the normalized samples X onto the extracted eigenvectors is computed as Y = K_w(i, j) · β, where β = (β_1, …, β_i, …, β_t); the resulting projection Y is the data obtained by WKPCA feature extraction from the original data.
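As a concrete illustration, the WKPCA procedure above can be sketched in Python. The function name `wkpca_transform`, the Gaussian form of the label-based weight matrix, and the use of a symmetric eigensolver (whose output is already orthonormal, replacing the Jacobi and Gram–Schmidt steps) are assumptions of this sketch, not part of the original disclosure.

```python
import numpy as np

def wkpca_transform(X, labels, sigma=1.0, p=0.85):
    """Sketch of weighted kernel PCA: a Gaussian kernel weighted by label
    agreement, eigendecomposition, and projection onto the leading
    components whose cumulative contribution rate reaches p."""
    u = labels.astype(float)  # u_i = l_j: the weight is the label value
    # Gaussian kernel K(x_i, x_j) = exp(-||x_i - x_j||^2 / (2 sigma^2))
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq / (2.0 * sigma ** 2))
    # Assumed weight form: w_ij = 1 for the same class, positive otherwise
    W = np.exp(-(u[:, None] - u[None, :]) ** 2)
    Kw = W * K  # weighted kernel matrix
    # Symmetric eigendecomposition in place of the Jacobi method
    vals, vecs = np.linalg.eigh(Kw)
    order = np.argsort(vals)[::-1]  # descending eigenvalues
    vals, vecs = vals[order], vecs[:, order]
    # Cumulative contribution rate B_t; keep t components with B_t >= p
    B = np.cumsum(vals) / np.sum(vals)
    t = int(np.searchsorted(B, p) + 1)
    beta = vecs[:, :t]  # eigh output is already orthonormal
    return Kw @ beta    # projection Y = Kw . beta
```

For a data set with N samples, the result has N rows and t columns, where t is chosen by the cumulative contribution rate (0.85 in the experiment below, which reduced the features to two dimensions).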
(3) Dividing the data subjected to dimensionality reduction into training data and test data, grouping the training data and endowing the grouped training data with an initial weight value;
(4) performing the first round of incremental learning on the first group of training samples and their weight values, and calculating the average similarity value of the first group of training data;
(5) successively inputting the remaining data into the incremental learning model, comparing them with the first group of training data, and performing different operations according to the resulting similarity values;
When calculating the similarity between a newly added feature and an existing feature, the Euclidean distance is used to compute the similarity Sim(x, y) of the multi-dimensional data:

Sim(x, y) = 1 / (1 + dist(x, y))

where dist(x, y) represents the Euclidean distance between the newly added feature x and the existing feature y:

dist(x, y) = sqrt( Σ_i (x_i − y_i)² )

where x_i and y_i represent the i-th values of the newly added feature x and the existing feature y, respectively. The smaller the difference between them, the smaller the Euclidean distance and the larger the similarity value, which thus measures the similarity between the newly added feature and the existing feature.
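A minimal Python sketch of this similarity computation, assuming the reciprocal form Sim(x, y) = 1/(1 + dist(x, y)) (the exact formula image is not reproduced in the source text; any decreasing function of the distance would satisfy the stated property):

```python
import math

def euclidean_distance(x, y):
    # dist(x, y) = sqrt(sum_i (x_i - y_i)^2)
    return math.sqrt(sum((xi - yi) ** 2 for xi, yi in zip(x, y)))

def similarity(x, y):
    # Assumed form: Sim(x, y) = 1 / (1 + dist(x, y)).
    # A smaller distance gives a larger similarity, as the text requires.
    return 1.0 / (1.0 + euclidean_distance(x, y))
```

For identical features the similarity is 1, and it decays toward 0 as the features move apart.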
(6) when the similarity value between samples is greater than the average similarity value, the weight values of the corresponding samples are increased; when it is smaller than the average similarity value, the weight values of the corresponding samples are reduced; all the data are then stored together for the next round of incremental learning, and sufficient incremental learning is achieved through multiple rounds of incremental learning;
The similarity between the newly added feature and each existing feature is calculated using the formula in step (5), the maximum value Sim(x, y) is selected as the similarity value of the feature, and the weight of the feature is handled according to the following principles:
Let a_i denote the average similarity value between pairs of features in the i-th round of incremental learning, i.e., a_i = averSim(x, y), the average of the similarities between features x and y among the existing features:
(1) If a_i < Sim(x, y), the newly added feature is highly similar to the existing feature, so the weight value of the existing feature is increased.
(2) If Sim(x, y) < a_i, the similarity between the newly added feature and the existing features is below the average, i.e., not high enough; the data satisfying this condition are therefore stored together, the weight value of the corresponding existing feature is correspondingly reduced, and the next round of incremental learning is then performed.
The dynamic weight of a feature is calculated according to the above principles for newly added features, to measure how the importance of the feature changes over time. Because the similarity between a newly added feature and the existing features reflects, to a certain extent, the importance of the newly added feature to the current model, the similarity value is normalized using the following formula and its dynamic weight is calculated.
Sim(x, y)_norm = (Sim(x, y) − minSim(x, y)) / (maxSim(x, y) − minSim(x, y))

where Sim(x, y)_norm represents the normalized similarity value of the newly added feature x and the existing feature y, minSim(x, y) represents the minimum similarity between the newly added feature x and the existing features y, and maxSim(x, y) represents the maximum such similarity.
For a feature satisfying the condition in principle (1), since it has a higher similarity value, its weight is increased to reinforce and remember the valid feature. The dynamic weight is calculated as follows:

W_{i+1} = W_i + Sim(x, y)_norm

where W_{i+1} represents the weight value of the existing feature y after the (i+1)-th increase and W_i the weight value after the i-th increase; W_{i+1} is obtained by adding the normalized similarity value of the newly added feature x and the existing feature y to W_i. At the first combination, i = 0 and W_0 is the initial weight value.

For a feature satisfying the condition in principle (2), the similarity is low, so the weight value is reduced; after the first round of incremental learning ends, the feature is used in the second round of incremental learning.

W_{i+1} = W_i − Sim(x, y)_norm

where W_{i+1} represents the weight value of the existing feature y after the (i+1)-th reduction and W_i the weight value after the i-th reduction; W_{i+1} is obtained by subtracting the normalized similarity value of the newly added feature x and the existing feature y from W_i. At the first reduction, i = 0 and W_0 is the initial weight value.
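The two update rules can be collected into a small Python helper. The function names are illustrative; the update follows W_{i+1} = W_i ± Sim(x, y)_norm as given above, with the sign chosen by comparing the similarity against the round's average.

```python
def normalize_similarity(sim, sim_min, sim_max):
    # Sim(x, y)_norm = (Sim - minSim) / (maxSim - minSim)
    return (sim - sim_min) / (sim_max - sim_min)

def update_weight(weight, sim, avg_sim, sim_min, sim_max):
    """Principle (1): similarity above the round average -> increase weight.
    Principle (2): similarity below the round average -> decrease weight."""
    sim_norm = normalize_similarity(sim, sim_min, sim_max)
    return weight + sim_norm if sim > avg_sim else weight - sim_norm
```

Starting from the initial weight W_0, repeated calls reinforce features that keep matching new data and gradually suppress those that do not.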
(7) during the above process, the threshold parameter, the data samples and the weight values of each round of incremental learning are adjusted according to differences in the data;
(8) when the average similarity value of the data used for a round of incremental learning is smaller than 1/2 of the initial average similarity value, the incremental learning ends; at this point, all data whose weight values are smaller than 1/2 of the initial weight value are deleted;
(9) inputting the obtained data into a BPNN for energy consumption prediction;
(10) evaluating the energy consumption prediction result using the root mean square error (RMSE) as the evaluation index.
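The RMSE used in step (10) is the standard root mean square error; a direct Python implementation:

```python
import math

def rmse(y_true, y_pred):
    # RMSE = sqrt((1/n) * sum_i (y_i - yhat_i)^2)
    n = len(y_true)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / n)
```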
The test verification of the energy consumption prediction method based on the sufficient incremental learning comprises the following steps:
1. description of data
The experimental data are derived from the hub characteristic parameters and energy consumption data of a hub manufacturing company, which has installed intelligent meters on each production line to measure the electric energy, water and natural gas consumed in the production process. The experiment takes the prediction of the power consumption of a single hub as an example, and 20 hub models are selected. Because the amount of available data is small, part of the data is simulated with Python, yielding data from January 2015 to December 2018, as shown in Table 1 below:
TABLE 1 characterization parameter description
2. Procedure of experiment
For the energy consumption data of the 20 hub models, the "single model" parameter is first extracted as the data label, and the remaining 10 parameters are then reduced in dimension with the weighted kernel principal component analysis method to extract the most important parameters. In the dimension-reduction process, all energy consumption data are first standardized to eliminate differences caused by different measurement units; the labeled energy consumption data are then input into the dimension-reduction model, the weighted kernel matrix, eigenvalues and eigenvectors are calculated, and the eigenvectors are unitized. The cumulative contribution rate is set to 0.85, and according to this value the features are finally reduced to two dimensions.
Sufficient incremental learning is then performed on the dimension-reduced data. In the first round of incremental learning, the similarity values within the first group of data are calculated and their average similarity value α is taken; the similarity values between the newly added data and the first group of data are then calculated. When a similarity value is greater than the existing average similarity value, the weight value of the existing sample is increased; when it is smaller, the weight value of the existing sample is reduced, and all data are stored together for the next round of incremental learning. In each subsequent round, the existing data are first regrouped, the similarity values within the first group are calculated to obtain a new average similarity value α, and the similarity values between the remaining newly added data and the first group are calculated; weight values are increased or reduced as before, and all data are stored together for the next round. This continues until, in some round, the average similarity value of the data used for incremental learning is smaller than 1/2 of the initial average similarity value, at which point incremental learning ends and all data whose weight values are smaller than 1/2 of the initial weight value are deleted, completing the whole process of sufficient incremental learning. Throughout, the average similarity value α, all energy consumption data and the corresponding weight values of each round are continuously and dynamically adjusted.
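The training loop just described can be sketched as follows. The group size, the cap on the number of rounds, and the reciprocal similarity function are assumptions of this sketch (the patent does not fix these values); the stopping rule and the low-weight deletion rule follow step (8) above.

```python
import numpy as np

def similarity(x, y):
    # Assumed reciprocal form: larger when the samples are closer.
    return 1.0 / (1.0 + np.linalg.norm(x - y))

def sufficient_incremental_learning(data, w0=1.0, group_size=20, max_rounds=10):
    """Compare remaining samples with the first group, raise the weights of
    similar samples and lower the rest, stop when the average similarity
    falls below half its initial value, then delete low-weight samples."""
    weights = np.full(len(data), w0)
    alpha0 = None
    for _ in range(max_rounds):
        first, rest = data[:group_size], data[group_size:]
        # average similarity alpha within the first group
        alpha = np.mean([similarity(a, b)
                         for i, a in enumerate(first) for b in first[i + 1:]])
        if alpha0 is None:
            alpha0 = alpha              # initial average similarity
        if alpha < alpha0 / 2:          # stopping rule from step (8)
            break
        # each remaining sample's best similarity to the first group
        sims = np.array([max(similarity(x, f) for f in first) for x in rest])
        spread = (sims.max() - sims.min()) or 1.0
        sims_norm = (sims - sims.min()) / spread
        # principle (1): increase weight; principle (2): decrease weight
        weights[group_size:] += np.where(sims > alpha, sims_norm, -sims_norm)
    keep = weights >= w0 / 2            # delete weights below half the initial
    return data[keep], weights[keep]
```

The returned subset plays the role of the 712 retained data points in the experiment, though the numbers here depend entirely on the assumed parameters.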
Through training with sufficient incremental learning, the resulting data distribution is shown in table 2:
TABLE 2 data distribution
The 712 data points obtained by sufficient incremental learning in Table 2 are then input into an error back-propagation (BP) model for energy consumption prediction. Before the prediction experiment, the output dimension of the middle layer and the number of iterations must be selected experimentally. The middle-layer output dimension is selected first, and the energy consumption prediction results are measured by the root mean square error (RMSE). All RMSE values and running times are shown in Table 3.
TABLE 3 RMSE values and run times
As can be seen from Table 3, when the middle-layer output dimension is 7, the RMSE value is the smallest, 0.000431. The corresponding training and test RMSE values are shown in FIG. 3; for comparison, the RMSE values for a middle-layer output dimension of 4 are shown in FIG. 4.
With the middle-layer dimension fixed at 7, which yields the minimum RMSE value, the number of iterations is selected next. Iteration counts of up to 3000 were tested, and the results are shown in Table 4. Comparing the RMSE values and running times for different iteration counts, the differences in RMSE on the test samples are very small; when the number of iterations is 1800, the RMSE value is minimal, 0.000112, with a running time of 691.28 seconds. The training and test RMSE values for 3000 iterations are shown in FIG. 5. The finally selected middle-layer output dimension and number of iterations are therefore 7 and 1800, respectively.
TABLE 4 RMSE values and runtimes
The data after dimension reduction and sufficient incremental learning are also input into the LSTM and SVR models for energy consumption prediction; the resulting RMSE values and running times are shown in Table 5.
TABLE 5 RMSE values and run times
As can be seen from Table 5, among the improved methods, the RMSE value of the BP algorithm is the smallest: 0.07856 less than that of LSTM and 0.240259 less than that of SVR. Although its running time is not the shortest, the training time is greatly reduced compared with the LSTM prediction method, so performing energy consumption prediction with the BP algorithm combined with dimension reduction and sufficient incremental learning gives the best result. Table 5 also compares these results with energy consumption prediction using the original BP, LSTM and SVR methods. The RMSE value of the method proposed by the invention is far smaller than the RMSE values of the original methods: 0.018945 smaller than the original BP, 2.491674 smaller than the original LSTM and 1.047864 smaller than the original SVR, showing that the dimension reduction and sufficient incremental learning used by the invention achieve highly efficient use of the data. In terms of running time, because sufficient incremental learning is performed, prediction takes longer than with the original BP method, but the overall running time is moderate. The combination of the weighted kernel principal component analysis method, sufficient incremental learning and the BP algorithm proposed by the invention is therefore effective for energy consumption prediction and meets the requirement of predicting energy consumption from continuously updated data.
3. Conclusion
To avoid wasting energy during hub production, the invention combines dimension reduction, incremental learning and energy consumption prediction. First, the weighted kernel principal component analysis method reduces the dimension of the other parameters while making full use of the model information; then incremental learning is performed over multiple rounds by calculating and judging similarity values; finally, the BP algorithm predicts the energy consumption, with RMSE as the evaluation index, achieving efficient incremental learning and accurate energy consumption prediction. The experimental results show that sufficient incremental learning makes more efficient use of the data. Combined with the weighted kernel principal component analysis method and the BP algorithm, the RMSE value of the energy consumption prediction is 0.000112, which is respectively 0.07856, 0.240259, 0.018945, 2.491674 and 1.047864 smaller than the LSTM algorithm combined with WKPCA and sufficient incremental learning, the SVR algorithm combined with WKPCA and sufficient incremental learning, the original BP algorithm, the original LSTM algorithm and the original SVR algorithm. The method of the invention thus enables accurate energy consumption prediction of the hub.

Claims (3)

1. An energy consumption prediction method based on sufficient incremental learning is characterized by comprising the following steps:
(1) sorting the multidimensional parameters of the energy consumption data, and extracting the models of different hubs;
(2) reducing the dimension of the sorted energy consumption data by adopting a weighted kernel principal component analysis method;
in the step (2), the weights of the weighted kernel principal component analysis method are calculated by taking the models of the different hubs as labels, and a weighted kernel matrix is constructed by fusing the label information values into the kernel matrix; the weight corresponding to the data in the weighted kernel matrix is

w_ij = exp(−(u_i − u_j)²)

where u_i represents the weight corresponding to the i-th sample;
(3) dividing the data subjected to dimensionality reduction into training data and test data, grouping the training data and endowing the grouped training data with an initial weight value;
(4) performing first round learning of incremental learning on the first group of training samples and the weighted values, and calculating to obtain an average similarity value of the first group of training data;
(5) continuously inputting the residual data into an incremental learning model, and respectively comparing the residual data with the first group of training data to judge similarity values to perform different operations;
(6) when the similarity between the samples is greater than the average similarity value, increasing the weight value of the existing samples; when the similarity value between the samples is smaller than the average similarity value, reducing the weight value of the existing samples, uniformly storing the samples together for the incremental learning of the next round, and realizing sufficient incremental learning through multiple rounds of incremental learning;
(7) in the process of the operation, the threshold parameter, the data sample and the weight value of each round of incremental learning are adjusted along with the difference of data;
(8) when the average similarity value of the data used for a round of incremental learning is smaller than 1/2 of the initial average similarity value, the incremental learning ends; at this point, all data whose weight values are smaller than 1/2 of the initial weight value are deleted;
(9) inputting the obtained data into a BP neural network for energy consumption prediction;
(10) and evaluating the energy consumption prediction result by adopting an evaluation index Root Mean Square Error (RMSE).
2. The method of energy consumption prediction based on sufficient incremental learning of claim 1, wherein: in the step (5), the Euclidean distance is used to calculate the similarity value of the multidimensional data, and the similarity value is calculated as:

Sim(x, y) = 1 / (1 + dist(x, y))

where dist(x, y) represents the Euclidean distance between two different features, i.e. the newly added feature x and the existing feature y:

dist(x, y) = sqrt( Σ_i (x_i − y_i)² )

where x_i and y_i represent the i-th values of the newly added feature x and the existing feature y; the smaller the difference between x and y, the smaller the Euclidean distance and the larger the similarity value, which measures the similarity between the newly added feature and the existing feature.
3. The method of energy consumption prediction based on sufficient incremental learning of claim 1, wherein: the process of sufficient incremental learning in step (6) comprises the following steps:
First, initial weight values are given to all data. Second, the similarity values within the first group of data are calculated to obtain an average similarity value; then the similarity values of the remaining newly added data and the first group of data are calculated respectively. When a similarity value is greater than the existing average similarity value, the weight value of the existing sample is increased; when it is smaller than the existing average similarity value, the weight value of the existing sample is reduced, and all data are stored together for the next round of incremental learning. In the next round of incremental learning, the existing data are first grouped, the similarity values within the first group are calculated to obtain an average similarity value, and the similarity values of the remaining newly added data and the first group are calculated respectively; when a similarity value is greater than the average similarity value, the weight value of the existing sample is increased, and when it is smaller, the weight value of the existing sample is reduced, with all data stored together for the next round of incremental learning. This continues until, in some round, the average similarity value of the data used for incremental learning is smaller than 1/2 of the initial average similarity value, at which point the incremental learning ends; the data whose weight values are smaller than 1/2 of the initial weight value are then deleted, completing the whole process of sufficient incremental learning.
CN202010723405.3A 2020-07-24 2020-07-24 Energy consumption prediction method based on sufficient incremental learning Active CN111832839B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010723405.3A CN111832839B (en) 2020-07-24 2020-07-24 Energy consumption prediction method based on sufficient incremental learning


Publications (2)

Publication Number Publication Date
CN111832839A CN111832839A (en) 2020-10-27
CN111832839B true CN111832839B (en) 2021-04-30

Family

ID=72924843


Country Status (1)

Country Link
CN (1) CN111832839B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112613650B (en) * 2020-12-14 2022-06-28 燕山大学 Novel three-stage short-term power load prediction method
CN115032891B (en) * 2022-08-11 2022-11-08 科大智能物联技术股份有限公司 Polycrystalline silicon reduction furnace control method based on time series prediction
CN116996076B (en) * 2023-09-27 2023-12-19 湖北华中电力科技开发有限责任公司 Intelligent management method for electrical energy consumption data of campus equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110901648A (en) * 2018-09-13 2020-03-24 通用汽车环球科技运作有限责任公司 Vehicle, system, and logic for real-time ecological routing and adaptive drive control
CN111260138A (en) * 2020-01-18 2020-06-09 湖南大学 Thermal power generating unit energy consumption dynamic optimization method based on weighting and self-adaptive parallel association rules

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100763551B1 (en) * 2006-11-15 2007-10-04 삼성전자주식회사 Apparatus for reducing contention in prioritized contention access of wireless local network and method using the same
CN104462846B (en) * 2014-12-22 2017-11-10 山东鲁能软件技术有限公司 A kind of equipment fault intelligent diagnosing method based on SVMs
CN104463381B (en) * 2015-01-04 2017-12-08 福州大学 Building energy consumption Forecasting Methodology based on KPCA and WLSSVM
CN106778795A (en) * 2015-11-24 2017-05-31 华为技术有限公司 A kind of sorting technique and device based on incremental learning
CN107316046B (en) * 2017-03-09 2020-08-25 河北工业大学 Fault diagnosis method based on incremental compensation dynamic self-adaptive enhancement
CN107784325B (en) * 2017-10-20 2020-06-23 河北工业大学 Spiral fault diagnosis method based on data-driven incremental fusion
CN110070145B (en) * 2019-04-30 2021-04-27 天津开发区精诺瀚海数据科技有限公司 LSTM hub single-product energy consumption prediction based on incremental clustering




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant