CN110070145A

CN110070145A - LSTM wheel hub single-item energy consumption prediction based on increment cluster

Info

Publication number: CN110070145A
Application number: CN201910364392.2A
Authority: CN
Inventors: 陈珊珊; 马东方; 路海伦; 焦正杉
Original assignee: Tianjin Development Zone Jingnuo Ocean Data Technology Co Ltd
Current assignee: Tianjin Development Zone Jingnuo Ocean Data Technology Co Ltd
Priority date: 2019-04-30
Filing date: 2019-04-30
Publication date: 2019-07-30
Anticipated expiration: 2039-04-30
Also published as: CN110070145B

Abstract

The invention discloses the LSTM wheel hub single-item energy consumption clustered based on increment predictions, are related to wheel hub single-item energy consumption electric powder prediction.The present invention realizes the clustering to hubless feature parameter using the dynamic increment Density Clustering based on PCA, obtain the affiliated historical product classification of newly-increased product, and single-item energy consumption strong solution is filtered out using Pearson coefficient and Adaptive-Lasso algorithm based on energy consumption Factor system and releases sexual factor, the prediction of new product explanatory variable by force is realized using BP, constructing LSTM energy consumption prediction model for the product that respectively clusters realizes being effectively predicted for new product unit consumption, the optimization to LSTM is realized using ADE, while the dynamic for introducing incremental learning strategy implementation model updates.The present invention demonstrates the validity of prediction technique, make the root-mean-square error RMSE of energy consumption prediction down to 0.016524,0.013653 is averagely reduced relative to other algorithms without incremental learning, the search performance of its ADE is more excellent simultaneously, the RMSE ratio of training set has the average reduction 0.004089 of incremental learning DE-LSTM, and effectively shortens runing time.

Description

LSTM wheel hub single-item energy consumption prediction based on increment cluster

Technical field

The present invention relates to wheel hub single-item energy consumption electric powder prediction more particularly to a kind of LSTM wheel hubs based on increment cluster Single-item energy consumption prediction technique.

Background technique

State's inner wheel hub industry is driven to realize under the thriving demand in vehicle market fast-developing, at present China existing more than 300 Wheel hub manufacturer, family, automotive hub yield keep double-digit growth speed, it is contemplated that domestic automobile after market wheel hub in 2022 Demand is at 6,040,000 or so, it is seen that wheel products have entered " made in China epoch ".Therefore for response market demand and Meet the personalized customization requirement of customer, the production model that wheel hub manufacturing enterprise gradually develops as more category small lots.But working as needs When producing the new product of no knowhow, due to the accumulation of no historical energy consumption data, the unpredictable new product energy consumption of conventional model, And energy consumption directly affects the single-item cost of wheel hub, therefore the energy consumption for studying wheel hub new product is of great significance.

At present in terms of establishing energy consumption prediction model, because of the depth integration of intelligent production and technology of Internet of things, industry is raw Data acquisition facilitates quick during production, and various intelligent predicting technologies have been used for energy demand management, not with Accurate Prediction The energy demand come is mainly taken using time series forecasting, gray prediction, support vector machines, neural network prediction etc. at present Obtained preferable research achievement.Article [Air Transport Enterprises Under energy consumption prediction model (English) of the such as Geng Hong based on small echo-ARIMA [J] lathe and hydraulic, 2018,46 (06): 13-17+42.] propose that a kind of ARIMA decomposed based on two multi-scale wavelets predicts mould Type；Article [Su Wei equalization papermaking enterprise technical process energy consumption predictive simulation [J] Computer Simulation, 2016,33 (08): 438-442+447.] by being improved to the maximum likelihood parameter estimation method in ARIMA model, it proposes based on Bayes's ARIMA model carries out energy consumption prediction.[such as Li Lixin are based in energy consumption prediction [J] of Gray Markov Model for article State's scientific and technological information, 2018 (15): 74-75.] it is directed to the prediction of AND ENERGY RESOURCES CONSUMPTION IN CHINA total amount, propose the base in grey forecasting model The viewpoint of Markov chain is introduced on plinth；Article [boat of the such as Liu Jiaxue based on metabolic grey Markov-arma modeling Empty company's energy consumption prediction (English) [J] lathe and hydraulic, 2017,45 (18): 55-62.] by the prediction result of GM (1,1) by horse Er Kefu amendment, and with metabolism method deleting madel in no longer effective property legacy data, by time slip-window with Arma modeling is to residual error corrections；Article [application of the .GM-WLSSVM such as Du Ruizhi model in the prediction of office building electric power energy consumption [J] computer application and software, 2018,35 (09): 44-49+55.] utilize gray model to choose different samples to same a period of time Prediction result, is combined followed by Weighted Least Squares Support Vector Machines model, realizes by the prediction of Duan Jinhang polymorphic type The short-term forecast of office building electric energy；[such as Wang Kun are predicted article based on the airport energy consumption of the EMD and LSSVM of drosophila parameter optimization [J] computer age, 2017 (04): 35-40.] propose the minimum two of a kind of combination empirical mode decomposition and drosophila parameter optimization Multiply the energy consumption prediction technique of support vector machines；Article [papermaking enterprise energy consumption prediction model of the such as summer prestige an ancient unit of weight based on PSO-LSSVR Research [J] computer measurement and control, 2013,21 (12): 3433-3435+3438.] propose it is a kind of based on particle group optimizing Least square support vector regression energy consumption prediction model；[it is pre- that Guo Xiao waits the short-term energy optimization of the Civil Aviation Airport electricity consumption quietly to article Survey emulation [J] Computer Simulation, 2018,35 (09): 31-36.] propose improved grey color depth conviction net combined prediction mould Type improves the short-term energy consumption model precision of prediction of Civil Aviation Airport electricity consumption；[such as Chen Zhoulin improve PSO-BP neural network forecast mould to article Application [J/OL] the light science and technology of type in the prediction of papermaking energy consumption, 2018 (11): 91-94 [2019-01-08.], which is established, to be based on Improve the BP neural network energy consumption prediction model of particle swarm algorithm optimization；[such as Chen Yanxi are built article based on the green offfice of ANN Build HVAC system operation energy consumption prediction [J] building energy conservation, 2017,45 (10): 1-5.] establish classification multilayer perceptron nerve Network Prediction Model realizes the energy consumption prediction of heating ventilation air-conditioning system in office building；[the one kind such as Zhang Yuhang is based on article Short-term electro-load forecast method [J] the power information and the communication technology of LSTM neural network, 2017,15 (09): 19-25.] Using Power system load data itself as training data and output label, the electricity based on LSTM is established by the method for repetitive exercise Power load forecasting model.

It can be seen that from above-mentioned document currently without the forecasting problem for researching and analysing single-item energy consumption, and hybrid prediction model More it is capable of the overall performance of improved model.Due to the relationship between energy consumption and its influence factor be it is nonlinear, using nerve net Network more can be realized accurate prediction, but the time modeling ability of traditional feedforward neural network is fairly limited, in output In the case that predicted value depends on the long history of input feature vector sequence, LSTM is able to solve the problem of learning long-rang dependence.And such as The strong explanatory influence factor what obtains energy consumption is the difficult point of research and the basis of building prediction model, and is pushed away along with the time It moves, data are increasing, these above-mentioned prediction techniques are conventional batch predictions, be not suitable for the constantly newly-increased scene of data volume, How incremental learning and current urgent problem carried out on original learning foundation.

Summary of the invention

The purpose of the present invention is to provide a kind of LSTM wheel hub single-item energy consumption prediction techniques based on increment cluster.It is intended to adopt With PCA to the feature vectors dimensional down for determining history wheel hub production model after, obtained and similar with new product gone through using clustering algorithm History single-item classification；Then, energy consumption Factor system is constructed based on order data, creation data, crucial consumable component parameter, utilized Pearson coefficient and Adaptive-Lasso obtain with the strongly connected strong explanatory variable of single-item energy consumption, and use BP nerve Sexual factor value is released in the strong solution that neural network forecast goes out new product；The single-item energy for the wheel hub that respectively clusters finally is constructed based on the above method Prediction model is consumed, proposes that a kind of LSTM incremental update wheel hub unit consumption prediction model based on ADE, the model use ADE algorithm pair The initial parameter of LSTM model scans for, and when there is newly-increased sample, is updated to prediction model.The method achieve right The incrementally updating of new wheel hub single-item energy consumption being effectively predicted with model, not only increases precision of prediction, while reducing data Memory space and model calculate the time.

The technical solution used in the present invention is a kind of LSTM wheel hub single-item energy consumption prediction technique based on increment cluster, It is characterized in that including the following steps:

(1) collect the characteristic parameter sample of different model wheel hub, including 12 characteristic parameters: wheel diameter, wheel rim width, Centre bore away from, bolt hole count, pitch diameter, offset distance, weight, spoke number, spoke front pose, center disk moulding, wheel hub Material, manufacturing process；

(2) order data, creation data, the crucial consumable component parameter, single-item energy consumption data composition of different model wheel hub are collected Energy consumption sample；Wherein order data, creation data, crucial consumable component parameter constitute energy consumption Factor system, order data packet Single-item model, single-item order volume, category sum are included, when creation data includes production time, raw material input amount, equipment operation Between, production efficiency, rejection rate, wherein production efficiency by machine add efficiency (part/hour) and the aspect of finishing efficiency (minute/part) two Lai Consider, crucial consumable component parameter includes drill bit usage amount, cutter usage amount；

(3) after the character shape parameter of history hubless feature parameter sample in step (1) being carried out labeling processing, using master Parameter sample is down to two dimension and obtains data set P by constituent analysis；

(4) cluster operation then is carried out to P using dynamic increment density clustering algorithm, obtains the original result C that clusters_i(i= 0,1 ..., k) it ∈ P and peels off and collects O ∈ P；

(5) when there is newly-increased feature parameter sample, to its using step (3) pretreatment after Δ P, search for O ∪ Δ P in Original clusters the reachable data object of density, updates cluster result, and output clusters C '_i(i=0,1 ..., k ')=C_i∪ΔC_i, peel off Collect O ' and newly-increased product generic；

(6) sexual factor is released according to the strong solution of energy consumption Factor system analysis single-item energy consumption in step (2), utilized Pearson coefficient rejects the weak related and uncorrelated factor in energy consumption Factor system, followed by Adaptive- Lasso algorithm carries out second of variables choice and obtains the strong explanatory variable of wheel hub single-item energy consumption；

(7) strong explanatory to being obtained by step (3) labeling treated history hubless feature parameter, step (6) Variable and single-item power consumption values are standardized；

It (8) is input with the history hubless feature parameter after step (7) standardization, to be marked by step (7) Treated that strong explanatory variable is output for standardization, constructs the BP prediction model of new wheel hub explanatory variable by force, it is new to be able to prediction The strong explanatory variate-value of wheel hub；

(9) the single-item energy consumption prediction model that respectively clusters is constructed according to the Historic Clustering result of step (4), by the history energy that respectively clusters It consumes sample standard deviation and training set and test set is randomly divided into the ratio of 3:1, training set is divided into four groups, wherein one group of carry out step (9), remaining set is successively used as a sample increment collection；

(10) the LSTM single-item energy consumption prediction model that respectively clusters is constructed, by training set after step (7) standardization Strong explanatory variable is input, and the single-item energy consumption after step (7) standardization is output, is calculated in model construction using ADE Method realizes the optimization to LSTM parameter；

(11) it in the LSTM single-item energy consumption prediction model that respectively clusters, is carried out using sample increment the set pair analysis model in step (9) Incremental update；

(12) according to the test set test respectively to cluster in step (9) by step (11) updated prediction mould respectively to cluster Type, and model evaluation is carried out, it is handled simultaneously for the characteristic parameter sample of product to be predicted by step (3)-(5), Yi Jijing The strong explanatory variable that step (7)-(8) BP prediction model obtains new wheel hub is crossed, it is updated each followed by step (11) The LSTM single-item energy consumption prediction model that clusters is predicted, the predicted value of newly-increased product energy consumption is exported.

A further technical solution lies in cluster the characteristic parameter sample set P after dimensionality reduction in the step (4) Process includes the following steps:

1) for the raw data set P after dimensionality reduction, determine the value of coefR, calculate density adjusting parameter σ and density up to away from From R, and calculate the density value Density (P of each data object in P_i), obtain Density (P in P_i) maximum local density Attractor Attractor:

In formula,Indicate point P_iTo point P_jEuclidean distance,The average value of distance between each point in sample, CoefR (0 < coefR < 1) is the original regulation coefficient of density reach distance, and n is total sample number；

2) data object in raw data set P is scanned, by density-attractors Attractor and the reachable data pair of its density As being assigned to first cluster C₀, and concentrated from initial data and delete the object that clusters；

3) for remaining data collection, another density-attractors Attractor is searched_i, calculate density self-adapting up to away from From R_adap,i, the density-attractors and the reachable data object of its density are assigned to another C that clusters_i, and from raw data set The middle deletion object that clusters, and so on, finally the cluster with little data object is put into exceptional value or noise group, at this time Obtain the original result C that clusters_i(i=0,1 ..., k) ∈ P, peels off and collects O ∈ P:

R_adap,i=α R

α is regulation coefficient in formula, and formula is as follows:

A further technical solution lies in increment clustering methods in the step (5) are as follows:

1) for pretreated incremental data set Δ P, Δ P, which is peeled off with original, to be collected O and merges, and is searched for each with original in O ∪ Δ P Cluster C_iThe reachable data object of density updates cluster result C_i(i=0,1 ..., k) ∈ (P ∪ Δ P), at this time remaining data collection For

2) to remaining data collectionData object carry out clustering, be likely to be present inIn cluster Δ C_i(i=k+1 ..., k ', k ' >=k), update cluster result as C '_i(i=0,1 ..., k ') =C_i(i=0,1 ..., k) ∪ Δ C_i, and peel off and collect O '；

If 3)It then detects whether containing the data object in increment sample Δ P in O ', if joining containing new product Numerical example, then the distance for comparing the new sample point to the middle each point that respectively clusters will in order to realize the energy consumption prediction of the new product It is first tagged to apart from cluster where its closest approach；

4) after for the pretreatment of other incremental data sets, the existing label that clusters collected in O ' that peels off is deleted, step is repeated It is rapid 1) to 3)；

5) cluster result C ' is exported_i(i=0,1 ..., k ') and peel off set O ' and new product generic.

A further technical solution lies in single-item energy consumption prediction model construction methods in the step (10) are as follows:

1) dimension of ADE individual is equal to the weight of LSTM neural network and the summation of threshold number, initializes evolution number G =0, population scale N, crossover probability f, mutation probability cr, and using root-mean-square error RMSE as fitness function:

GenM is maximum number of iterations in formula, and G is current iteration number, and α, β are the constant in [0.5,1] range, y_tFor True value, k are the number of data object；

2) fitness value, that is, RMSE of each individual in population is calculated；

3) if minimum RMSE is met the requirements in current population or current iteration number G=GenM, ADE termination changes In generation, obtains optimized individual and executes step 5), otherwise continues to execute step 4)；

4) new group is obtained according to adaptive intersection, TSP question and selection operation, G=G+1 is set, return executes step It is rapid 3)；

5) optimize the initial connection weight and threshold value for obtaining optimum individual as LSTM based on ADE；And with training sample pair LSTM implements training, and then obtains optimum network.

A further technical solution lies in increase sample newly to the update method of LSTM prediction model in the step (11) Are as follows:

1) for each newly-increased energy consumption sample, first which cluster c judgement belongs to_i, corresponding i-th of the LSTM model of i-th of cluster；It determines After affiliated model, model parameter is updated on the basis of model history data training, i.e., by the parameter after historical data training Network is initialized, the strong explanatory variable of new samples is input in prediction model, is obtained newly by the forward calculation of LSTM The predicted value of sampleBy predicted valueIt is added to the error of actual value y on original error function J (θ)；

In formula, parameter θ=(W_f,W_i,W_c,W_o,b_f,b_i,b_c,b_o)

2) model parameter value is updated according to error function J (θ) backpropagation of update:

θ '=(W_f-λ*Δw_f,…,W_o-λ*Δw_o,b_f-λ*Δb_f,…,b_o-λ*Δb_o)

In formula, λ is learning rate, Δ w_f、Δw_oWith Δ b_f、Δb_oIt is the weight of neuron and the gradient square of offset respectively Battle array and vector.

The beneficial effects of adopting the technical scheme are that

LSTM wheel hub single-item energy consumption prediction technique proposed by the present invention based on increment cluster, using the dynamic based on PCA Increment Density Clustering realizes the clustering to hubless feature parameter, thus obtain the affiliated historical product classification of newly-increased product, And single-item energy consumption strong solution is filtered out using Pearson coefficient and Adaptive-Lasso algorithm based on energy consumption Factor system and is released Sexual factor, and then realize using BP the prediction of new product explanatory variable by force, it is finally the product building LSTM energy consumption prediction that respectively clusters Being effectively predicted for new product unit consumption of model realization, using ADE realizes the optimization to LSTM, while it is real to introduce incremental learning strategy The dynamic of existing model updates.By analysis of experiments, the LSTM wheel hub single-item energy consumption prediction based on increment cluster of proposition is demonstrated The validity of method, the root-mean-square error RMSE for predicting energy consumption is down to 0.016524, relative to other calculations without incremental learning Method averagely reduces 0.013653, while the search performance of its ADE is more excellent, and the RMSE ratio of training set has incremental learning DE-LSTM Average reduction 0.004089, and effectively shorten runing time.

Detailed description of the invention

The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments.

Fig. 1 is that the present invention is based on the LSTM wheel hub single-item energy consumption prediction technique structure charts of increment cluster；

Fig. 2 is the dynamic increment Density Clustering flow chart the present invention is based on PCA；

Fig. 3 is that the present invention is based on the LSTM incremental update wheel hub unit consumption of ADE to predict flow chart；

Fig. 4 is cluster analysis result figure of the present invention；

Fig. 5, Fig. 6 are respectively the RMSE variation of the RMSE variation diagram and hidden layer number of nodes and X9 of hidden layer number of nodes and X8 Figure；

Fig. 7 is the RMSE variation diagram of the LSTM number of plies with the prediction model that respectively clusters；

Fig. 8 is the RMSE variation diagram of LSTM node layer number with the prediction model that respectively clusters；

Fig. 9 is cluster C₀Energy consumption incremental data in the training precision comparison diagram of five kinds of algorithms；

Figure 10 is cluster C₁Energy consumption incremental data in the training precision comparison diagram of five kinds of algorithms；

Figure 11 is cluster C₄Energy consumption incremental data in the training precision comparison diagram of five kinds of algorithms；

Specific embodiment

The present invention proposes a kind of dynamic increment density clustering algorithm based on PCA, i.e., based on symbol wheel hub production model Characteristic parameter data obtain history single-item classification similar with new product using clustering algorithm；Then, Pearson coefficient is utilized Sexual factor is released in strong solution with Adaptive-Lasso algorithm analysis single-item energy consumption, and predicts new product using BP neural network Factor value is released in strong solution；Finally, proposing a kind of LSTM incremental update wheel hub unit consumption prediction model based on ADE, which utilizes ADE Algorithm reduces influence of the initiation parameter to model accuracy, and introduces incremental learning strategy, and the dynamic of implementation model updates.

One, the theoretical foundation of the method for the present invention

1, principal component analysis (Principal Component Analysis, PCA): by linear transformation by initial data It is transformed to the expression of one group of each dimension linear independence (unit is orthogonal), can be used for extracting the main feature component of data, protect simultaneously Changeability as much as possible is stayed, the dimensionality reduction of high dimensional data is usually used in.

2, the dynamic increment cluster based on density: it is density-attractors that it is maximum, which to define data dot density, never complete every time It is concentrated at the remaining data of cluster and finds local density's attractor and the reachable sample point of its density, to gather for one kind, and be It is suitable for dynamic incremental data library, it is adaptive to adjust density reach distance.

3, Pearson correlation coefficients (Pearson coefficient): for the degree of correlation reflected between two variables, in engineering It can be used to calculate the similarity between feature and classification in habit, that is, can determine whether that extracted feature and classification are to be positively correlated, is negative Correlation is again without degree of correlation.

4, Adaptive-Lasso algorithm: being the improvement to Lasso algorithm, by construct a penalty function obtain one compared with For the model of refining, so that it compresses some coefficients, concurrently setting some coefficients is zero, and then is realized when variable is numerous Significant variable is fast and effeciently extracted, thus simplified model.

5, shot and long term Memory Neural Networks (Long-Short Term Memory, LSTM): it is the deformation to RNN, that is, exists On the basis of common RNN, by increasing memory unit in each neural unit of hidden layer, to make the recall info in time series It controllably, can by several controllable gates (forgeing door, input gate, candidate door, out gate) when being transmitted between hidden layer each unit every time With the memory of information and current information before control and degree is forgotten, so that RNN network be made to have long-term memory function.

6, adaptive differential evolution algorithm: (Adaptive differential evolution algorithm, ADE): By adaptively adjusting zoom factor and crossover probability in calculating process, the improvement to standard difference evolution algorithm is realized.

Two, based on the LSTM wheel hub single-item energy consumption prediction technique design of increment cluster:

LSTM wheel hub single-item energy consumption prediction technique working principle based on increment cluster is as follows: real to newly-increased product when needing When applying the prediction of single-item energy consumption, after being down to two dimension to the characteristic parameter for determining history wheel hub production model using PCA first, using dynamic State increment density clustering algorithm obtains history single-item classification similar with new product；Then, based on order data, creation data, Crucial consumable component parameter constructs energy consumption Factor system, is obtained and list using Pearson coefficient and Adaptive-Lasso algorithm The strongly connected strong explanatory variable of product energy consumption, and using hubless feature parameter as input, strong explanatory variable as output, building The BP prediction model of new product explanatory variable by force；Finally, constructing the single-item energy consumption for the wheel hub that respectively clusters based on the above method Prediction model weakens influence of the initiation parameter to model using ADE algorithm, it is contemplated that the processing problem of newly-increased energy consumption sample, The strategy for introducing incremental learning, by judging the determination prediction model to be updated that clusters belonging to new samples, by determined model Initial parameter of the history parameters as the model, and the error between new samples predicted value and actual value is added to whole mistake In difference, model parameter is updated come iteration using the method that minimizes the error, guarantees that model can constantly handle the newly-increased of addition with this Sample.

1, the LSTM wheel hub single-item energy consumption prediction technique structure based on increment cluster

The present invention show that history similar with newly-increased production mode produces by the dynamic increment Density Clustering based on PCA Category is other, and is LSTM incremental update wheel hub unit consumption prediction model of the building based on ADE that respectively cluster based on the result that clusters, and realizes single The prediction of product energy consumption.As shown in Figure 1, after it is down to two dimension by PCA, passing through when there is newly-increased product feature parameter sample Dynamic increment density clustering algorithm updates cluster result and obtains its affiliated historical product classification, at the same by Pearson coefficient and Adaptive-Lasso algorithm realizes the screening to energy consumption explanatory variable by force, and predicts to obtain new volume increase by BP neural network Sexual factor is released in the strong solution of product, and then is input to the LSTM model of corresponding A/D E optimization, to obtain the single-item energy consumption of newly-increased product Value, and when there is newly-increased energy consumption sample, according to the error update model error function between its predicted value and true value, adjust mould Shape parameter, the incremental update of implementation model.

2, algorithm implements

LSTM wheel hub single-item energy consumption prediction technique based on increment cluster includes the following steps:

(1) the characteristic parameter sample of different model wheel hub, including 12 characteristic parameters: wheel diameter (p1), wheel rim are collected Width (p2), centre bore are away from (p3), bolt hole count (p4), pitch diameter (p5), offset distance (p6), weight (p7), spoke number (p8), spoke front pose (p9), center disk moulding (p10), hub material (p11), manufacturing process (p12)；

(2) order data, creation data, the crucial consumable component parameter, single-item energy consumption data composition of different model wheel hub are collected Energy consumption sample；Wherein order data, creation data, crucial consumable component parameter constitute energy consumption Factor system, order data packet It is total (X3) to include single-item model (X1), single-item order volume (X2), category, creation data includes production time (X4), raw material throwing Enter amount (X5), operation hours (X6), production efficiency (machine adds efficiency X7, finishing efficiency X8), rejection rate (X9), crucial consumable component Parameter includes drill bit usage amount (X10), cutter usage amount (X11)；

In embodiment of the present invention, the characteristic parameter sample set P after dimensionality reduction clustered in the step (4) Journey includes the following steps:

R_adap,i=α R

α is regulation coefficient in formula, and formula is as follows:

In embodiment of the present invention, increment clustering method in the step (5) are as follows:

In embodiment of the present invention, single-item energy consumption prediction model construction method in the step (9) are as follows:

Sample is increased in embodiment of the present invention, in the step (11) newly to the update method of LSTM prediction model are as follows:

In formula, parameter θ=(W_f,W_i,W_c,W_o,b_f,b_i,b_c,b_o)

θ '=(W_f-λ*Δw_f,…,W_o-λ*Δw_o,b_f-λ*Δb_f,…,b_o-λ*Δb_o)

Three, data describe

The data of experimental data source Mr. Yu's wheel hub Manufacturing Co., Ltd, the said firm have been respectively mounted Intelligent Instrument in production line Device, for obtaining consumed electric energy, water and natural gas in production process.This is tested for predicting single-item power consumption, Since amount of available data is less, partial data is simulated with Python, has obtained in January, 2015 at present into December, 2018 The related data of 725 wheel-type, in this experiment, wherein 475 wheel-type are used as historical product sample for random selection, remaining 250 wheel-type are considered as newly-increased product for verifying the prediction result of model.

By the characteristic parameter sample of 475 selected wheel-type as raw data set P, the ginseng of remaining 250 wheel-type The random equivalent of numerical example is divided into five groups, for being incremental data set Δ P₁、ΔP₂、ΔP₃、ΔP₄、ΔP₅Increment cluster is carried out, Specific features parameter pattern representation is as shown in table 1 below.

The description of 1 characteristic parameter data of table

On the basis of cluster result, the LSTM incremental update wheel hub unit consumption prediction model based on ADE is constructed.725 wheels Order data, manufacturing parameter, crucial consumable component parameter and the energy consumption data of type form energy consumption sample, and wherein energy consumption data is single-item Power consumption amounts to 34,800 samples, wherein 475 historical products include 22,800 samples.Historical product is respectively clustered institute Containing sample standard deviation according to 3:1 ratio random division be training sample and test sample, construct the energy consumption prediction model of each cluster. In the training process, the training set equivalent respectively to cluster is divided into four groups, wherein one group for constructing prediction model, remaining three Group is added to existing model and carries out incremental learning.Specific energy consumption data and its Correlative Influence Factors are as shown in table 2 below.

2 energy consumption sample of table

1, model parameter and structure

(1) clustering

For feature spoke front pose (p9), center disk moulding (p10), hub material (p11), manufacturing process (p12) it is character type, needs its standard makeup being changed to numeric type data.

It uses the model split of random sampling at historical product sample in sample, increase product sample newly, wherein historical product 475, remaining is that newly-increased product is added to Clustering Model progress incremental update in five times, and coefR value is 0.6, this time Experiment has carried out 20 random samplings, wheel hub finally can be divided into 5 classes, cluster result is as shown in Figure 4.

One such experimental result is selected to be illustrated: by the parameter sample after randomly select 475 wheel-type dimensionality reductions Clustering is carried out, sample has finally been divided into 6 class C_-1、C₀、C₁、C₂、C₃、C₄, there is 13,261,53,27,32,89 samples respectively Point, wherein C_-1It is the cluster comprising noise spot, by C_-1Collect O as peeling off.Remaining 250 hub parameter samples point 5 times to cluster As a result incremental update is carried out, searching for can be with the C that clusters in increment sample₀、C₁、C₂、C₃、C₄The reachable data object of density, finally Cluster C₀、C₁、C₄Separately include 303,78,185 samples, peel off collect O include 9 sample points, C₂、C₃No new samples are added.

(2) energy consumption Analysis of key influential factors

1) correlation analysis

Tentatively judge whether there is linear dependence, meter between single-item power consumption and each influence factor using Pearson coefficient It calculates shown in result table 3.

3 Person coefficient table of table

X1	X2	X3	X4	X5	X6	X7	X8	X9	X10	X11
											-0.015	0.804	0.637	0.003	0.004	0.612	-0.896	0.748	0.530	0.629	0.616

It follows that X1, X4, X5 and single-item energy consumption are almost uncorrelated, X7 and single-item energy consumption are in strong negatively correlated, surplus variable It is rendered as being positively correlated.Therefore by being somebody's turn to do as a result, these three weak correlative factors of X1, X4, X5 can be removed.

2) Adaptive-Lasso variables choice

Then second of Variable Selection is carried out using Adaptive-Lasso algorithm to surplus variable, as a result shown in table 4:

4 Adaptive-Lasso coefficient table of table

X2	X3	X6	X7	X8	X9	X10	X11
								-0.1705	0.0776	0.0000	0.0000	0.4209	0.3929	0.0000	0.0000

It is found that the coefficient of X6, X7, X11, X10 are 0, then when constructing model, these variables are removed, surplus variable X2, X3, X8, X9 are then used as the key influence factor of single-item power consumption.

3) sexual factor prediction is released in the strong solution of new product

By above-mentioned analysis, show that X2, X3, X7, X9 are that sexual factor is released in the strong solution of energy consumption, X2 is order volume, and X3 is same The different product model number that production line produces within the same production cycle, therefore can be according to reality for X2, X3 for increasing product newly Situation is manually set, and X8, X9 need to obtain using BP neural network prediction, becomes 12 characteristic parameters of wheel hub as input Amount, X8, X9 are to export.

The mode that experimentation is all made of k-fold cross validation is trained model, and initial data is divided into k group (K- Fold), each subset data is made into one-time authentication collection respectively, remaining k-1 group subset data obtains k mould as training set Type.This k model is respectively in verifying central evaluation as a result, the smallest model of final choice error.K is taken as 4 in an experiment, i.e., will Data are divided into 4 parts, and k-1=3 parts of data therein is selected to train, and remaining a for verifying, network is 4 times trained in total, often A data have the opportunity to as checksum set.Cycle-index trained every time is gradually increased to 300 during the experiment, i.e., Epoch=300, and getting off after each training to the error log of verification data.In each training process, by learning rate Dynamic adjustment is carried out, when learning to stagnate, learning rate is less in 0.1 times of form.

It is identical by setting BP other parameters for the determination of hiding number of layers, hiding number of layers is gradually increased by 1 layer To 5 layers test predicted value RMSE, as shown in table 5 below, hidden layer be 1 layer when X8, X9 RMSE value it is smaller, when hide layer by layer When number is gradually increased to 5 by 2, the value of RMSE fluctuates in smaller range, therefore selects 1 hidden layer.

Table 5 hides influence of the number of plies to key influence factor precision of prediction

For the determination of hidden layer interstitial content, same setting BP other parameters are identical, and according to formulaSearch, x be input layer number i.e. 12, y be output layer number of nodes i.e. 2, a be it is normal more than or equal to 1 Number, z is hidden layer number of nodes, therefore can set 5 for hidden layer number of nodes initial value and gradually increase, and records the mistake of predicted value Difference, experimental result is as shown in Figure 5,6, and when hiding node layer increases to 16 by 5, the RMSE of finishing efficiency and rejection rate is gradually It reduces, and when hidden layer number of nodes continues growing, the RMSE of finishing efficiency and rejection rate is gradually increased, therefore hides node layer Number is taken as 16.

By above-mentioned experimental result it is found that predicting that the BP neural network of newly-increased product key influence factor is total to 3-tier architecture, wherein Input layer number identical as hubless feature parameter dimension i.e. 12, hidden layer number of nodes is 16, and output layer number of nodes is 2.By this Model running 10 times, the average value of the predicted value of obtained X8, X9 does final predicted value, while each newly-increased product is arranged X2, X3 variable, for doing the input of verifying energy consumption prediction model.

(3) respectively cluster energy consumption prediction model

According to above-mentioned clustering as a result, historical product has been aggregated into 5 classes, historical product is respectively clustered contained energy consumption Sample standard deviation is training sample and test sample according to the ratio random division of 3:1, constructs the energy consumption prediction model of each cluster.It is instructing During white silk, the training set equivalent respectively to cluster is divided into four groups, wherein one group is used to construct prediction model, remaining three groups add It is added to existing model and carries out incremental learning.The input layer number of each model releases the dimension of sexual factor identical i.e. 4 with strong solution, defeated Node layer number is 1 out, that is, exports the single-item power consumption values under the influence of each feature.In addition the population scale that ADE searches for part is set as 50, the number of iterations 100, α, β take 0.8,0.5 respectively.

1) determine that LSTM is counted layer by layer

According to the C that clusters₀、C₁、C₂、C₃、C₄The single-item energy consumption data for being included constructs the single-item energy consumption prediction mould of such product Type.The other parameters that model is arranged are identical, the LSTM layer of the neural network are gradually increased to 10 layers from 1 layer, record cast is pre- The situation of change of the RMSE of measured value, shown in experimental result Fig. 7.For classification C₀With C₄Prediction model for, when LSTM layers by 1 When being gradually increased to 3 layers, the value of RMSE is gradually reduced, and when the LSTM number of plies continues growing, RMSE goes out in smaller range It now fluctuates, therefore C₀With C₄Prediction model can be identified as 3 layers of LSTM structure.And for C₁、C₂、C₃Prediction model for, when When LSTM increases to 2 layers by 1 layer, the value of RMSE is significantly smaller, and when continuing to increase for LSTM layers, RMSE is in smaller range Fluctuation, without obviously less trend, therefore C₁、C₂、C₃Prediction model LSTM layer use two layers.

2) LSTM node layer number is determined

Determination for the middle LSTM node layer number that respectively clusters, the other parameters that each model is arranged are identical, only concept transfer Number obtains the C that respectively clusters by testing₀、C₁、C₂、C₃、C₄Each node layer number ratio of prediction model be respectively as follows: 4:2:1,2:1, 5:2,5:2,4:3:1, first floor node are gradually increased to 52 by 4, and record cast predicts the variation of error, remaining each node layer number is logical Ratio setting is crossed, experimental results are shown in figure 8.For the C that clusters₀Prediction model for, when number of nodes increases to 24 by 4, Each error criterion significantly reduces, when number of nodes increases to 36 by 28, the trend that error criterion is reduced, but only compared with A small range fluctuation, when continuing growing number of nodes, error criterion is on the rise, therefore first layer uses 24 nodes.Together Reason is it is found that C₁、C₂、C₃、C₄Prediction model first layer be respectively 16,20,20,20.

2, interpretation of result

Cluster C₀、C₁、C₄There is newly-increased product to be added, therefore to analyze the precision of prediction of newly-increased product single-item energy consumption, comparative test is adopted Clustered corresponding prediction model with this three.By cluster C₀、C₁、C₄Contained historical product energy consumption sample standard deviation according to 3:1 ratio with Machine is divided into training sample and test sample.In the training process, training set equivalent is divided into four groups, and one group for constructing prediction Model, remaining three groups are added to existing model and carry out incremental learning.Increment Density Clustering knot is based on using proposed by the present invention The same LSTM, BP of energy consumption prediction technique, SVR, the DE-LSTM method without incremental learning for closing LSTM carry out incremental learning comparison, The input and output of middle traditional algorithm use the identical input and output of this method, record the precision of prediction and runing time of 10 experiments And averaged, comparing result are as shown in table 6 below.

6 model prediction accuracy of table compares

As seen from the above table, the LSTM wheel hub single-item energy consumption prediction model proposed by the invention based on increment cluster is being predicted Precision and training time are better than other algorithms substantially.From the point of view of precision of prediction, this method is in training stage, test phase and new The RMSE for increasing product energy consumption predicted value is minimum, and the RMSE minimum of training set is down to 0.016524, the RMSE of model training stage 0.012545,0.015129,0.007131 is averagely reduced compared to LSTM, BP, DE-LSTM, is averagely reduced compared to SVR 0.019806, it is seen that the LSTM wheel hub single-item energy consumption prediction model based on increment cluster initially joins model due to using ADE Number optimizes, it is often more important that uses incremental learning strategy, it is contemplated that the timing feature of energy consumption characters, therefore make model Precision of prediction be obviously improved.From the point of view of runing time, the runing time of this method is more than SVR, this is because this method Need to construct the LSTM neural network of multilayer, and the training time and runing time relative to other algorithms are less, this be by Incremental learning can not be carried out in other algorithms, needs re -training therefore to increase runing time when adding training set every time.

Next it is analyzed for the training for adding training data each time, as Fig. 9-11 indicates each institute that clusters When the history energy consumption sample for including is added sequentially to model in four times, each time be added when training precision variation diagram, equally with LSTM, BP, SVR, the DE-LSTM algorithm without incremental learning compare, by the visible algorithm of Fig. 9-11 in first time training The value no significant difference of RMSE, as the addition of incremental data is bright relative to the value reduction of other algorithm RMSE without incremental learning It is aobvious.

Four, conclusion

For solve the problems, such as conventional machines learning model cannot achieve new product unit consumption prediction and can not incremental learning, this hair The bright LSTM wheel hub single-item energy consumption prediction technique for proposing to cluster based on increment, using the dynamic increment Density Clustering based on PCA point The characteristic parameter for analysing wheel hub, to obtain historical product classification similar with newly-increased product, while by Person coefficient and Sexual factor is released in the strong solution that Adaptive-Lasso algorithm filters out single-item energy consumption, and BP algorithm prediction is combined to obtain newly-increased product Sexual factor is released in strong solution, and then is that every a kind of product individually constructs energy consumption prediction model according to cluster result, and it is adaptive poor to use Point evolution algorithm optimizes LSTM, while introducing incremental learning strategy to consider the sequential character of sample, newly-increased when having When product energy consumption sample, incremental update can be carried out to model, save time cost and space consumption.By analysis of experiments, test Demonstrate,proved propose based on increment cluster LSTM wheel hub single-item energy consumption prediction technique validity, make energy consumption predict RMSE down to 0.016524,0.013653 is averagely reduced relative to other algorithms without incremental learning, is realized to the reliable of single-item energy consumption Prediction.Main advantage is as follows:

(1) it is directed to the various problem for causing product classification difficulty of hubless feature parameter complicated and changeable, is proposed based on PCA's Dynamic increment Density Clustering, this method pass through the main feature component of PCA extracting parameter, and the uniqueness of retained product, in turn History single-item classification similar with new product is obtained using clustering algorithm；

(2) aiming at the problem that energy consumption factor importance analysis in wheel hub production process, composite order data, production number Construct energy consumption Factor system according to, crucial consumable component parameter, pass through Pearson coefficient and Adaptive-Lasso algorithm extract it is single Sexual factor is released in the strong solution of product energy consumption, reduces the dimension of prediction input data, and predicts new product using BP neural network Sexual factor value is released in strong solution；

(2) the unpredictable new product energy consumption of conventional machines learning model and can not incremental learning aiming at the problem that, propose to be based on The LSTM incremental update wheel hub unit consumption prediction model of ADE, the model optimize model parameter using ADE algorithm, and will be new The global error of error and historical models between the predicted value and actual value of sample sums it up, and foundation minimizes the error method iteration Model parameter is updated, to guarantee the incremental learning ability of model, keeps model data suitable for production growing number of Actual conditions save space hold and time cost.

Claims

1. a kind of LSTM wheel hub single-item energy consumption prediction technique based on increment cluster, which is characterized in that include the following steps:

(1) the characteristic parameter sample of different model wheel hub, including 12 characteristic parameters: wheel diameter, wheel rim width, center are collected Pitch-row, bolt hole count, pitch diameter, offset distance, weight, spoke number, spoke front pose, center disk moulding, hub material, Manufacturing process；

(2) energy that the order data of collection different model wheel hub, creation data, crucial consumable component parameter, single-item energy consumption data form Consume sample；Wherein order data, creation data, crucial consumable component parameter constitute energy consumption Factor system, and order data includes single Product model, single-item order volume, category sum, creation data includes production time, raw material input amount, operation hours, life Efficiency, rejection rate are produced, is considered in terms of wherein production efficiency adds efficiency and finishing efficiency two by machine, machine adds efficiency unit: part/small When；Finishing efficiency unit: minute/part；Crucial consumable component parameter includes drill bit usage amount, cutter usage amount；

(3) after the character shape parameter of history hubless feature parameter sample in step (1) being carried out labeling processing, using principal component Parameter sample is down to two dimension and obtains data set P by analysis；

(4) cluster operation then is carried out to P using dynamic increment density clustering algorithm, obtains the original result C that clusters_i(i=0, 1 ..., k) it ∈ P and peels off and collects O ∈ P；

(5) it when there is newly-increased feature parameter sample, to it using Δ P is obtained after step (3) pretreatment, searches for poly- with original in O ∪ Δ P The reachable data object of cluster density updates cluster result, exports the C that clusters_i' (i=0,1 ..., k ')=C_i∪ΔC_i, peeling off collects O ' With newly-increased product generic；

(6) sexual factor is released according to the strong solution of energy consumption Factor system analysis single-item energy consumption in step (2), utilizes Pearson system Number rejects the weak related and uncorrelated factor in energy consumption Factor system, carries out followed by Adaptive-Lasso algorithm Second of variables choice obtains the strong explanatory variable of wheel hub single-item energy consumption；

(7) to the strong explanatory variable obtained by step (3) labeling treated history hubless feature parameter, step (6) And single-item power consumption values are standardized；

It (8) is input with the history hubless feature parameter after step (7) standardization, to be standardized by step (7) Treated, and strong explanatory variable is output, constructs the BP prediction model of new wheel hub explanatory variable by force, is able to predict new wheel hub Strong explanatory variate-value；

(9) the single-item energy consumption prediction model that respectively clusters, the history that will respectively cluster energy consumption sample are constructed according to the Historic Clustering result of step (4) This is randomly divided into training set and test set with the ratio of 3:1, and training set is divided into four groups, wherein one group of carry out step (10), remaining set is successively used as a sample increment collection；

(10) the LSTM single-item energy consumption prediction model that respectively clusters is constructed, by strong solution of the training set after step (7) standardization The property released variable is input, and the single-item energy consumption after step (7) standardization is output, real using ADE algorithm in model construction Now to the optimization of LSTM parameter；

(11) in the LSTM single-item energy consumption prediction model that respectively clusters, increment is carried out using sample increment the set pair analysis model in step (9) It updates；

(12) step (11) updated prediction model respectively to cluster is passed through according to the test set test respectively to cluster in step (9), And model evaluation is carried out, it is handled simultaneously for the characteristic parameter sample of product to be predicted by step (3)-(5), and by step Suddenly the BP prediction model of (7)-(8) obtains the strong explanatory variable of new wheel hub, respectively clusters followed by step (11) is updated LSTM single-item energy consumption prediction model is predicted, the predicted value of newly-increased product energy consumption is exported.

2. the LSTM wheel hub single-item energy consumption prediction technique according to claim 1 based on increment cluster, which is characterized in that described Cluster process is carried out to the characteristic parameter sample set P after dimensionality reduction in step (4), is included the following steps:

1) it for the raw data set P after dimensionality reduction, determines the value of coefR, calculates density adjusting parameter σ and density reach distance R, And calculate the density value Density (P of each data object in P_i), obtain Density (P in P_i) maximum local density attracts Sub- Attractor:

In formula,Indicate point P_iTo point P_jEuclidean distance,The average value of distance between each point in sample, coefR (0 < coefR < 1) be density reach distance original regulation coefficient, n is total sample number；

2) data object in raw data set P is scanned, by density-attractors Attractor and the reachable data object of its density point It is fitted on first cluster C₀, and concentrated from initial data and delete the object that clusters；

3) for remaining data collection, another density-attractors Attractor is searched_i, calculate density self-adapting reach distance R_adap,i, the density-attractors and the reachable data object of its density are assigned to another C that clusters_i, and concentrated from initial data The object that clusters is deleted, and so on, finally the cluster with little data object is put into exceptional value or noise group, at this time To the original result C that clusters_i(i=0,1 ..., k) ∈ P, peels off and collects O ∈ P:

R_adap,i=α R

α is regulation coefficient in formula, and formula is as follows:

3. the LSTM wheel hub single-item energy consumption prediction technique according to claim 1 based on increment cluster, which is characterized in that described Increment clustering method in step (5) are as follows:

1) for pretreated incremental data set Δ P, Δ P, which is peeled off with original, to be collected O and merges, and is searched in O ∪ Δ P and is respectively clustered with original C_iThe reachable data object of density updates cluster result C_i(i=0,1 ..., k) ∈ (P ∪ Δ P), remaining data collection is at this time

If 3)It then detects whether containing the data object in increment sample Δ P in O ', if containing new product parameter sample This, then compare the new sample point to the middle each point that respectively clusters distance, in order to realize the new product energy consumption prediction, by its elder generation It is tagged to apart from cluster where its closest approach；

4) after for the pretreatment of other incremental data sets, the existing label that clusters collected in O ' that peels off is deleted, step 1) is repeated To 3)；

4. the LSTM wheel hub single-item energy consumption prediction technique according to claim 1 based on increment cluster, which is characterized in that described Single-item energy consumption prediction model construction method in step (10) are as follows:

1) dimension of ADE individual is equal to the weight of LSTM neural network and the summation of threshold number, initializes evolution number G=0, Population scale N, crossover probability f, mutation probability cr, and using root-mean-square error RMSE as fitness function:

GenM is maximum number of iterations in formula, and G is current iteration number, and α, β are the constant in [0.5,1] range, y_tIt is true Value, k are the number of data object；

3) if minimum RMSE is met the requirements or current iteration number G=GenM, ADE termination iteration in current population, obtain It obtains optimized individual and executes step 5), otherwise continue to execute step 4)；

4) new group is obtained according to adaptive intersection, TSP question and selection operation, G=G+1 is set, is returned to step 3)；

5) optimize the initial connection weight and threshold value for obtaining optimum individual as LSTM based on ADE；And with training sample to LSTM Implement training, and then obtains optimum network.

5. the LSTM wheel hub single-item energy consumption prediction technique according to claim 1 based on increment cluster, which is characterized in that described Sample is increased in step (11) newly to the update method of LSTM prediction model are as follows:

1) for each newly-increased energy consumption sample, first which cluster c judgement belongs to_i, corresponding i-th of the LSTM model of i-th of cluster；Belonging to determination After model, model parameter is updated on the basis of model history data training, i.e., is come the parameter after historical data training just The strong explanatory variable of new samples is input in prediction model, obtains new samples by the forward calculation of LSTM by beginningization network Predicted valueBy predicted valueIt is added to the error of actual value y on original error function J (θ)；

In formula, parameter θ=(W_f,W_i,W_c,W_o,b_f,b_i,b_c,b_o)

θ '=(W_f-λ*Δw_f,…,W_o-λ*Δw_o,b_f-λ*Δb_f,…,b_o-λ*Δb_o)

In formula, λ is learning rate, Δ w_f、Δw_oWith Δ b_f、Δb_oBe respectively the weight of neuron and the gradient matrix of offset and to Amount.