CN114117891A - Rotary kiln energy consumption prediction method based on deep belief network - Google Patents

Rotary kiln energy consumption prediction method based on deep belief network Download PDF

Info

Publication number
CN114117891A
CN114117891A CN202111291054.4A CN202111291054A CN114117891A CN 114117891 A CN114117891 A CN 114117891A CN 202111291054 A CN202111291054 A CN 202111291054A CN 114117891 A CN114117891 A CN 114117891A
Authority
CN
China
Prior art keywords
time
layer
energy consumption
data
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111291054.4A
Other languages
Chinese (zh)
Inventor
周杭翱
秦岭
杨小健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Tech University
Original Assignee
Nanjing Tech University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Tech University filed Critical Nanjing Tech University
Priority to CN202111291054.4A priority Critical patent/CN114117891A/en
Publication of CN114117891A publication Critical patent/CN114117891A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2111/00Details relating to CAD techniques
    • G06F2111/10Numerical modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/06Power analysis or power optimisation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Geometry (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Computer Hardware Design (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Manufacturing & Machinery (AREA)
  • Primary Health Care (AREA)
  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention discloses a rotary kiln prediction method based on a deep belief network. And (3) directly training the multivariate data by adopting a DBN model of a nonlinear mapping function, learning the internal representation of the multivariate data, capturing the nonlinear correlation between the multivariate data and the internal representation of the multivariate data, and establishing a sliding window model of a time sequence input layer. On the basis, a multi-index energy consumption early warning model SW-DBN is established, and the multi-index energy consumption early warning model SW-DBN can learn the hierarchical structure of time-varying time delay and nonlinear characteristics. These features are useful for energy consumption prediction. And predicting the energy consumption of the rotary kiln in real time through a sliding window model and a multi-index energy consumption early warning model. For the cement production plan, the analysis of an energy consumption prediction system can be simulated, the most effective and cost-saving production parameters can be determined, the electricity and coal demand required in the operation process can be predicted in advance, the production efficiency is effectively improved, and the large error condition caused by only depending on the experience of workers in the past is reduced. Meanwhile, the method ensures that the rotary kiln can be in a stable running state for a long time, effectively improves the shutdown times of the rotary kiln, reduces the loss caused by shutdown maintenance, and can greatly increase the production benefits of enterprises.

Description

Rotary kiln energy consumption prediction method based on deep belief network
Technical Field
The invention relates to a prediction method based on a deep belief network in a neural network.
Background
The cement industry is dominated by electricity and fossil fuels and has the characteristic of high energy consumption. On a global scale, governments of various countries and public and private sectors are striving to seek ways to improve cement energy performance for energy safety, economic and environmental concerns. The effect of cement energy consumption prediction is discussed from the perspective of energy conservation and consumption reduction. On one hand, the power consumption and the coal consumption are key reference indexes of the energy consumption in the cement calcination process. Traditionally, the operator can only read the current energy consumption from the sensors, and cannot know the trend of the energy consumption. Generally, when the energy consumption of the calcination process is abnormal, the operator can adjust the production equipment by depending on experience. The operation has strong hysteresis, causes energy waste and is not beneficial to energy conservation and emission reduction in the production process. Therefore, operator adjustments to production equipment require urgent predictive information on energy consumption trends. On the other hand, for the cement production plan, the analysis of an energy consumption prediction system can be simulated, and the most effective and cost-saving production parameters can be determined. Accurate prediction of energy consumption provides important reference information for cement production planning and energy control. Therefore, the prediction of the energy consumption in the cement calcination process is of great significance. Because the cement calcination process is complicated and changeable, the relation among all variables is nonlinear. In addition, the raw materials are required to be completely calcined into clinker in about 50-60 minutes, so that delay time exists between the raw material feeding quantity data and the corresponding energy consumption data. While all other variables related to energy consumption also have time-lag characteristics. Due to different influences, the delay time of each variable is different and is dynamically changed. Thus, the calcination process has three main features: uncertainty, nonlinearity, and time-varying skew.
Traditionally, the data-driven models for solving nonlinear predictions are support vector machines, artificial neural networks, statistical regression, and the like. However, the variables associated with cement energy consumption are numerous and all delay times are too complex to calculate. In addition, the delay time during cement calcination is not consistent and is dynamically variable. It is difficult to calculate the delay time for all variables. Therefore, this method cannot achieve the best effect in the field of cement calcination. Frank et al teach how to predict time series using a sliding window technique that incorporates more information into the artificial neural network input layer. The time series are predicted using a sliding window technique that incorporates more information into the artificial neural network input layer. In the cement calcination process, not only the variables at the present time are related to energy consumption, but also the variables in the past time affect the trend of energy consumption. In this case, the use of sliding window technology to solve the cement calcination delay problem has certain advantages. In addition, in the energy optimization model, the electricity consumption prediction value and the coal consumption prediction value must be applied at the same time. If these energy consumption values are predicted separately, different time lengths may be required due to differences in data characteristics and complexity. In addition, the cement calcination process consumes both electricity and heat in the same step, most variables being related to electricity and coal consumption. Therefore, there is a coupling relationship between the used amount and the used amount. Therefore, the two indexes should be predicted synchronously in the same model. To our knowledge, most studies predicted only one of them. At present, no one can predict the electricity consumption and the coal consumption in the cement calcination process at the same time. Therefore, a multi-index energy consumption prediction model based on a sliding window deep confidence network is provided. Firstly, the model combines the past variable data and the current variable data into time series data through a sliding window method, so that the problem of complex time-varying and time-delaying research is avoided. And then establishing a deep confidence network capable of capturing nonlinear features among variables to learn time series information.
Disclosure of Invention
The invention aims to provide a prediction method aiming at the problem of energy consumption in the rotary kiln operation process.
In order to achieve the above object, the present invention employs a method based on a deep belief network, the method comprising the following steps:
the method comprises the following steps: and (3) directly training the multivariate data by adopting a DBN model of a nonlinear mapping function, learning the internal representation of the multivariate data, capturing nonlinear correlation between the multivariate data and the multivariate data, and finally establishing a sliding window model of a time sequence input layer.
Step two: and establishing a multi-index energy consumption early warning model on the basis of the time sequence of the step one. The SW-DBN is capable of learning a hierarchy of time-varying time-lags and non-linear features. These features are useful for energy consumption prediction.
Step three: and predicting the energy consumption of the rotary kiln in real time through a sliding window model and a multi-index energy consumption early warning model, and further feeding back and guiding the production process, wherein the core flow is shown as an attached figure 3, and an accurate prediction result is finally obtained through a training algorithm in the step one and a prediction algorithm in the step two.
2. The rotary kiln prediction method based on the deep belief network as claimed in claim 1, wherein the sliding window model of the time series input layer in the first step is divided into the following 2 steps:
the method comprises the following steps: each data sample of each time instance in the sliding window model of the time series input layer is independent, so the method ignores important time information inherent in the time series data. Therefore, this will result in a non-ideal prediction effect. To address this problem, a sliding window technique is applied to process each input variable. It can integrate data relating to the entire production cycle into the model at once. The operating conditions of the cement calcination process in normal operation are dynamically variable and the delay time of the variables is difficult to determine. All relevant information is contained in the time series as long as the variation range of the delay time does not exceed the size of the sliding window. Therefore, the SW-DBN model can still capture all the characteristics of the time-varying delay and obtain an accurate prediction result.
Step two: the input variable data being a matrix of values, xi(t) is the value of the first variable at the time instant. Time series data s obtained by sliding windowi(t) the following:
si(t)=[xi(t-ω+1),xi(t-ω+2),...xi(t)] (1)
t, where T is the number of sample data. The time series data of the input layer comprises time-varying delay characteristics between the energy consumption variable and each input variable. Therefore, the time series data will provide more time series features for the DBN model.
3. The deep belief network-based rotary kiln prediction method as claimed in claim 1, wherein the SW-DBN training process in step two is divided into the following 2 steps:
the method comprises the following steps: bottom-up unsupervised greedy, layer-by-layer pre-training in the first section, the SW-DBN is composed of several stacked RBMs, as shown in FIG. 1. Visible layer V0And adjacent hidden layer H1Form an RBM1,H1And h2Constitute RBM2. In the same way, only one RBM network is trained from the bottom layer each time, and then the next RBM network is trained by fixing the trained RBM parameters. All RMBs were trained using a contrast divergence algorithm. Figure 2 shows the structure of an RBM. An RBM is composed of n visible units v ═ v1,...vn]And m hidden units h ═ h1,...hn]Composition of wherein viAnd hjBinary states, a, of visible units i and hidden units j, respectivelyiAnd bjRespectively, are their deviations, wijAre real-valued weights between them. Each cell of the visible layer is connected to all cells of the hidden layer. There is no link between units on the same layer. By giving a visible layer, the activation probability of a hidden layer is obtained
Figure BSA0000256458940000041
The activation probability of the visible layer is found in the known hidden layer.
Figure BSA0000256458940000042
Where σ (x) is 1/1+ e-xIs an activation function.
Given training input data, training an RBM means training samples by adjusting the parameter θ (a, b, w) against the divergence algorithm. The formula is as follows:
Δwij=p(hj=1|v(0))vi (0)-p(hj=1|v(k))vi k (4)
Δaj=vi (0)-vi (k) (5)
Δbj=p(hj=1|v(0))-p(hj=1|v(k)) (6)
where k is the number of cycles typically 1. And training all RBMs layer by layer according to the method until the specified iteration times are reached, and finishing the bottom-up unsupervised layer-by-layer greedy training. Finally, initial parameters representing time-varying time lag and nonlinear characteristics are obtained in the pre-training process.
Step two: and according to the label of the output data Y, using a supervised backward fitting algorithm to carry out weight adjustment and error correction. Meanwhile, the feature hierarchy is associated with energy consumption data. As with most common training methods, the DBN is considered a traditional deep neural network, with gradient descent algorithms to update parameters. As the amount of data processed by the sliding window increases, the parameter size of the DBN becomes larger, and the computational complexity increases. Therefore, we replace the commonly used gradient descent method with Adam's algorithm. The Adam algorithm is well suited to handle large data and parameter problems. The method has good demonstration effect, and is superior to other SGD methods. Adam designs independent adaptive learning rates for different parameters by computing first and second moment estimates of the gradient. b1 and b2 are the exponential decay rates of the second moment estimates. f (w, b) whether the random objective function contains w and b parameters. At time step t, the gradient with respect to the random target is
Figure BSA0000256458940000051
Finally, under the weight adjustment value and the error correction of the fitting algorithm, the error range of the obtained predicted value and the actual data is greatly reduced.
The invention has the beneficial effects that: the energy consumption in the rotary kiln operation process is predicted by adopting a sliding window model and a supervised backward fitting algorithm, the electricity and coal demand required in the operation process is predicted in advance, preparation is made, the production efficiency is effectively improved, and the large error condition caused by only depending on the experience of workers in the past is reduced. Meanwhile, the method ensures that the rotary kiln can be in a stable running state for a long time, effectively improves the shutdown times of the rotary kiln, reduces the loss caused by shutdown maintenance, and can greatly increase the production benefit of enterprises.
Drawings
FIG. 1 is an architecture of a SW-DBN.
Fig. 2 is a structure of an RBM.
Fig. 3 is the overall flow of the algorithm.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
In order to achieve the above object, the present invention adopts a method based on a deep belief network, which comprises the following steps:
the method comprises the following steps: and (3) directly training the multivariate data by adopting a DBN model of a nonlinear mapping function, learning the internal representation of the multivariate data, capturing nonlinear correlation between the multivariate data and the multivariate data, and finally establishing a sliding window model of a time sequence input layer. Each data sample of each time instance in the sliding window model of the time series input layer is independent, so the method ignores important time information inherent in the time series data. Therefore, this will result in a non-ideal prediction effect. To address this problem, a sliding window technique is applied to process each input variable. It can integrate data relating to the entire production cycle into the model at once. The operating conditions of the cement calcination process during normal operation are dynamically variable, and the delay time of the variables is difficult to determine. All relevant information is contained as long as the variation range of the delay time does not exceed the size of the sliding windowIn a time series. Therefore, the SW-DBN model can still capture all the characteristics of the time-varying delay and obtain an accurate prediction result. The input variable data being a matrix of values, xi(t) is the value of the first variable at the time instant. Time series data s obtained by sliding windowi(t) the following:
si(t)=[xi(t-ω+1),xi(t-ω+2),...xi(t)] (1)
t, where T is the number of sample data. The time series data of the input layer comprises time-varying delay characteristics between the energy consumption variable and each input variable. Therefore, the time series data will provide more time series features for the DBN model.
Step two: and establishing a multi-index energy consumption early warning model on the basis of the time sequence of the step one. The SW-DBN is capable of learning a hierarchy of time-varying time-lags and non-linear features. These features are useful for energy consumption prediction. In the first section, the bottom up unsupervised greedy, layer-by-layer pre-training, the SW-DBN is composed of several stacked RBMs, as shown in FIG. 1. Visible layer V0And adjacent hidden layer H1Form an RBM1,H1And h2Form an RBM2. In the same way, only one RBM network is trained from the bottom layer each time, and then the next RBM network is trained by fixing the trained RBM parameters. All RMBs were trained using a contrast divergence algorithm. Figure 2 shows the structure of an RBM. An RBM is composed of n visible units v ═ v1,...vn]And m hidden units h ═ h1,...hn]Composition of wherein viAnd hjBinary states, a, of visible units i and hidden units j, respectivelyiAnd bjRespectively, their deviation, wijAre real-valued weights between them. Each cell of the visible layer is connected to all cells of the hidden layer. There is no link between units on the same layer. By giving a visible layer, the activation probability of a hidden layer is obtained
Figure BSA0000256458940000071
The activation probability of the visible layer is found in the known hidden layer.
Figure BSA0000256458940000072
Where σ (x) is 1/1+ e-xIs an activation function.
Given training input data, training an RBM means training samples by adjusting the parameter θ (a, b, w) against the divergence algorithm. The formula is as follows:
Δwij=p(hj=1|v(0))vi (0)-p(hj=1|v(k))vi k (4)
Δaj=vi (0)-vi (k) (5)
Δbj=p(hj=1|v(0))-p(hj=1|v(k)) (6)
where k is the number of cycles typically 1. And training all RBMs layer by layer according to the method until the specified iteration times are reached, and finishing the bottom-up unsupervised layer-by-layer greedy training. Finally, initial parameters representing time-varying time lag and nonlinear characteristics are obtained in the pre-training process. And (4) according to the label of the output data Y, using a supervised backward fitting algorithm to carry out weight adjustment and error correction. Meanwhile, the feature hierarchy is associated with energy consumption data. As with most common training methods, the DBN is considered a traditional deep neural network, with gradient descent algorithms to update parameters. As the amount of data processed by the sliding window increases, the parameter size of the DBN becomes larger, and the computational complexity increases. Therefore, we replace the commonly used gradient descent method with Adam's algorithm. The Adam algorithm is well suited to handle large data and parameter problems. The method has good demonstration effect, and is superior to other SGD methods. Adam designs independent adaptive learning rates for different parameters by computing first and second moment estimates of the gradient. b1 and b2 are the exponential decay rates of the second moment estimates. f (w, b) whether the random objective function contains w and b parameters. At time step t, the gradient with respect to the random target is
Figure BSA0000256458940000081
Finally, under the weight adjustment value and the error correction of the fitting algorithm, the error range of the obtained predicted value and the actual data is greatly reduced.
Step three: and predicting the energy consumption of the rotary kiln in real time through a sliding window model and a multi-index energy consumption early warning model, and further feeding back and guiding the production process, wherein the core flow is shown as an attached figure 3, and an accurate prediction result is finally obtained through a training algorithm in the step one and a prediction algorithm in the step two.

Claims (3)

1. A rotary kiln energy consumption prediction method based on a deep belief network is characterized by comprising the following steps:
the method comprises the following steps: directly training the multivariate data by adopting a DBN model of a nonlinear mapping function, learning the internal representation of the multivariate data, capturing the nonlinear correlation between the multivariate data and the internal representation of the multivariate data, and finally establishing a sliding window model of a time sequence input layer;
step two: establishing a multi-index energy consumption early warning model on the basis of the time sequence of the step one, wherein the SW-DBN can learn the hierarchical structure of time-varying time delay and nonlinear characteristics, and the characteristics are favorable for energy consumption prediction;
step three: and predicting the energy consumption of the rotary kiln in real time through a sliding window model and a multi-index energy consumption early warning model, and then feeding back and guiding the production process, wherein the core flow is shown as figure 3, and an accurate prediction result is finally obtained through a training algorithm in the step one and a prediction algorithm in the step two.
2. The deep belief network-based rotary kiln prediction method as claimed in claim 1, wherein the sliding window model of the time series input layer in the step one is divided into the following 2 steps:
the method comprises the following steps: in order to solve the problem that each data sample of each time instance in the sliding window model of the time sequence input layer is independent, so that the method ignores important time information inherent in time sequence data, the prediction effect is not ideal, a sliding window technology is applied to process each input variable, the data related to the whole production period can be integrated into the model at one time, the operating condition of the cement calcination process in normal work is dynamically changed, the delay time of the variable is difficult to determine, and all related information is contained in the time sequence as long as the variation range of the delay time does not exceed the size of the sliding window, so that the SW-DBN model can still capture all characteristics of time-varying delay and obtain an accurate prediction result;
step two: the input variable data being a matrix of values, xi(t) time series data s obtained by sliding window and the value of the first variable at timei(t) the following:
si(t)=[xi(t-ω+1),xi(t-ω+2),...xi(t)] (1)
t, where T is the number of sample data, and the time-series data of the input layer includes time-varying delay characteristics between the energy consumption variable and each input variable, so that the time-series data provides more time-series characteristics for the DBN model.
3. The deep confidence network-based rotary kiln prediction method as claimed in claim 1, wherein the SW-DBN training process in the second step is divided into the following 2 steps:
the method comprises the following steps: bottom-up unsupervised greedy layer-by-layer pretraining in the first section, the SW-DBN is composed of several stacked RBMs, as shown in FIG. 1, visible layer V0And adjacent hidden layer H1Form an RBM1,H1And h2Form an RBM2In the same way, only one RBM network is determined to be trained from the bottom layer each time, then the parameters of the trained RBM are fixed, the next RBM network is trained, all RMBs are trained by adopting a contrast divergence algorithm, and the attached figure 2 shows the RBM of one RBMStructure, one RBM is composed of n visible units v ═ v1,...vn]And m hidden units h ═ h1,...hn]Composition of wherein v1And hjBinary states, a, of visible units i and hidden units j, respectivelyiAnd bjRespectively, are their deviations, wijThe real value weight value between them, each unit of the visible layer is connected with all units of the hidden layer, there is no relation between units of the same layer, and the activation probability of the hidden layer is obtained by giving the visible layer
Figure FSA0000256458930000021
The activation probability of the visible layer is found in the known hidden layer,
Figure FSA0000256458930000022
where σ (x) is 1/1+ e-xIs the function of the activation of the function,
given training input data, training an RBM means training samples by comparing the divergence algorithm tuning parameter θ ═ a, b, w, as shown below:
Δwij=p(hj=1|v(0))vi (0)-p(hj=1|v(k))vi k (4)
Δaj=vi (0)-vi (k) (5)
Δbj=p(hj=1|v(0))-p(hj=1|v(k)) (6)
the number of cycles of k is usually 1, training all RBMs layer by layer according to the method until reaching the specified iteration times, completing unsupervised layer-by-layer greedy training from bottom to top, and finally obtaining initial parameters representing time-varying time lag and nonlinear characteristics in the pre-training process;
step two: according to the label of the output data Y, a supervised backward fitting algorithm is used for carrying out weight adjustment and error correction, meanwhile, the feature level is associated with energy consumption data, like most common training methods, a DBN is regarded as a traditional deep neural network, a gradient descent algorithm is adopted for updating parameters, as the data volume processed by a sliding window is increased, the parameter size of the DBN is increased, and the calculation complexity is increased, therefore, an Adam algorithm is used for replacing a common gradient descent method, the Adam algorithm is very suitable for processing the problem of large data and parameters, the method has a good empirical effect and is superior to other SGD methods, the Adam designs independent self-adaptive learning rates of different parameters by calculating first moment estimation and second moment estimation of the gradient, b1 and b2 are exponential attenuation rates of second moment estimation, and f (w, b) random objective function contains w and b parameters, at time step t, the gradient with respect to the random target is
Figure FSA0000256458930000031
Finally, under the weight adjustment value and the error correction of the fitting algorithm, the error range of the obtained predicted value and the actual data is greatly reduced.
CN202111291054.4A 2021-11-02 2021-11-02 Rotary kiln energy consumption prediction method based on deep belief network Pending CN114117891A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111291054.4A CN114117891A (en) 2021-11-02 2021-11-02 Rotary kiln energy consumption prediction method based on deep belief network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111291054.4A CN114117891A (en) 2021-11-02 2021-11-02 Rotary kiln energy consumption prediction method based on deep belief network

Publications (1)

Publication Number Publication Date
CN114117891A true CN114117891A (en) 2022-03-01

Family

ID=80380342

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111291054.4A Pending CN114117891A (en) 2021-11-02 2021-11-02 Rotary kiln energy consumption prediction method based on deep belief network

Country Status (1)

Country Link
CN (1) CN114117891A (en)

Similar Documents

Publication Publication Date Title
CN109685252B (en) Building energy consumption prediction method based on cyclic neural network and multi-task learning model
CN112990556A (en) User power consumption prediction method based on Prophet-LSTM model
CN113128113B (en) Lean information building load prediction method based on deep learning and transfer learning
CN110766212A (en) Ultra-short-term photovoltaic power prediction method for historical data missing electric field
CN111680786B (en) Time sequence prediction method based on improved weight gating unit
CN110598929B (en) Wind power nonparametric probability interval ultrashort term prediction method
CN112329990A (en) User power load prediction method based on LSTM-BP neural network
CN114757104B (en) Method for constructing hydraulic real-time regulation and control model of series gate group water transfer project
CN110097929A (en) A kind of blast furnace molten iron silicon content on-line prediction method
CN114119273A (en) Park comprehensive energy system non-invasive load decomposition method and system
CN115828768A (en) Data-driven power distribution network reactive power prediction method
CN112183877A (en) Photovoltaic power station fault intelligent diagnosis method based on transfer learning
Jia et al. Water quality prediction method based on LSTM-BP
CN105302976A (en) Aluminum electrolysis production optimization method based on SPEA2 algorithm
CN109408896B (en) Multi-element intelligent real-time monitoring method for anaerobic sewage treatment gas production
CN117291069A (en) LSTM sewage water quality prediction method based on improved DE and attention mechanism
CN117114438A (en) Building area energy system cold and hot load data driving prediction method with flexibility and interpretability
CN115481788B (en) Phase change energy storage system load prediction method and system
Juan et al. Multi-step prediction of dissolved oxygen in river based on random forest missing value imputation and attention mechanism coupled with recurrent neural network
CN114117891A (en) Rotary kiln energy consumption prediction method based on deep belief network
CN110543942A (en) Multi-space-time long and short memory depth network accurate prediction method
CN113112089B (en) Power consumption prediction method and prediction system for cement raw material grinding system
CN114861555A (en) Regional comprehensive energy system short-term load prediction method based on Copula theory
CN114372615A (en) Short-term photovoltaic power prediction method and system considering spatial correlation
CN113962454A (en) LSTM energy consumption prediction method based on dual feature selection and particle swarm optimization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication