WO2023082788A1 - Method and apparatus for predicting oxygen content in flue gas and load, method and apparatus for selecting prediction model, and method and apparatus for predicting flue gas emission - Google Patents

Method and apparatus for predicting oxygen content in flue gas and load, method and apparatus for selecting prediction model, and method and apparatus for predicting flue gas emission Download PDF

Info

Publication number
WO2023082788A1
WO2023082788A1 PCT/CN2022/116583 CN2022116583W WO2023082788A1 WO 2023082788 A1 WO2023082788 A1 WO 2023082788A1 CN 2022116583 W CN2022116583 W CN 2022116583W WO 2023082788 A1 WO2023082788 A1 WO 2023082788A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
equipment
participant
model
prediction
Prior art date
Application number
PCT/CN2022/116583
Other languages
French (fr)
Chinese (zh)
Inventor
刘胜伟
杨杰
余真鹏
Original Assignee
新智我来网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202111331379.0A external-priority patent/CN114118543A/en
Priority claimed from CN202111331191.6A external-priority patent/CN114118541A/en
Priority claimed from CN202111331186.5A external-priority patent/CN114118540A/en
Priority claimed from CN202111331355.5A external-priority patent/CN114118542A/en
Application filed by 新智我来网络科技有限公司 filed Critical 新智我来网络科技有限公司
Publication of WO2023082788A1 publication Critical patent/WO2023082788A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Definitions

  • the present disclosure relates to the field of comprehensive energy technology, and in particular to a flue gas oxygen content load prediction method, a prediction model selection method, a flue gas emission prediction method and a device.
  • thermal efficiency is an important indicator to measure gas-fired boilers.
  • the oxygen content of the boiler flue gas is often measured and maintained by zirconia measuring instrument, but the cost is high.
  • the embodiments of the present disclosure provide a method, device, computer equipment, and computer-readable storage medium for predicting the oxygen content load of flue gas based on joint learning, so as to solve the problem of the inability to improve the flue gas load of energy equipment in the prior art.
  • the accuracy of the oxygen load prediction results in a waste of resources.
  • the first aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction method, including:
  • the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets;
  • the oxygen content value of the flue gas of the target equipment is predicted.
  • the second aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction device, including:
  • a determination module used for the participants to respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the data sets of the target device;
  • the first training module is used to train the prediction neural network models of multiple sets of equipment according to the data sets of multiple sets of local equipment and the corresponding sample weights;
  • the aggregation module is used to upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models;
  • the second training module trains the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model
  • the prediction module is used to predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
  • the third aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction method, which is applied in a joint learning framework, including:
  • the equipment data of the first participant and the equipment data of the second participant under the federated learning architecture are the equipment data of the first participant and the equipment data of the second participant under the federated learning architecture; among them, the first participant is the participant who proposes the prediction demand, and the second participant is other participants except the first participant square;
  • the fourth aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction device, which is applied in a joint learning framework, including:
  • the acquisition module is used to acquire the equipment data of the first participant and the equipment data of the second participant under the joint learning architecture; wherein, the first participant is the participant who proposes the prediction demand, and the second participant is the participant other than the first participant parties other than the Party;
  • the first training module uses the equipment data of the first participant and the equipment data of the second participant to train the predictive classifier
  • a calculation module configured to determine weight data of the equipment data of the first participant with respect to the equipment data of the second participant according to the predictive classifier
  • the second training module is used to train the predictive gradient boosting model based on the equipment data and weight data of the second participant;
  • the prediction module is used to predict the flue gas oxygen content load of the equipment of the first participant by using the predicted gradient boosting model.
  • a method for selecting a flue gas oxygen content load prediction model including:
  • Preprocess the data in the training data set and the test data set of the prediction device Preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
  • the flue gas oxygen content load prediction model suitable for the prediction equipment is determined.
  • the sixth aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction model selection device, including:
  • the receiving module is used to receive training data sets and test data sets from prediction devices of participating parties based on the federated learning architecture
  • the preprocessing module is used to preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
  • a calculation module configured to calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group
  • the prediction module is used to determine the flue gas oxygen content load prediction model suitable for the prediction equipment according to the minimum evaluation index value.
  • the seventh aspect of the embodiments of the present disclosure provides a flue gas emission prediction method, including:
  • the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data prediction model and the target energy data prediction model are respectively obtained;
  • the test data prediction model and the target energy data prediction model calculate the first sample migration weight and the second sample migration weight, wherein the first sample migration weight is the local energy data for the target energy equipment
  • the sample migration weight of the energy data, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
  • test data and the second sample migration weight respectively train the local energy data network model and the test data network model
  • the flue gas emission of the target energy equipment is predicted.
  • the eighth aspect of the embodiments of the present disclosure provides a flue gas emission prediction device, including:
  • the first training module is used to train the local energy data measurement model according to the local energy data
  • the second training module is used to train the local energy data measurement model based on the joint learning framework according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model;
  • a calculation module configured to calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, wherein the first sample migration weight is the local energy
  • the sample migration weight of the data for the energy data of the target energy device, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
  • the third training module is used to train the local energy data network model and the test data network model respectively by using the local energy data and the first sample transfer weight, the test data and the second sample transfer weight;
  • the prediction module is used to predict the flue gas emission of the target energy equipment according to the joint learning prediction model.
  • a ninth aspect of the embodiments of the present disclosure provides a computer device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor implements the steps of the above method when executing the computer program.
  • a tenth aspect of the embodiments of the present disclosure provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the above method are implemented.
  • the beneficial effects of the embodiments of the present disclosure at least include: determining the sample weights corresponding to the datasets of multiple groups of local devices and the datasets of the target device through the participating parties; Sample weight, training to obtain the prediction neural network model of multiple groups of devices; upload the prediction neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated prediction neural network model; train the aggregated model according to preset training conditions predictive neural network model to obtain a joint forecasting model; according to the joint forecasting model and the sample weight corresponding to the target equipment data set, predict the oxygen content value of the flue gas of the target equipment.
  • the embodiments of the present disclosure solve the problem of waste of resources caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
  • FIG. 1 is a schematic diagram of a joint learning architecture according to an embodiment of the present disclosure
  • Fig. 2 is a flowchart of a flue gas oxygen content load prediction method provided by an embodiment of the present disclosure
  • Fig. 3 is a block diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure
  • Fig. 4 is a flow chart of another flue gas oxygen content load prediction method provided by an embodiment of the present disclosure.
  • Fig. 5 is a block diagram of another flue gas oxygen content load prediction device provided by an embodiment of the present disclosure.
  • Fig. 6 is a flow chart of a method for selecting a flue gas oxygen content load prediction model provided by an embodiment of the present disclosure
  • Fig. 7 is a block diagram of a flue gas oxygen content load prediction model selection device provided by an embodiment of the present disclosure.
  • Fig. 8 is a flow chart of a flue gas emission prediction method provided by an embodiment of the present disclosure.
  • Fig. 9 is a block diagram of a flue gas emission prediction device provided by an embodiment of the present disclosure.
  • Fig. 10 is a schematic diagram of a computer device provided by an embodiment of the present disclosure.
  • Joint learning refers to the comprehensive utilization of various AI (Artificial Intelligence, artificial intelligence) technologies on the premise of ensuring data security and user privacy, and joint multi-party cooperation to jointly mine the value of data, and to promote new intelligent business forms and models based on joint modeling.
  • Federated learning has at least the following characteristics:
  • Participating nodes control the weakly centralized joint training mode of their own data to ensure data privacy and security in the process of co-creating intelligence.
  • FIG. 1 is a schematic diagram of a joint learning architecture according to an embodiment of the present disclosure.
  • the architecture of joint learning may include a server (central node) 101 , a participant 102 , a participant 104 and a participant 104 .
  • a participant can be composed of one or more clients.
  • the basic model can be established by the server 101, and the server 101 sends the model to the participant 102, the participant 104 and the participant 104 with which a communication connection is established.
  • the basic model can also be uploaded to the server 101 after being created by any participant, and the server 101 sends the model to other participants that have established communication connections with it.
  • Participant 102, participant 104, and participant 104 build a model according to the downloaded basic structure and model parameters, use local data for model training, obtain updated model parameters, and encrypt and upload the updated model parameters to the server 101.
  • the server 101 aggregates the model parameters sent by the participant 102 , the participant 104 and the participant 104 to obtain global model parameters, and returns the global model parameters to the participant 102 , the participant 104 and the participant 104 .
  • the participant 102, the participant 104 and the participant 104 iterate their respective models according to the received global model parameters until the models finally converge, thereby realizing the training of the models.
  • the data uploaded by participant 102, participant 104, and participant 104 are model parameters, local data will not be uploaded to the server 101, and all participants can share the final model parameters, so data can be guaranteed Co-modeling is achieved on the basis of privacy.
  • Fig. 2 is a flow chart of a method for predicting the load of oxygen content in flue gas provided by an embodiment of the present disclosure.
  • the execution subject is set as a participant, and the participant can be a client or an independent server, which is collectively referred to as a participant here; the central node can be a cloud or an integrated server.
  • the flue gas oxygen content load prediction method includes:
  • the participant respectively determines sample weights corresponding to data sets of multiple groups of local devices and target device data sets.
  • the participants respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the target device data sets in the following ways:
  • Step 1 The participant selects the data sets of multiple sets of local devices and the data sets of the target device;
  • the data set can be steam boiler flue gas temperature, economizer outlet temperature, instantaneous value of flue gas flow, steam boiler gas temperature, steam boiler flue gas standard flow rate, steam boiler natural gas inlet pressure, steam boiler flue flow rate, steam boiler Condenser inlet smoke temperature, steam boiler exhaust gas temperature, steam boiler flue gas pressure, steam boiler condenser inlet pressure, steam boiler main steam instantaneous flow, steam boiler operating status, steam boiler natural gas inlet instantaneous flow, etc.
  • the limit is a steam boiler, but it can also be a gas furnace or other energy equipment.
  • Step 2 merging the data sets of the local multiple groups of devices and the data sets of the target device to obtain merged data
  • Step 3 using the merged data to train a kernel density estimation model
  • Step 4 According to the kernel density estimation model, respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the data sets of the target device.
  • uploading the prediction neural network models of multiple groups of local devices to the central node for model aggregation to obtain the aggregated prediction neural network model can be achieved in the following ways:
  • Step 1 Upload the prediction neural network models of multiple groups of local devices to the central node;
  • Step 2 Responding to the information fed back by the central node;
  • Step 3 receiving the aggregated prediction neural network model delivered by the central node.
  • the preset training condition may include a preset number of training times or a predicted convergence state value of model training, and the like.
  • training the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model can be achieved in the following manner:
  • Step 1 Responding to the aggregated prediction neural network model issued by the central node;
  • Step 2 Determine the preset training conditions
  • Step 3 Train the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model.
  • the prediction of the oxygen content value of the flue gas of the target equipment can be achieved in the following ways:
  • Step 1 Participants upload the joint prediction model to the central node for joint learning and training;
  • Step 2 Joint learning and training of the joint prediction model in response to feedback from the receiving engine
  • Step 3 Send the joint prediction model trained by joint learning to the target device;
  • Step 4 According to the joint prediction model and the sample weight corresponding to the target equipment data set, predict the oxygen content value of the flue gas of the target equipment.
  • Step 1 Participants use the data sets of multiple sets of local devices to establish training samples for predictive neural network models
  • Step 2 using the data set of the target device to establish a predictive neural network model test sample
  • Step 3 According to the prediction neural network model training sample and the model test sample, the sample prediction value is obtained;
  • Step 4 according to the norm of the error matrix of the sample predicted value and the sample expected value, the fitness value of the predicted neural network model is obtained;
  • Step 5 Update the particles in the population in the predictive neural network model according to the fitness value of the predictive neural network model to obtain an optimized predictive neural network model.
  • the optimization prediction neural network model is as follows:
  • each parameter is represented by 13bit binary code, and these parameters are spliced into a particle;
  • the parameters involved in the update of the particle algorithm are: speed, position, individual extremum, and group extremum of the population.
  • the speed and position update methods are shown in the following formulas (1) and (2).
  • X i (x i1 , x i2 , etc x iD ) represents a population particle with D dimension, and also represents a solution to the problem
  • V i (v i1 ,v i2 , etcv iD ) means the velocity of a population particle with D dimension
  • P g (p g1 ,p g2 ,...p gD ) means the population extremum with D dimension
  • w is the inertia weight
  • d 1, 2,...D
  • i 1,2,...n
  • k is the current iteration number
  • Vid is the speed of the particle
  • c1, c2 are non Negative constants, called acceleration factors
  • r1, r2 are random numbers distributed in [0,1].
  • (g) Perform optimal crossover on individuals in the population. Individual particles are updated by crossing with individual extremum particles.
  • the crossover method uses an integer crossover method. First, two crossover positions are selected, and then the individual and the individual extremum are crossed. The obtained new individuals adopt the strategy of retaining excellent individuals, and update the particles only when the fitness value of the new particle is greater than the fitness value of the old particle.
  • the mutation operation is performed on the particle operation in the population.
  • the mutation operation adopts the two-bit exchange method within the individual. First, the mutation positions pos1 and pos2 are randomly selected, and then the two mutation positions are exchanged. For the obtained new individuals, the strategy of retaining excellent individuals is adopted, and the particles are updated only when the fitness value of the new particles is better than that of the old particles.
  • a further example of a flue gas oxygen content load prediction method based on joint learning provided in this disclosure is as follows: There are three boiler data sets of boiler 1, boiler 2, and boiler 3, where boiler 1 and boiler 2 are local equipment, boiler 3 is the target equipment.
  • the data of multiple boilers can be used to improve the prediction accuracy and reduce the installation of boiler sensors, thereby reducing costs and resource waste.
  • the predictive neural network model can be trained using data from dataset A and sample weights; at the same time, at boiler 2, the predictive neural network model can be trained using data from dataset B and sample weights. Then upload the predictive neural network models trained by dataset A and dataset B to the central node for model aggregation.
  • the central node sends the aggregated model to Boiler 1 and Boiler 2
  • Boiler 1 and Boiler 2 use the aggregated model to train respectively to obtain a joint prediction model, and repeat this many times until the model is trained to Until it converges, upload the joint prediction model to the central node.
  • the central node sends the joint prediction model trained in the third step to boiler 3, and uses the joint prediction model to predict the oxygen content value of the flue gas at boiler 3.
  • the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets; according to the data sets of the local multiple groups of devices and the corresponding sample weights, multiple groups The predictive neural network model of the device; upload the predictive neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network model; train the aggregated predictive neural network model according to preset training conditions to A joint prediction model is obtained; according to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted.
  • the resource waste problem caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
  • Fig. 3 is a schematic diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure. As shown in Figure 3, the flue gas oxygen content load prediction device includes:
  • Determining module 301 used for participants to respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the data sets of the target device;
  • the first training module 302 is used to train the prediction neural network models of multiple sets of equipment according to the data sets of local multiple sets of equipment and the corresponding sample weights;
  • Aggregation module 303 for uploading the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models;
  • the second training module 304 trains the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model
  • the prediction module 305 is configured to predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
  • the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets; according to the data sets of the local multiple groups of devices and the corresponding sample weights, multiple groups The predictive neural network model of the device; upload the predictive neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network model; train the aggregated predictive neural network model according to preset training conditions to A joint prediction model is obtained; according to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted.
  • the resource waste problem caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
  • Fig. 4 is a flow chart of another method for predicting the load of oxygen content in flue gas provided by an embodiment of the present disclosure.
  • the method for predicting the load of flue gas oxygen content based on sample migration in FIG. 4 can be executed by the server in FIG. 1 .
  • the flue gas oxygen load prediction method includes:
  • the first participant is the participant who proposes the forecast demand
  • the second participant is other participants except the first participant.
  • the equipment data set from the first participant and the equipment data set from the second participant by receiving the equipment data set from the first participant and the equipment data set from the second participant; and then filtering the equipment data set of the first participant and the equipment data set of the second participant according to the preset screening features , to obtain the sample size of the equipment data of the first participant and the volume of equipment data of the second participant respectively; Device data for one party and device data for a second party.
  • the device data of the first participant and the device data of the second participant can be processed through tagging to obtain the tag data of the device data of the first participant and the tag data of the device data of the second participant;
  • the label data of the equipment data of one participant and the label data of the equipment data of the second participant to obtain the combined label data; finally, according to the combined label data, a predictive classifier is trained.
  • the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant can be respectively obtained; and then according to the equipment data of the first participant
  • the corresponding equipment failure probability value and the equipment failure probability value corresponding to the equipment data of the second participant determine the weight data of the equipment data of the first participant with respect to the equipment data of the second participant.
  • the predictive classifier for realizing using the predictive classifier to obtain the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant, respectively, by using the predictive classifier, respectively Classifying the equipment data of the first participant and the equipment data of the second participant to obtain equipment failure data corresponding to the equipment data of the first participant and equipment failure data corresponding to the equipment data of the second participant; then, respectively Calculate the equipment failure probability value corresponding to the equipment failure data corresponding to the equipment data of the first participant and the equipment failure data corresponding to the equipment data of the second participant.
  • the predictive gradient boosting model can be trained based on the equipment data of the second participant and the weight data of the equipment data of the first participant with respect to the equipment data of the second participant; Data training predictive gradient boosting model to obtain the predicted value of the equipment of the first participant; then according to the norm of the error matrix between the predicted value of the equipment of the first participant and the expected value of the equipment, the fitness value of the predictive gradient boosting model is obtained; Finally, according to the fitness value of the predictive gradient boosting model, the particles in the population in the predictive gradient boosting model are updated to obtain an optimized predictive gradient boosting model.
  • optimizing the predictive gradient boosting model can be achieved in the following ways:
  • the first participant is the participant who proposes the prediction demand, and the second participant
  • the party is other parties except the first party
  • use the equipment data of the first party and the equipment data of the second party to train the predictive classifier; according to the predictive classifier, determine that the equipment data of the first party
  • the weight data of the equipment data of the second participant based on the equipment data and weight data of the second participant, train the predictive gradient boosting model; use the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant.
  • Fig. 5 is a schematic diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure, which is applied in a joint learning framework.
  • the flue gas oxygen content load prediction device based on sample migration includes:
  • the acquisition module 501 is used to acquire the equipment data of the first participant and the equipment data of the second participant under the joint learning framework; wherein, the first participant is the participant who proposes the prediction demand, and the second participant is the a party other than the party;
  • the first training module 502 uses the device data of the first participant and the device data of the second participant to train a predictive classifier
  • a calculation module 503, configured to determine the weight data of the equipment data of the first participant with respect to the equipment data of the second participant according to the predictive classifier;
  • the second training module 504 is configured to train a predictive gradient boosting model based on the equipment data and weight data of the second participant;
  • the prediction module 505 is configured to use the predicted gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant.
  • the first participant is the participant who proposes the prediction demand
  • the second participant The party is other parties except the first party
  • use the equipment data of the first party and the equipment data of the second party to train the predictive classifier
  • determine that the equipment data of the first party The weight data of the equipment data of the second participant; based on the equipment data and weight data of the second participant, train the predictive gradient boosting model; use the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant.
  • Fig. 6 is a schematic flowchart of a method for selecting a flue gas oxygen content load prediction model provided by an embodiment of the present disclosure, and the method may be executed by the server in Fig. 1 .
  • the selection method of the flue gas oxygen content load prediction model includes:
  • the training data set may be different energy equipment models (for example, different boiler models), and the test set data may be flue gas oxygen content data and corresponding characteristic data of energy equipment under different processes.
  • the attributes of the prediction equipment it can be determined that the attributes of the prediction equipment correspond to the oxygen content data of the flue gas of the prediction equipment; then, the features of the oxygen content data of the flue gas of the prediction equipment are extracted; The characteristics of the oxygen content data are used to construct the training data set and test data set of the prediction equipment respectively.
  • S602. Perform preprocessing on the data in the training data set and the data in the testing data set for predicting the device, and obtain a preprocessed device data set.
  • the prediction model group can be composed of xgboost algorithm, SVR algorithm, neural network algorithm, belief network algorithm, decision tree algorithm, random forest regression algorithm, gradient boosting tree regression algorithm, linear regression algorithm, deep learning algorithm and other algorithms.
  • the present invention is not limited.
  • a prediction model group can be established according to the properties of the predicted device and each piece of data in the preprocessed device data set; then the root mean square error of each piece of data in the training set and the test set can be calculated using the prediction model group ; Furthermore, the root mean square error of each piece of data in the obtained training set and test set can be used as the evaluation index value of each piece of data in the preprocessed device data set.
  • the training data set of the prediction device can be used to train the algorithm in the prediction model group to obtain Prediction result; the algorithm in the prediction model group can be trained by using the test data set of the prediction device to obtain the test result; then, according to the prediction result and the test result, the root mean square of each piece of data in the training set and the test set can be obtained error.
  • each piece of data in the training set and test set for prediction calculates each piece of data in the training set and test set for prediction, get the root mean square error (set to rmse) of each piece of data, and use the root mean square error as the value of each piece of data in the preprocessed device data set Evaluate the index value, and then select the algorithm with the smallest rmse value as the corresponding algorithm label of the training set and test set (label each algorithm in the algorithm group, and the label value is 1, 2, 3, etc.)
  • rmse index uses the training set to train the algorithm given by the algorithm group, use the test set to test the prediction results obtained by the training algorithm, and use the prediction results and the test set to obtain the rmse index.
  • the calculation formula of rmse index is as follows:
  • n ⁇ 1 is the label value
  • y i is the training set, is the test set
  • i ⁇ 1 is the corresponding data set number.
  • the evaluation index value of each piece of data in the preprocessed equipment data set can be sorted from small to large; and according to the sorting result, the smallest evaluation index value is selected; and then the corresponding prediction model in the prediction model group is called tag value;
  • the prediction model corresponding to the label value corresponding to the prediction model is a flue gas oxygen content load prediction model suitable for the prediction equipment.
  • the classification algorithm can be used to cluster the training data set of the prediction device to obtain at least two types of training cluster data; then, by calling the classification Classify the data of at least two types of training clusters; then according to the classified data of at least two types of training clusters, train at least two classifiers corresponding to at least two types of training clusters; Cluster prediction to obtain at least one class from at least two training clusters. According to at least one category in the at least two training clusters, determine the classifier corresponding to at least one category in the at least two training clusters; finally, determine the corresponding prediction model in the prediction model group according to the category label value corresponding to the classifier tag value.
  • the binary algorithm can be used to cluster the training set data to obtain the number of clusters K categories (K is a constant), and then use the classification algorithm to cluster the K categories to obtain the corresponding data in the K categories, and select a category
  • K is a constant
  • the gradient boosting regression tree classifies K types of data respectively, and the corresponding label of the training data (given by step S203) has K types of data, so K classifiers are trained.
  • the clustering prediction operation is performed on the test set data.
  • the prediction result is a certain category in the 1-K clustering, and then the corresponding classifier is used to perform the classification operation to obtain an output result, which corresponds to the algorithm group.
  • a predictive model and then use this algorithm to predict the data, and then judge the correctness of the predictive model.
  • the training data set and the test data set of the prediction device from the participant are received; the data in the training data set and the test data set of the prediction device are preprocessed , and obtain the preprocessed equipment data set; according to the establishment of the prediction model group, calculate the evaluation index value of each piece of data in the preprocessed equipment data set; according to the minimum evaluation index value, determine the flue gas oxygen content suitable for predicting the equipment load forecasting model.
  • the prediction of the oxygen content of the flue gas of the energy equipment and reduce the measurement cost of the existing technology.
  • Fig. 7 is a schematic diagram of a flue gas oxygen content load prediction model selection device provided by an embodiment of the present disclosure. As shown in Figure 7, the device includes:
  • the receiving module 701 is configured to receive training data sets and test data sets from prediction devices of participating parties based on the joint learning architecture;
  • a preprocessing module 702 configured to preprocess the data in the training data set and the test data set of the prediction device, and obtain a preprocessed device data set;
  • the calculation module 703 is used to calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group;
  • the prediction module 704 is configured to determine a flue gas oxygen content load prediction model suitable for the prediction device according to the smallest value of the evaluation index.
  • the training data set and the test data set of the prediction device from the participant are received; the data in the training data set and the test data set of the prediction device are preprocessed , and obtain the preprocessed equipment data set; according to the establishment of the prediction model group, calculate the evaluation index value of each piece of data in the preprocessed equipment data set; according to the minimum evaluation index value, determine the flue gas oxygen content suitable for predicting the equipment load forecasting model.
  • the prediction of the oxygen content of the flue gas of the energy equipment and reduce the measurement cost of the existing technology.
  • FIG. 8 is a schematic flowchart of a smoke emission prediction method provided by an embodiment of the present disclosure, and the smoke emission prediction method based on joint learning can be executed by the participants in Fig. 1 . As shown in Figure 8, the method includes:
  • local energy data can refer to the flue gas temperature, flue gas flow rate, and equipment inlet pressure of local equipment, such as steam boiler flue gas temperature, economizer outlet temperature, instantaneous value of flue gas flow, steam boiler gas temperature, steam boiler Flue gas standard flow rate, steam boiler natural gas inlet pressure, steam boiler flue gas flow rate, steam boiler condenser inlet flue temperature, steam boiler exhaust gas temperature, steam boiler flue gas pressure, steam boiler condenser inlet pressure, steam boiler main steam instantaneous Flow rate, operating status of steam boiler, instantaneous flow rate of natural gas inlet of steam boiler, etc.
  • equipment inlet pressure of local equipment such as steam boiler flue gas temperature, economizer outlet temperature, instantaneous value of flue gas flow, steam boiler gas temperature, steam boiler Flue gas standard flow rate, steam boiler natural gas inlet pressure, steam boiler flue gas flow rate, steam boiler condenser inlet flue temperature, steam boiler exhaust gas temperature, steam boiler flue gas pressure, steam boiler condenser inlet pressure
  • the following methods can also be used to organize or filter the local energy data, test data and energy data of the target energy equipment: first, select the sample data set; Labeling the data in the data set, and obtaining the label data corresponding to the data in the sample data set; respectively determining the label data corresponding to the local energy data, test data, and energy data of the target energy equipment.
  • test data may be selected from local energy data, or may be energy data extracted from other related equipment; the energy data of the target energy equipment may be the energy data of the equipment to be predicted.
  • the participants send the local energy data prediction model to the central node; in response to the feedback information of the central node, train the local energy data according to the test data and the energy data of the target energy equipment
  • the test model is used to obtain the test data prediction model and the target energy data prediction model respectively.
  • the participants send the local energy data prediction model, test data prediction model and target energy data prediction model to the central node; the central node can After the local energy data prediction model, test data prediction model and target energy data prediction model are sorted or adjusted, they are sent to relevant participants; then, in response to the feedback information from the central node, based on the local energy data prediction model, test data prediction
  • the model and the target energy data prediction model perform target classification on the local energy data, test data and energy data of the target energy equipment respectively; finally, calculate the first sample migration weight and the second sample migration weight according to the target classification.
  • test data and the second sample transfer weight respectively train the local energy data network model and the test data network model.
  • the test data network model can be trained according to the expected data set.
  • the local energy data network model and the test data network model can be trained in parallel, or one of the two models can be trained first, which is not limited in this disclosure.
  • the local energy data network model and the test data network model can be uploaded to the central node; the central node performs aggregation training on the local energy data network model and the test data network model. Then, after receiving the joint learning prediction model from the central node, the local energy data network model and the test data network model are aggregated and trained.
  • the joint learning prediction model can be optimized according to the prediction conditions; wherein the prediction conditions include: the prediction value of the model parameters and the judgment of the fitness of the model parameters.
  • the local energy data measurement model is trained according to the local energy data; based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data predictions are respectively obtained model and target energy data prediction model; based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, the first sample migration weight and the second sample migration weight are calculated, wherein the first sample migration weight is the local
  • the energy data is aimed at the sample migration weight of the energy data of the target energy device, and the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device; using the local energy data and the first sample migration weight, the test data and the second sample migration weight Two-sample migration weights to train the local energy data network model and the test data network model respectively; receive the joint learning prediction model after the aggregation training of the local energy data network model and the test data network model from the central node; according to the joint learning prediction model,
  • Fig. 9 is a schematic diagram of a flue gas emission prediction device provided by an embodiment of the present disclosure. As shown in Figure 9, the flue gas emission prediction device includes:
  • the first training module 901 is used to train the local energy data measurement model according to the local energy data
  • the second training module 902 is used to train the local energy data measurement model based on the joint learning framework according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model;
  • Calculation module 903 configured to calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, wherein the first sample migration weight is the local energy data For the sample migration weight of the energy data of the target energy device, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
  • the third training module 904 is used to train the local energy data network model and the test data network model respectively by using the local energy data and the first sample transfer weight, test data and the second sample transfer weight;
  • Establishment module 905 used to receive the joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
  • the prediction module 906 is used to predict the smoke emission of the target energy equipment according to the joint learning prediction model.
  • the local energy data measurement model is trained according to the local energy data; based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data predictions are respectively obtained model and target energy data prediction model; based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, the first sample migration weight and the second sample migration weight are calculated, wherein the first sample migration weight is the local
  • the energy data is aimed at the sample migration weight of the energy data of the target energy device, and the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device; using the local energy data and the first sample migration weight, the test data and the second sample migration weight Two-sample migration weights to train the local energy data network model and the test data network model respectively; receive the joint learning prediction model after the aggregation training of the local energy data network model and the test data network model from the central node; according to the joint learning prediction model,
  • FIG. 10 is a schematic diagram of a computer device 10 provided by an embodiment of the present disclosure.
  • the computer device 10 of this embodiment includes: a processor 1001 , a memory 1002 , and a computer program 1003 stored in the memory 1002 and capable of running on the processor 1001 .
  • the processor 1001 executes the computer program 1003
  • the steps in the foregoing method embodiments are implemented.
  • the processor 1001 executes the computer program 1003 the functions of the modules/units in the foregoing device embodiments are implemented.
  • the computer program 1003 can be divided into one or more modules/units, and one or more modules/units are stored in the memory 1002 and executed by the processor 1001 to complete the present disclosure.
  • One or more modules/units may be a series of computer program instruction segments capable of accomplishing specific functions, and the instruction segments are used to describe the execution process of the computer program 1003 in the computer device 10 .
  • the computer device 10 may be a computer device such as a desktop computer, a notebook, a palmtop computer, or a cloud server.
  • the computer device 10 may include, but is not limited to, a processor 1001 and a memory 1002 .
  • FIG. 4 is only an example of the computer device 10, and does not constitute a limitation to the computer device 10. It may include more or less components than those shown in the illustration, or combine certain components, or different components. , for example, computer equipment may also include input and output equipment, network access equipment, bus, and so on.
  • the processor 1001 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • a general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
  • the memory 1002 may be an internal storage unit of the computer device 10 , for example, a hard disk or a memory of the computer device 10 .
  • the memory 1002 can also be an external storage device of the computer device 10, for example, a plug-in hard disk equipped on the computer device 10, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash memory card ( Flash Card), etc.
  • the storage 1002 may also include both an internal storage unit of the computer device 10 and an external storage device.
  • the memory 1002 is used to store computer programs and other programs and data required by the computer equipment.
  • the memory 1002 can also be used to temporarily store data that has been output or will be output.
  • the disclosed apparatus/computer equipment and methods may be implemented in other ways.
  • the device/computer device embodiments described above are only illustrative, for example, the division of modules or units is only a logical function division, and there may be other division methods in actual implementation, and multiple units or components can be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
  • a unit described as a separate component may or may not be physically separated, and a component displayed as a unit may or may not be a physical unit, that is, it may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present disclosure may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • an integrated module/unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the present disclosure realizes all or part of the processes in the methods of the above embodiments, and can also be completed by instructing related hardware through computer programs.
  • the computer programs can be stored in computer-readable storage media, and the computer programs can be processed. When executed by the controller, the steps in the above-mentioned method embodiments can be realized.
  • a computer program may include computer program code, which may be in source code form, object code form, executable file, or some intermediate form or the like.
  • the computer-readable medium may include: any entity or device capable of carrying computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (Read-Only Memory, ROM), random access Memory (Random Access Memory, RAM), electrical carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in computer readable media may be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction. For example, in some jurisdictions, computer readable media may not Including electrical carrier signals and telecommunication signals.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Provided are a method and apparatus for predicting oxygen content in flue gas and a load, a method and apparatus for selecting a prediction model, and a method and apparatus for predicting flue gas emission. The method for predicting oxygen content in flue gas and a load comprises: by means of a participant, respectively determining data sets of a plurality of groups of local devices and a sample weight which corresponds to a data set of a target device; on the basis of the data sets of the plurality of groups of local devices and corresponding sample weights, obtaining, by means of training, prediction neural network models of the plurality of groups of devices; uploading the prediction neural network models of the plurality of groups of local devices to a central node for model aggregation, so as to obtain an aggregated prediction neural network model; training the aggregated prediction neural network model on the basis of a preset training condition, so as to obtain a joint prediction model; and predicting the value of oxygen content in flue gas of the target device on the basis of the joint prediction model and the sample weight corresponding to the data set of the target device.

Description

烟气含氧量负荷预测方法、预测模型选择方法、烟气排放预测方法及装置Flue gas oxygen content load prediction method, prediction model selection method, flue gas emission prediction method and device 技术领域technical field
本公开涉及综合能源技术领域,尤其涉及一种烟气含氧量负荷预测方法、预测模型选择方法、烟气排放预测方法及装置。The present disclosure relates to the field of comprehensive energy technology, and in particular to a flue gas oxygen content load prediction method, a prediction model selection method, a flue gas emission prediction method and a device.
背景技术Background technique
随着综合能源的广泛应用,热效率是衡量燃气锅炉的重要指标。一般通过控制锅炉烟气含氧量为最优设计值以实现不同能源设备工作下热效率最高。烟气含氧量常用氧化锆测量仪测量和维护,但费用较高。With the wide application of comprehensive energy, thermal efficiency is an important indicator to measure gas-fired boilers. Generally, by controlling the oxygen content of the boiler flue gas to the optimal design value, the highest thermal efficiency under different energy equipment can be achieved. The oxygen content of flue gas is often measured and maintained by zirconia measuring instrument, but the cost is high.
例如,在分布式能源领域,小型燃气锅炉为了节约成本普遍放弃安装氧化锆测量仪,导致无法实现闭环控制和热效率最优运行,特别是当燃气热值不稳定的情况下,热效率会大部分被牺牲掉。即不同锅炉的数据分布是不一样的,这样很大程度上影响了数据的预测精度,从而锅炉烟气含氧量负荷预测就不会准确,这样给应用能源设备的企业或工厂带来了很大的经济损失,所以目前急需解决此问题。For example, in the field of distributed energy, small gas-fired boilers generally abandon the installation of zirconia measuring instruments in order to save costs, resulting in the inability to achieve closed-loop control and optimal operation of thermal efficiency, especially when the calorific value of gas is unstable, thermal efficiency will be mostly affected Sacrifice. That is, the data distribution of different boilers is not the same, which greatly affects the prediction accuracy of the data, so that the prediction of the oxygen content load of the boiler flue gas will not be accurate, which brings a lot of trouble to the enterprises or factories that use energy equipment. Big economic loss, so urgently need to solve this problem at present.
发明内容Contents of the invention
有鉴于此,本公开实施例提供了一种基于联合学习的烟气含氧量负荷预测方法、装置、计算机设备及计算机可读存储介质,以解决现有技术中因无法提高能源设备的烟气含氧量负荷预测的准确性的而造成资源浪费问题。In view of this, the embodiments of the present disclosure provide a method, device, computer equipment, and computer-readable storage medium for predicting the oxygen content load of flue gas based on joint learning, so as to solve the problem of the inability to improve the flue gas load of energy equipment in the prior art. The accuracy of the oxygen load prediction results in a waste of resources.
本公开实施例的第一方面,提供了一种烟气含氧量负荷预测方法,包括:The first aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction method, including:
参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;The participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets;
根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;According to the data sets of multiple sets of local equipment and the corresponding sample weights, train the prediction neural network model of multiple sets of equipment;
将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;Upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network models;
根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;Train the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model;
根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。According to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted.
本公开实施例的第二方面,提供了一种烟气含氧量负荷预测装置,包括:The second aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction device, including:
确定模块,用于参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;A determination module, used for the participants to respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the data sets of the target device;
第一训练模块,用于根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;The first training module is used to train the prediction neural network models of multiple sets of equipment according to the data sets of multiple sets of local equipment and the corresponding sample weights;
聚合模块,用于将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;The aggregation module is used to upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models;
第二训练模块,根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;The second training module trains the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model;
预测模块,用于根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。The prediction module is used to predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
本公开实施例的第三方面,提供了一种烟气含氧量负荷预测方法,应用于联合学习框架中,包括:The third aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction method, which is applied in a joint learning framework, including:
获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;Obtain the equipment data of the first participant and the equipment data of the second participant under the federated learning architecture; among them, the first participant is the participant who proposes the prediction demand, and the second participant is other participants except the first participant square;
利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;using the device data of the first party and the device data of the second party to train a predictive classifier;
根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;determining weight data of the device data of the first party with respect to the device data of the second party based on the predictive classifier;
基于第二参与方的设备数据和权重数据,训练预测式梯度提升模型;Train a predictive gradient boosting model based on the device data and weight data of the second participant;
利用预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。Using a predictive gradient boosting model to predict flue gas oxygen loads for first-party equipment.
本公开实施例的第四方面,提供了一种烟气含氧量负荷预测装置,应用于联合学习框架中,包括:The fourth aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction device, which is applied in a joint learning framework, including:
获取模块,用于获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;The acquisition module is used to acquire the equipment data of the first participant and the equipment data of the second participant under the joint learning architecture; wherein, the first participant is the participant who proposes the prediction demand, and the second participant is the participant other than the first participant parties other than the Party;
第一训练模块,利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;The first training module uses the equipment data of the first participant and the equipment data of the second participant to train the predictive classifier;
计算模块,用于根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;A calculation module, configured to determine weight data of the equipment data of the first participant with respect to the equipment data of the second participant according to the predictive classifier;
第二训练模块,用于基于第二参与方的设备数据和权重数据,训练预测式梯度提升模型;The second training module is used to train the predictive gradient boosting model based on the equipment data and weight data of the second participant;
预测模块,用于利所预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。The prediction module is used to predict the flue gas oxygen content load of the equipment of the first participant by using the predicted gradient boosting model.
本公开实施例的第五方面,提供了一种烟气含氧量负荷预测模型选择方法,包括:According to the fifth aspect of the embodiments of the present disclosure, a method for selecting a flue gas oxygen content load prediction model is provided, including:
基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;Based on the federated learning architecture, receive the training data set and test data set from the prediction equipment of the participating parties;
对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;Preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;Calculate the evaluation index value of each piece of data in the preprocessed equipment data set according to the established prediction model group;
根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。According to the minimum evaluation index value, the flue gas oxygen content load prediction model suitable for the prediction equipment is determined.
本公开实施例的第六方面,提供了一种烟气含氧量负荷预测模型选择装置,包括:The sixth aspect of the embodiments of the present disclosure provides a flue gas oxygen content load prediction model selection device, including:
接收模块,用于基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;The receiving module is used to receive training data sets and test data sets from prediction devices of participating parties based on the federated learning architecture;
预处理模块,用于对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;The preprocessing module is used to preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
计算模块,用于根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;A calculation module, configured to calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group;
预测模块,用于根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。The prediction module is used to determine the flue gas oxygen content load prediction model suitable for the prediction equipment according to the minimum evaluation index value.
本公开实施例的第七方面,提供了一种烟气排放预测方法,包括:The seventh aspect of the embodiments of the present disclosure provides a flue gas emission prediction method, including:
根据本地能源数据,训练本地能源数据测模型;According to the local energy data, train the local energy data measurement model;
基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;Based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data prediction model and the target energy data prediction model are respectively obtained;
基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,所述第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,所述第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;Based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, calculate the first sample migration weight and the second sample migration weight, wherein the first sample migration weight is the local energy data for the target energy equipment The sample migration weight of the energy data, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;Using the local energy data and the first sample migration weight, the test data and the second sample migration weight, respectively train the local energy data network model and the test data network model;
接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;Receive the joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
根据联合学习预测模型对目标能源设备进行烟气排放的预测。According to the joint learning prediction model, the flue gas emission of the target energy equipment is predicted.
本公开实施例的第八方面,提供了一种烟气排放预测装置,包括:The eighth aspect of the embodiments of the present disclosure provides a flue gas emission prediction device, including:
第一训练模块,用于根据本地能源数据,训练本地能源数据测模型;The first training module is used to train the local energy data measurement model according to the local energy data;
第二训练模块,用于基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;The second training module is used to train the local energy data measurement model based on the joint learning framework according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model;
计算模块,用于基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,所述第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,所述第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;A calculation module, configured to calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, wherein the first sample migration weight is the local energy The sample migration weight of the data for the energy data of the target energy device, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
第三训练模块,用于利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;The third training module is used to train the local energy data network model and the test data network model respectively by using the local energy data and the first sample transfer weight, the test data and the second sample transfer weight;
建立模块,用于接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;Establishing a module for receiving a joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
预测模块,用于根据联合学习预测模型对目标能源设备进行烟气排放的预测。The prediction module is used to predict the flue gas emission of the target energy equipment according to the joint learning prediction model.
本公开实施例的第九方面,提供了一种计算机设备,包括存储器、处理器以及存储在存储器中并且可以在处理器上运行的计算机程序,该处理器执行计算机程序时实现上述方法的步骤。A ninth aspect of the embodiments of the present disclosure provides a computer device, including a memory, a processor, and a computer program stored in the memory and operable on the processor, and the processor implements the steps of the above method when executing the computer program.
本公开实施例的第十方面,提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,该计算机程序被处理器执行时实现上述方法的步骤。A tenth aspect of the embodiments of the present disclosure provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the above method are implemented.
本公开实施例与现有技术相比存在的有益效果至少包括:通过参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。本公开实施例解决了现有技术中因无法提高能源设备的烟气含氧量负荷预测的准确性的而造成资源浪费问题。Compared with the prior art, the beneficial effects of the embodiments of the present disclosure at least include: determining the sample weights corresponding to the datasets of multiple groups of local devices and the datasets of the target device through the participating parties; Sample weight, training to obtain the prediction neural network model of multiple groups of devices; upload the prediction neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated prediction neural network model; train the aggregated model according to preset training conditions predictive neural network model to obtain a joint forecasting model; according to the joint forecasting model and the sample weight corresponding to the target equipment data set, predict the oxygen content value of the flue gas of the target equipment. The embodiments of the present disclosure solve the problem of waste of resources caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
附图说明Description of drawings
为了更清楚地说明本公开实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本公开的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present disclosure, the following will briefly introduce the drawings that need to be used in the embodiments or the description of the prior art. Obviously, the drawings in the following description are only of the present disclosure For some embodiments, those skilled in the art can also obtain other drawings based on these drawings without creative efforts.
图1是本公开实施例的一种联合学习的架构示意图;FIG. 1 is a schematic diagram of a joint learning architecture according to an embodiment of the present disclosure;
图2是本公开实施例提供的一种烟气含氧量负荷预测方法的流程图;Fig. 2 is a flowchart of a flue gas oxygen content load prediction method provided by an embodiment of the present disclosure;
图3是本公开实施例提供的一种烟气含氧量负荷预测装置的框图;Fig. 3 is a block diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure;
图4是本公开实施例提供的另一种烟气含氧量负荷预测方法的流程图;Fig. 4 is a flow chart of another flue gas oxygen content load prediction method provided by an embodiment of the present disclosure;
图5是本公开实施例提供的另一种烟气含氧量负荷预测装置的框图;Fig. 5 is a block diagram of another flue gas oxygen content load prediction device provided by an embodiment of the present disclosure;
图6是本公开实施例提供的一种烟气含氧量负荷预测模型选择方法的流程图;Fig. 6 is a flow chart of a method for selecting a flue gas oxygen content load prediction model provided by an embodiment of the present disclosure;
图7是本公开实施例提供的一种烟气含氧量负荷预测模型选择装置的框图;Fig. 7 is a block diagram of a flue gas oxygen content load prediction model selection device provided by an embodiment of the present disclosure;
图8是本公开实施例提供的一种烟气排放预测方法的流程图;Fig. 8 is a flow chart of a flue gas emission prediction method provided by an embodiment of the present disclosure;
图9是本公开实施例提供的一种烟气排放预测装置的框图;Fig. 9 is a block diagram of a flue gas emission prediction device provided by an embodiment of the present disclosure;
图10是本公开实施例提供的一种计算机设备的示意图。Fig. 10 is a schematic diagram of a computer device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本公开实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本公开。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本公开的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as specific system structures and techniques are presented for a thorough understanding of the embodiments of the present disclosure. It will be apparent, however, to one skilled in the art that the present disclosure may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present disclosure with unnecessary detail.
联合学习是指在确保数据安全及用户隐私的前提下,综合利用多种AI(Artificial  Intelligence,人工智能)技术,联合多方合作共同挖掘数据价值,催生基于联合建模的新的智能业态和模式。联合学习至少具有以下特点:Joint learning refers to the comprehensive utilization of various AI (Artificial Intelligence, artificial intelligence) technologies on the premise of ensuring data security and user privacy, and joint multi-party cooperation to jointly mine the value of data, and to promote new intelligent business forms and models based on joint modeling. Federated learning has at least the following characteristics:
(1)参与节点控制自有数据的弱中心化联合训练模式,确保共创智能过程中的数据隐私安全。(1) Participating nodes control the weakly centralized joint training mode of their own data to ensure data privacy and security in the process of co-creating intelligence.
(2)在不同应用场景下,利用筛选和/或组合AI算法、隐私保护计算,建立多种模型聚合优化策略,以获取高层次、高质量的模型。(2) In different application scenarios, use screening and/or combining AI algorithms and privacy-preserving calculations to establish multiple model aggregation optimization strategies to obtain high-level, high-quality models.
(3)在确保数据安全及用户隐私的前提下,基于多种模型聚合优化策略,获取提升联合学习的效能方法,其中效能方法可以是通过解决包括计算架构并行、大规模跨域网络下的信息交互、智能感知、异常处理机制等,提升联合学习的整体效能。(3) On the premise of ensuring data security and user privacy, based on a variety of model aggregation optimization strategies, obtain an efficiency method to improve joint learning, where the efficiency method can be solved by solving information including computing architecture parallelism and large-scale cross-domain network Interaction, intelligent perception, exception handling mechanism, etc., to improve the overall efficiency of joint learning.
(4)获取各场景下多方用户的需求,通过互信机制,确定合理评估各联合参与方的真实贡献度,进行分配激励。(4) Obtain the needs of multi-party users in each scenario, determine and reasonably evaluate the true contribution of each joint participant through the mutual trust mechanism, and distribute incentives.
基于上述方式,可以建立基于联合学习的AI技术生态,充分发挥行业数据价值,推动垂直领域的场景落地。Based on the above methods, it is possible to establish an AI technology ecology based on joint learning, give full play to the value of industry data, and promote the implementation of scenarios in vertical fields.
下面将结合附图详细说明根据本公开实施例的一种基于联合学习的烟气排放预测方法和装置。A method and device for predicting flue gas emission based on joint learning according to an embodiment of the present disclosure will be described in detail below with reference to the accompanying drawings.
图1是本公开实施例的一种联合学习的架构示意图。如图1所示,联合学习的架构可以包括服务器(中心节点)101以及参与方102、参与方104和参与方104。参与方可以是一个或多个客户端组成。FIG. 1 is a schematic diagram of a joint learning architecture according to an embodiment of the present disclosure. As shown in FIG. 1 , the architecture of joint learning may include a server (central node) 101 , a participant 102 , a participant 104 and a participant 104 . A participant can be composed of one or more clients.
在联合学习过程中,基本模型可以通过服务器101建立,服务器101将该模型发送至与其建立通信连接的参与方102、参与方104和参与方104。基本模型还可以是任一参与方建立后上传至服务器101,服务器101将该模型发送至与其建立通信连接的其他参与方。参与方102、参与方104和参与方104根据下载的基本结构和模型参数构建模型,利用本地数据进行模型训练,获得更新的模型参数,并将更新的模型参数加密上传至服务器101。服务器101对参与方102、参与方104和参与方104发送的模型参数进行聚合,获得全局模型参数,并将全局模型参数传回至参与方102、参与方104和参与方104。参与方102、参与方104和参与方104根据接收的全局模型参数对各自的模型进行迭代,直到模型最终收敛,从而实现对模型的训练。在联合学习过程中,参与方102、参与方104和参与方104上传的数据为模型参数,本地数据并不会上传至服务器101,且所有参与方可以共享最终的模型参数,因此可以在保证数据隐私的基础上实现共同建模。In the joint learning process, the basic model can be established by the server 101, and the server 101 sends the model to the participant 102, the participant 104 and the participant 104 with which a communication connection is established. The basic model can also be uploaded to the server 101 after being created by any participant, and the server 101 sends the model to other participants that have established communication connections with it. Participant 102, participant 104, and participant 104 build a model according to the downloaded basic structure and model parameters, use local data for model training, obtain updated model parameters, and encrypt and upload the updated model parameters to the server 101. The server 101 aggregates the model parameters sent by the participant 102 , the participant 104 and the participant 104 to obtain global model parameters, and returns the global model parameters to the participant 102 , the participant 104 and the participant 104 . The participant 102, the participant 104 and the participant 104 iterate their respective models according to the received global model parameters until the models finally converge, thereby realizing the training of the models. In the joint learning process, the data uploaded by participant 102, participant 104, and participant 104 are model parameters, local data will not be uploaded to the server 101, and all participants can share the final model parameters, so data can be guaranteed Co-modeling is achieved on the basis of privacy.
需要说明的是,参与方的数量不限于如上所述的三个,而是可以根据需要进行设置,本公开实施例对此不作限制。It should be noted that the number of participants is not limited to the above three, but can be set according to needs, which is not limited in this embodiment of the present disclosure.
图2是本公开实施例提供的一种烟气含氧量负荷预测方法的流程图。其中,图2中执行主体设置为参与方,参与方可以是客户端或独立服务器,在此统一简称为参与方;中心节点可以是云端、集成服务器。如图2所示,该烟气含氧量负荷预测方法包括:Fig. 2 is a flow chart of a method for predicting the load of oxygen content in flue gas provided by an embodiment of the present disclosure. Wherein, in Fig. 2, the execution subject is set as a participant, and the participant can be a client or an independent server, which is collectively referred to as a participant here; the central node can be a cloud or an integrated server. As shown in Figure 2, the flue gas oxygen content load prediction method includes:
S201,参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重。S201, the participant respectively determines sample weights corresponding to data sets of multiple groups of local devices and target device data sets.
具体地,参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重可以通过下述方式实现:Specifically, the participants respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the target device data sets in the following ways:
步骤一、参与方选取本地多组设备的数据集与目标设备的数据集;Step 1. The participant selects the data sets of multiple sets of local devices and the data sets of the target device;
其中,数据集可以是蒸汽锅炉烟气温度,节能器出口温度,烟气流量瞬时值,蒸汽锅炉燃气温度,蒸汽锅炉烟气标况流量,蒸汽锅炉天然气入口压力,蒸汽锅炉烟气流速,蒸汽锅炉冷凝器进口烟温,蒸汽锅炉排烟温度,蒸汽锅炉烟气压力,蒸汽锅炉冷凝器进口压力,蒸汽锅炉主蒸汽瞬时流量,蒸汽锅炉运行状态,蒸汽锅炉天然气入口瞬时流量等,对此本公开不限定是蒸汽锅炉,还可以是燃气炉或其他能源设备。Among them, the data set can be steam boiler flue gas temperature, economizer outlet temperature, instantaneous value of flue gas flow, steam boiler gas temperature, steam boiler flue gas standard flow rate, steam boiler natural gas inlet pressure, steam boiler flue flow rate, steam boiler Condenser inlet smoke temperature, steam boiler exhaust gas temperature, steam boiler flue gas pressure, steam boiler condenser inlet pressure, steam boiler main steam instantaneous flow, steam boiler operating status, steam boiler natural gas inlet instantaneous flow, etc. The limit is a steam boiler, but it can also be a gas furnace or other energy equipment.
步骤二、合并本地多组的设备的数据集与目标设备的数据集,以得到合并数据;Step 2, merging the data sets of the local multiple groups of devices and the data sets of the target device to obtain merged data;
步骤三、利用所述合并数据训练核密度估计模型;Step 3, using the merged data to train a kernel density estimation model;
步骤四、根据核密度估计模型,分别确定本地多组的设备的数据集和目标设备的数据 集对应的样本权重。Step 4. According to the kernel density estimation model, respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the data sets of the target device.
S202,根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型。S202. According to the data sets of multiple sets of local equipment and the corresponding sample weights, train the prediction neural network models of multiple sets of equipment.
S203,将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型。S203. Upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models.
具体地,将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型可以通过下述方式实现:Specifically, uploading the prediction neural network models of multiple groups of local devices to the central node for model aggregation to obtain the aggregated prediction neural network model can be achieved in the following ways:
步骤一、将本地多组设备的预测神经网络模型上传到中心节点;Step 1. Upload the prediction neural network models of multiple groups of local devices to the central node;
步骤二、响应于中心节点反馈的信息;Step 2. Responding to the information fed back by the central node;
步骤三、接收中心节点下发的聚合后的预测神经网络模型。Step 3, receiving the aggregated prediction neural network model delivered by the central node.
S204,根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型。S204. Train the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model.
其中,预设训练条件可以包括预设训练次数或预计模型训练的收敛状态值等。Wherein, the preset training condition may include a preset number of training times or a predicted convergence state value of model training, and the like.
具体地,根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型可以通过下述方式实现:Specifically, training the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model can be achieved in the following manner:
步骤一、响应于中心节点下发的聚合后的预测神经网络模型;Step 1. Responding to the aggregated prediction neural network model issued by the central node;
步骤二、确定预设训练条件;Step 2. Determine the preset training conditions;
步骤三、根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型。Step 3: Train the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model.
S205,根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。S205. Predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
具体地,根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值可以通过下述方式实现:Specifically, according to the joint prediction model and the sample weight corresponding to the target equipment data set, the prediction of the oxygen content value of the flue gas of the target equipment can be achieved in the following ways:
步骤一、参与方将联合预测模型上传至中心节点进行联合学习训练;Step 1. Participants upload the joint prediction model to the central node for joint learning and training;
步骤二、响应与中收引擎反馈的对联合预测模型的联合学习训练;Step 2: Joint learning and training of the joint prediction model in response to feedback from the receiving engine;
步骤三、将经过联合学习训练的联合预测模型发送至目标设备;Step 3. Send the joint prediction model trained by joint learning to the target device;
步骤四、根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。Step 4. According to the joint prediction model and the sample weight corresponding to the target equipment data set, predict the oxygen content value of the flue gas of the target equipment.
进一步地,本公开还针对预测神经网络模型的优化给出相关实施例:Further, the present disclosure also provides relevant embodiments for the optimization of the predictive neural network model:
步骤一、参与方利用本地多组设备的数据集建立预测神经网络模型训练样本;Step 1. Participants use the data sets of multiple sets of local devices to establish training samples for predictive neural network models;
步骤二、利用目标设备的数据集建立预测神经网络模型测试样本;Step 2, using the data set of the target device to establish a predictive neural network model test sample;
步骤三、根据预测神经网络模型训练样本和模型测试样本,得到样本预测值;Step 3. According to the prediction neural network model training sample and the model test sample, the sample prediction value is obtained;
步骤四、根据样本预测值与样本期望值的误差矩阵的范数,得到预测神经网络模型的适应度值;Step 4, according to the norm of the error matrix of the sample predicted value and the sample expected value, the fitness value of the predicted neural network model is obtained;
步骤五、根据预测神经网络模型的适应度值对预测神经网络模型中的种群中的粒子进行更新,以得到优化的预测神经网络模型。Step 5: Update the particles in the population in the predictive neural network model according to the fitness value of the predictive neural network model to obtain an optimized predictive neural network model.
具体地,可进一步通过下述实施例说明:Specifically, it can be further illustrated by the following examples:
优化预测神经网络模型如下:The optimization prediction neural network model is as follows:
(a)对预测神经网络模型的学习参数进行编码,得到初始粒子种群,编码规则:每个参数用13bit二进制码表示,并将这些参数拼接成一个粒子;(a) Coding the learning parameters of the predictive neural network model to obtain the initial particle population, coding rules: each parameter is represented by 13bit binary code, and these parameters are spliced into a particle;
(b)解码得到预测神经网络模型的参数,并将参数赋值给预测神经网络模型模型;(b) decoding to obtain the parameters of the predictive neural network model, and assigning the parameters to the predictive neural network model;
(c)使用训练样本训练预测神经网络模型模型;(c) training a predictive neural network model using training samples;
(d)使用测试样本测试预测神经网络模型模型,得到样本预测值;(d) using the test sample to test the predictive neural network model to obtain the sample predicted value;
(e)选择预测样本的预测值与期望值的误差矩阵的范数作为适应度值;(e) Select the norm of the error matrix of the predicted value of the predicted sample and the expected value as the fitness value;
(f)对种群中的粒子进行更新。(f) Update the particles in the population.
粒子算法更新涉及到的参数为:速度,位置,个体极值,种群的群体极值。速度和位置更新方式为如下公式(1)和(2)所示,同时为了防止粒子的盲目搜索,一般建议将其位置和速度限制在区间[-Xmax,Xmax],[-Vmax,Vmax]中。The parameters involved in the update of the particle algorithm are: speed, position, individual extremum, and group extremum of the population. The speed and position update methods are shown in the following formulas (1) and (2). At the same time, in order to prevent the blind search of particles, it is generally recommended to limit their position and speed to the interval [-Xmax, Xmax], [-Vmax, Vmax] .
Figure PCTCN2022116583-appb-000001
Figure PCTCN2022116583-appb-000001
Figure PCTCN2022116583-appb-000002
Figure PCTCN2022116583-appb-000002
变量说明:Variable description:
X i=(x i1,x i2,........x iD)表示含有D维的一个种群粒子,亦代表问题的一个解 X i =(x i1 , x i2 ,..... x iD ) represents a population particle with D dimension, and also represents a solution to the problem
V i=(v i1,v i2,.....v iD)表示含有D维的一个种群粒子的速度 V i =(v i1 ,v i2 ,.....v iD ) means the velocity of a population particle with D dimension
P i=(p i1,p i2,.......p iD)表示含有D维的个体粒子极值 P i =(p i1 ,p i2 ,.......p iD ) means the extremum value of individual particles with D dimension
P g=(p g1,p g2,......p gD)表示含有D维的群体极值 P g =(p g1 ,p g2 ,...p gD ) means the population extremum with D dimension
w为惯性权重,d=1,2,......D,i=1,2,.......n,k为当前迭代次数,Vid为粒子的速度,c1,c2是非负的常数,称为加速度因子,r1,r2是分布在[0,1]的随机数。w is the inertia weight, d=1, 2,...D, i=1,2,...n, k is the current iteration number, Vid is the speed of the particle, c1, c2 are non Negative constants, called acceleration factors, r1, r2 are random numbers distributed in [0,1].
(g)对种群中个体进行最优交叉,个体粒子通过和个体极值粒子进行交叉更新,交叉方法采用整数交叉法,首先选定两个交叉位置,然后把个体和个体极值进行交叉,对得到的新个体采用了保留优秀个体策略,只有当新粒子适应度值大于旧粒子适应度值时才更新粒子。(g) Perform optimal crossover on individuals in the population. Individual particles are updated by crossing with individual extremum particles. The crossover method uses an integer crossover method. First, two crossover positions are selected, and then the individual and the individual extremum are crossed. The obtained new individuals adopt the strategy of retaining excellent individuals, and update the particles only when the fitness value of the new particle is greater than the fitness value of the old particle.
(h)对种群进行最优交叉,此步骤和第八操作相似,只是将个体极值换成群体极值。(h) Perform optimal crossover on the population. This step is similar to the eighth operation, except that the individual extremum is replaced by the group extremum.
(i)对种群中的粒子操作进行变异操作,变异操作采用个体内部两位互换方法,首先随机选择变异位置pos1和pos2,然后把两个变异位置互换。对得到的新个体采用了保留优秀个体策略,只有当新粒子适应度值好于旧粒子时才更新粒子。(i) The mutation operation is performed on the particle operation in the population. The mutation operation adopts the two-bit exchange method within the individual. First, the mutation positions pos1 and pos2 are randomly selected, and then the two mutation positions are exchanged. For the obtained new individuals, the strategy of retaining excellent individuals is adopted, and the particles are updated only when the fitness value of the new particles is better than that of the old particles.
(j)得到新种群。(j) Get a new population.
(k)判断是否满足终止条件,或者达到最大迭代次数,或者满足小于限定的误差。(k) Judging whether the termination condition is satisfied, or the maximum number of iterations is reached, or an error smaller than the limit is satisfied.
(l)如果不满足条件,则转到第三步,否则,对粒子群进行解码,得到最佳预测神经网络模型网络的初始参数。(l) If the condition is not satisfied, go to the third step, otherwise, decode the particle swarm to obtain the initial parameters of the optimal prediction neural network model network.
针对本公开所提供的一种基于联合学习的烟气含氧量负荷预测方法进一步举例如下:设有锅炉1,锅炉2,锅炉3三台锅炉的数据集,其中,锅炉1和锅炉2为本地设备,锅炉3为目标设备。A further example of a flue gas oxygen content load prediction method based on joint learning provided in this disclosure is as follows: There are three boiler data sets of boiler 1, boiler 2, and boiler 3, where boiler 1 and boiler 2 are local equipment, boiler 3 is the target equipment.
首先,选取锅炉1,锅炉2,锅炉3三台锅炉的数据,分别得到本地设备数据集A和本地设备数据集B(下述分别简称,数据集A、数据集B),以及目标设备数据集C(下述简称数据集C)。First, select the data of boiler 1, boiler 2, and boiler 3 to obtain local equipment data set A and local equipment data set B (hereinafter referred to as data set A and data set B respectively), and target equipment data set C (hereinafter referred to as data set C).
然后,对数据集A,数据集B,数据集C处的数据进行合并,用合并的数据去训练一个KDE(核密度估计模型)模型,将数据集A的数据输入到KDE模型中去,得到锅炉1数据集A的样本权重,将锅炉2数据集B数据输入到KDE模型中去,得到B的样本权重。Then, merge the data at dataset A, dataset B, and dataset C, use the merged data to train a KDE (kernel density estimation model) model, input the data of dataset A into the KDE model, and get The sample weight of boiler 1 data set A, input the data of boiler 2 data set B into the KDE model, and obtain the sample weight of B.
具体地,在一个泛能站内,会有很多不同型号不同工艺的锅炉数据,此时可以利用多台锅炉的数据去提高预测的精度,实现减少锅炉传感器的安装,从而实现降低成本减少资源浪费。Specifically, in a multi-energy station, there will be many boiler data of different models and different processes. At this time, the data of multiple boilers can be used to improve the prediction accuracy and reduce the installation of boiler sensors, thereby reducing costs and resource waste.
第三,可以在锅炉1处,利用数据集A的数据和样本权重训练预测神经网络模型;同时可以在锅炉2处,利用数据集B的数据和样本权重训练预测神经网络模型。然后将数据集A与数据集B训练的预测神经网络模型都上传到中心节点处进行模型聚合。当中心节点再将聚合后的模型在下发到锅炉1和锅炉2处时,锅炉1和锅炉2分别再用聚合的模型去训练,以得到联合预测模型,如此重复多次,直到将模型训练到收敛为止,再将联合预测模型上传至中心节点。Thirdly, at boiler 1, the predictive neural network model can be trained using data from dataset A and sample weights; at the same time, at boiler 2, the predictive neural network model can be trained using data from dataset B and sample weights. Then upload the predictive neural network models trained by dataset A and dataset B to the central node for model aggregation. When the central node sends the aggregated model to Boiler 1 and Boiler 2, Boiler 1 and Boiler 2 use the aggregated model to train respectively to obtain a joint prediction model, and repeat this many times until the model is trained to Until it converges, upload the joint prediction model to the central node.
第四、中心节点将第三步中训练好的联合预测模型下发到锅炉3处,用联合预测模型去预测锅炉3处的烟气含氧量值。Fourth, the central node sends the joint prediction model trained in the third step to boiler 3, and uses the joint prediction model to predict the oxygen content value of the flue gas at boiler 3.
根据本公开实施例提供的技术方案,通过参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;将本地多组设备的预测神经网络模型上传到中心节点进行模型 聚合,以得到聚合后的预测神经网络模型;根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。以解决现有技术中因无法提高能源设备的烟气含氧量负荷预测的准确性的而造成资源浪费问题。According to the technical solution provided by the embodiments of the present disclosure, the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets; according to the data sets of the local multiple groups of devices and the corresponding sample weights, multiple groups The predictive neural network model of the device; upload the predictive neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network model; train the aggregated predictive neural network model according to preset training conditions to A joint prediction model is obtained; according to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted. In order to solve the resource waste problem caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
图3是本公开实施例提供的一种烟气含氧量负荷预测装置的示意图。如图3所示,该烟气含氧量负荷预测装置包括:Fig. 3 is a schematic diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure. As shown in Figure 3, the flue gas oxygen content load prediction device includes:
确定模块301,用于参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;Determining module 301, used for participants to respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the data sets of the target device;
第一训练模块302,用于根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;The first training module 302 is used to train the prediction neural network models of multiple sets of equipment according to the data sets of local multiple sets of equipment and the corresponding sample weights;
聚合模块303,用于将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;Aggregation module 303, for uploading the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models;
第二训练模块304,根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;The second training module 304 trains the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model;
预测模块305,用于根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。The prediction module 305 is configured to predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
根据本公开实施例提供的技术方案,通过参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。以解决现有技术中因无法提高能源设备的烟气含氧量负荷预测的准确性的而造成资源浪费问题。According to the technical solution provided by the embodiments of the present disclosure, the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets; according to the data sets of the local multiple groups of devices and the corresponding sample weights, multiple groups The predictive neural network model of the device; upload the predictive neural network model of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network model; train the aggregated predictive neural network model according to preset training conditions to A joint prediction model is obtained; according to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted. In order to solve the resource waste problem caused by the inability to improve the accuracy of load prediction of flue gas oxygen content of energy equipment in the prior art.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the sequence numbers of the steps in the above embodiments do not mean the order of execution, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
图4是本公开实施例提供的另一种烟气含氧量负荷预测方法的流程图。图4的基于样本迁移的烟气含氧量负荷预测方法可以由图1的服务器执行。如图4所示,该烟气含氧量负荷预测方法包括:Fig. 4 is a flow chart of another method for predicting the load of oxygen content in flue gas provided by an embodiment of the present disclosure. The method for predicting the load of flue gas oxygen content based on sample migration in FIG. 4 can be executed by the server in FIG. 1 . As shown in Figure 4, the flue gas oxygen load prediction method includes:
S401,获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据。S401. Acquire device data of the first participant and device data of the second participant under the joint learning architecture.
其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方。Among them, the first participant is the participant who proposes the forecast demand, and the second participant is other participants except the first participant.
具体地,可以通过接收来自第一参与方的设备数据集与第二参与方的设备数据集;再根据预设筛选特征,筛选第一参与方的设备数据集与第二参与方的设备数据集,以分别得到第一参与方的设备数据本样量和第二参与方的设备数据量;进而将第一参与方的设备数据本样量和第二参与方的设备数据量,分别确定为第一参与方的设备数据和第二参与方的设备数据。Specifically, by receiving the equipment data set from the first participant and the equipment data set from the second participant; and then filtering the equipment data set of the first participant and the equipment data set of the second participant according to the preset screening features , to obtain the sample size of the equipment data of the first participant and the volume of equipment data of the second participant respectively; Device data for one party and device data for a second party.
S402,利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器。S402. Using the device data of the first participant and the device data of the second participant, train a prediction classifier.
具体地,可以通过标签化处理第一参与方的设备数据和第二参与方的设备数据,得到第一参与方的设备数据的标签数据和第二参与方的设备数据的标签数据;进而合并第一参与方的设备数据的标签数据和第二参与方的设备数据的标签数据,以得到合并后的标签数据;最后,根据合并后的标签数据,训练预测分类器。Specifically, the device data of the first participant and the device data of the second participant can be processed through tagging to obtain the tag data of the device data of the first participant and the tag data of the device data of the second participant; The label data of the equipment data of one participant and the label data of the equipment data of the second participant to obtain the combined label data; finally, according to the combined label data, a predictive classifier is trained.
S403,根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据。S403. Determine, according to the prediction classifier, weight data of the equipment data of the first participant with respect to the equipment data of the second participant.
具体地,可以通过利用预测分类器,分别得到第一参与方的设备数据对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值;然后可以根据第一参与方的设备数据 对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据。Specifically, by using the predictive classifier, the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant can be respectively obtained; and then according to the equipment data of the first participant The corresponding equipment failure probability value and the equipment failure probability value corresponding to the equipment data of the second participant determine the weight data of the equipment data of the first participant with respect to the equipment data of the second participant.
进一步地,对于实现利用所述预测分类器,分别得到第一参与方的设备数据对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值,可以通过利用预测分类器,分别对第一参与方的设备数据和第二参与方的设备数据进行分类,以得到第一参与方的设备数据对应的设备故障数据和第二参与方的设备数据对应的设备故障数据;然后,分别计算第一参与方的设备数据对应的设备故障数据和第二参与方的设备数据对应的设备故障数据对应设备故障概率值。Further, for realizing using the predictive classifier to obtain the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant, respectively, by using the predictive classifier, respectively Classifying the equipment data of the first participant and the equipment data of the second participant to obtain equipment failure data corresponding to the equipment data of the first participant and equipment failure data corresponding to the equipment data of the second participant; then, respectively Calculate the equipment failure probability value corresponding to the equipment failure data corresponding to the equipment data of the first participant and the equipment failure data corresponding to the equipment data of the second participant.
S404,基于第二参与方的设备数据和所述权重数据,训练预测式梯度提升模型。S404. Train a predictive gradient boosting model based on the device data of the second participant and the weight data.
具体地,可以基于第二参与方的设备数据,和第一参与方的设备数据关于第二参与方的设备数据的权重数据,训练预测式梯度提升模型;然后,利用获取第一参与方的测试数据训练预测式梯度提升模型,以得到第一参与方的设备预测值;再根据第一参与方的设备预测值与设备期望值的误差矩阵的范数,得到预测式梯度提升模型的适应度值;最后,根据预测式梯度提升模型的适应度值,对预测式梯度提升模型中的种群中的粒子进行更新,以得到优化的预测式梯度提升模型。Specifically, the predictive gradient boosting model can be trained based on the equipment data of the second participant and the weight data of the equipment data of the first participant with respect to the equipment data of the second participant; Data training predictive gradient boosting model to obtain the predicted value of the equipment of the first participant; then according to the norm of the error matrix between the predicted value of the equipment of the first participant and the expected value of the equipment, the fitness value of the predictive gradient boosting model is obtained; Finally, according to the fitness value of the predictive gradient boosting model, the particles in the population in the predictive gradient boosting model are updated to obtain an optimized predictive gradient boosting model.
进一步地,对于优化预测式梯度提升模型可通过下述方式实现:Further, optimizing the predictive gradient boosting model can be achieved in the following ways:
首先,确定预测式梯度提升模型中的种群和所述种群中粒子;First, determine the population in the predictive gradient boosting model and the particles in the population;
然后,判断当前的所述种群中粒所对应的适应度值是否大于之前的旧粒子的适应度值;如果小于则需要更新所述预测式梯度提升模型中的种群和种群中粒子。Then, it is judged whether the fitness value corresponding to the current particle in the population is greater than the fitness value of the previous old particle; if it is smaller, it is necessary to update the population and the particle in the population in the predictive gradient boosting model.
S405,利用预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。S405. Use the predictive gradient boosting model to predict the oxygen content load of the flue gas of the equipment of the first participant.
根据本公开实施例提供的技术方案,通过获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;基于第二参与方的设备数据和权重数据,训练预测式梯度提升模型;利用预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。以解决现有技术中因不同工艺下的所产生的能源设备的数据分布差异,而造成烟气含氧量负荷预测的不准确性的问题,并且节约能源设备传感器的成本。According to the technical solution provided by the embodiments of the present disclosure, by obtaining the device data of the first participant and the device data of the second participant under the joint learning architecture; wherein, the first participant is the participant who proposes the prediction demand, and the second participant The party is other parties except the first party; use the equipment data of the first party and the equipment data of the second party to train the predictive classifier; according to the predictive classifier, determine that the equipment data of the first party The weight data of the equipment data of the second participant; based on the equipment data and weight data of the second participant, train the predictive gradient boosting model; use the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant. In order to solve the problem of inaccuracy in flue gas oxygen content load prediction due to the difference in data distribution of energy equipment generated under different processes in the prior art, and save the cost of energy equipment sensors.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
图5是本公开实施例提供的一种烟气含氧量负荷预测装置的示意图,应用于联合学习框架中。如图5所示,该基于样本迁移的烟气含氧量负荷预测装置包括:Fig. 5 is a schematic diagram of a flue gas oxygen content load prediction device provided by an embodiment of the present disclosure, which is applied in a joint learning framework. As shown in Figure 5, the flue gas oxygen content load prediction device based on sample migration includes:
获取模块501,用于获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;The acquisition module 501 is used to acquire the equipment data of the first participant and the equipment data of the second participant under the joint learning framework; wherein, the first participant is the participant who proposes the prediction demand, and the second participant is the a party other than the party;
第一训练模块502,利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;The first training module 502 uses the device data of the first participant and the device data of the second participant to train a predictive classifier;
计算模块503,用于根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;A calculation module 503, configured to determine the weight data of the equipment data of the first participant with respect to the equipment data of the second participant according to the predictive classifier;
第二训练模块504,用于基于第二参与方的设备数据和权重数据,训练预测式梯度提升模型;The second training module 504 is configured to train a predictive gradient boosting model based on the equipment data and weight data of the second participant;
预测模块505,用于利所预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。The prediction module 505 is configured to use the predicted gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant.
根据本公开实施例提供的技术方案,通过获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;利用第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;根据预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数 据;基于第二参与方的设备数据和权重数据,训练预测式梯度提升模型;利用预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。以解决现有技术中因不同工艺下的所产生的能源设备的数据分布差异,而造成烟气含氧量负荷预测不准确的问题,并且节约能源设备传感器的成本。According to the technical solution provided by the embodiments of the present disclosure, by obtaining the device data of the first participant and the device data of the second participant under the joint learning architecture; wherein, the first participant is the participant who proposes the prediction demand, and the second participant The party is other parties except the first party; use the equipment data of the first party and the equipment data of the second party to train the predictive classifier; according to the predictive classifier, determine that the equipment data of the first party The weight data of the equipment data of the second participant; based on the equipment data and weight data of the second participant, train the predictive gradient boosting model; use the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant. In order to solve the problem of inaccurate prediction of the oxygen content load in the flue gas caused by the difference in the data distribution of the energy equipment generated under different processes in the prior art, and save the cost of the energy equipment sensor.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the sequence numbers of the steps in the above embodiments do not mean the order of execution, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
图6是本公开实施例提供的一种烟气含氧量负荷预测模型选择方法的流程示意图,该方法可以由图1的服务器执行。如图6所示,该烟气含氧量负荷预测模型选择方法包括:Fig. 6 is a schematic flowchart of a method for selecting a flue gas oxygen content load prediction model provided by an embodiment of the present disclosure, and the method may be executed by the server in Fig. 1 . As shown in Figure 6, the selection method of the flue gas oxygen content load prediction model includes:
S601,基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集。S601. Based on the federated learning architecture, receive a training data set and a testing data set from a prediction device of a participant.
其中,训练数据集可以是不同能源设备的型号(例如不同的锅炉型号)测试集数据可以是不同工艺下的能源设备的烟气含氧量数据以及其对应的特征数据。Wherein, the training data set may be different energy equipment models (for example, different boiler models), and the test set data may be flue gas oxygen content data and corresponding characteristic data of energy equipment under different processes.
具体地,可以通过根据预测设备的属性,确定预测设备的属性对应预测设备的烟气含氧量数据;然后,提取预测设备的烟气含氧量数据的特征;进而,利用预测设备的烟气含氧量数据的特征,分别组建预测设备的训练数据集和测试数据集。Specifically, according to the attributes of the prediction equipment, it can be determined that the attributes of the prediction equipment correspond to the oxygen content data of the flue gas of the prediction equipment; then, the features of the oxygen content data of the flue gas of the prediction equipment are extracted; The characteristics of the oxygen content data are used to construct the training data set and test data set of the prediction equipment respectively.
S602,对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集。S602. Perform preprocessing on the data in the training data set and the data in the testing data set for predicting the device, and obtain a preprocessed device data set.
具体地,可以通过判断预测设备的训练数据集和测试数据集中的数据是否存在异常;如果存在异常,对预测设备的训练数据集和测试数据集中的数据进行异常处理。然后,对异常处理后的预测设备的训练数据集中的数据和测试数据集中的数据,进行数据归一化处理。Specifically, it may be determined whether the data in the training data set and the test data set of the prediction device are abnormal; if there is an exception, the data in the training data set and the test data set of the prediction device are abnormally processed. Then, data normalization processing is performed on the data in the training data set and the data in the testing data set of the prediction device after the exception processing.
S603,根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值。S603. Calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group.
其中,预测模型组可以由选取xgboost算法,SVR算法,神经网络算法,置信网络算法,决策树算法,随机森林回归算法,梯度提升树回归算法,线性回归算法,深度学习算法等算法组成,对此本发明不做限定。Among them, the prediction model group can be composed of xgboost algorithm, SVR algorithm, neural network algorithm, belief network algorithm, decision tree algorithm, random forest regression algorithm, gradient boosting tree regression algorithm, linear regression algorithm, deep learning algorithm and other algorithms. The present invention is not limited.
具体地,可以根据预测设备的属性和预测预处理后的设备数据集中的每条数据,建立预测模型组;然后利用预测模型组,分别计算训练集和测试集的每条数据的均方根误差;进而可以将得到的训练集和测试集的每条数据的均方根误差,作为预处理后的设备数据集中的每条数据的评价指标值。Specifically, a prediction model group can be established according to the properties of the predicted device and each piece of data in the preprocessed device data set; then the root mean square error of each piece of data in the training set and the test set can be calculated using the prediction model group ; Furthermore, the root mean square error of each piece of data in the obtained training set and test set can be used as the evaluation index value of each piece of data in the preprocessed device data set.
进一步地,对于利用预测模型组,分别计算训练集和测试集的每条数据的均方根误差的实现方式优选地,可以利用预测设备的训练数据集,训练预测模型组中的算法,以得到预测结果;可以利用预测设备的测试数据集,训练所述预测模型组中的算法,以得到测试结果;进而,根据预测结果和测试结果,得到训练集和测试集的每条数据的均方根误差。Further, for the implementation of using the prediction model group to calculate the root mean square error of each piece of data in the training set and the test set, preferably, the training data set of the prediction device can be used to train the algorithm in the prediction model group to obtain Prediction result; the algorithm in the prediction model group can be trained by using the test data set of the prediction device to obtain the test result; then, according to the prediction result and the test result, the root mean square of each piece of data in the training set and the test set can be obtained error.
例如,计算训练集和测试集的每条数据做预测,得到每条数据的均方根误差(设置为rmse),并将该均方根误差作为预处理后的设备数据集中的每条数据的评价指标值,然后选取rmse值最小的那个算法作为训练集和测试集的对应的算法标签(对算法组中的每个算法进行标号,标号值是1,2,3等)For example, calculate each piece of data in the training set and test set for prediction, get the root mean square error (set to rmse) of each piece of data, and use the root mean square error as the value of each piece of data in the preprocessed device data set Evaluate the index value, and then select the algorithm with the smallest rmse value as the corresponding algorithm label of the training set and test set (label each algorithm in the algorithm group, and the label value is 1, 2, 3, etc.)
关于步骤中rmse得到的过程:用训练集训练算法组给出的算法,用测试集测试训练算法得到的预测结果,用预测结果和测试集得出rmse指标。rmse指标计算公式如下:About the process of obtaining rmse in the steps: use the training set to train the algorithm given by the algorithm group, use the test set to test the prediction results obtained by the training algorithm, and use the prediction results and the test set to obtain the rmse index. The calculation formula of rmse index is as follows:
Figure PCTCN2022116583-appb-000003
Figure PCTCN2022116583-appb-000003
其中,n≥1为标号值;y i为训练集,
Figure PCTCN2022116583-appb-000004
为测试集;i≥1为对应的数据集编号。
Among them, n≥1 is the label value; y i is the training set,
Figure PCTCN2022116583-appb-000004
is the test set; i≥1 is the corresponding data set number.
S604,根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。S604, according to the smallest evaluation index value, determine a flue gas oxygen content load prediction model suitable for the prediction equipment.
具体地,可以通过对预处理后的设备数据集中的每条数据的评价指标值进行从小到大排序;并根据排序结果,选取最小的评价指标值;然后调取预测模型组中的预测模型对应的标签值;Specifically, the evaluation index value of each piece of data in the preprocessed equipment data set can be sorted from small to large; and according to the sorting result, the smallest evaluation index value is selected; and then the corresponding prediction model in the prediction model group is called tag value;
当最小的所述评价指标值与预测模型组中的预测模型对应的标签值匹配时,确定预测 模型对应的标签值对应的预测模型为适合预测设备的烟气含氧量负荷预测模型。When the minimum evaluation index value matches the label value corresponding to the prediction model in the prediction model group, it is determined that the prediction model corresponding to the label value corresponding to the prediction model is a flue gas oxygen content load prediction model suitable for the prediction equipment.
其中,对于调取预测模型组中的预测模型对应的标签值的方式,可以利用分类算法对预测设备的训练数据集进行聚类,以得到至少两类训练聚类数据;然后,通过调取分类器,分类至少两类训练聚类的数据;再根据分类的至少两类训练聚类的数据,训练出至少两类训练聚类对应的至少两个分类器;再通过对预测设备的测试数据进行聚类预测,以得到至少两类训练聚类中的至少一个类别。根据至少两类训练聚类中的至少一个类别,确定至少两类训练聚类中的至少一个类别对应的分类器;最后,根据分类器对应的类别标签值,确定预测模型组中的预测模型对应的标签值。Among them, for the method of calling the label value corresponding to the prediction model in the prediction model group, the classification algorithm can be used to cluster the training data set of the prediction device to obtain at least two types of training cluster data; then, by calling the classification Classify the data of at least two types of training clusters; then according to the classified data of at least two types of training clusters, train at least two classifiers corresponding to at least two types of training clusters; Cluster prediction to obtain at least one class from at least two training clusters. According to at least one category in the at least two training clusters, determine the classifier corresponding to at least one category in the at least two training clusters; finally, determine the corresponding prediction model in the prediction model group according to the category label value corresponding to the classifier tag value.
进一步举例说明:可以采用二分算法对训练集数据进行聚类,得到聚类数K个类别(K是常数),然后选用分类算法进行K类聚类,得到K类中对应的数据,选取一个分类器,比如梯度提升回归树分别对K类数据进行分类,训练数据对应的标签(由步骤S203给出),有K类数据,因此训练出K个分类器。然后,对测试集数据进行聚类预测操作,预测结果是1-K聚类中的某个类别,然后用此类对应的分类器进行分类操作,得到一个输出结果,这个结果对应算法组中的某个预测模型,然后用这个算法进行数据的预测,进而判断得到该预测模型的正确性。Further example: the binary algorithm can be used to cluster the training set data to obtain the number of clusters K categories (K is a constant), and then use the classification algorithm to cluster the K categories to obtain the corresponding data in the K categories, and select a category For example, the gradient boosting regression tree classifies K types of data respectively, and the corresponding label of the training data (given by step S203) has K types of data, so K classifiers are trained. Then, the clustering prediction operation is performed on the test set data. The prediction result is a certain category in the 1-K clustering, and then the corresponding classifier is used to perform the classification operation to obtain an output result, which corresponds to the algorithm group. A predictive model, and then use this algorithm to predict the data, and then judge the correctness of the predictive model.
根据本公开实施例提供的技术方案,通过基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。以提高对能源设备的烟气含氧量的预测,并降低现有技术的测量成本。According to the technical solution provided by the embodiments of the present disclosure, based on the federated learning architecture, the training data set and the test data set of the prediction device from the participant are received; the data in the training data set and the test data set of the prediction device are preprocessed , and obtain the preprocessed equipment data set; according to the establishment of the prediction model group, calculate the evaluation index value of each piece of data in the preprocessed equipment data set; according to the minimum evaluation index value, determine the flue gas oxygen content suitable for predicting the equipment load forecasting model. In order to improve the prediction of the oxygen content of the flue gas of the energy equipment, and reduce the measurement cost of the existing technology.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
图7是本公开实施例提供的一种烟气含氧量负荷预测模型选择装置的示意图。如图7所示,该装置包括:Fig. 7 is a schematic diagram of a flue gas oxygen content load prediction model selection device provided by an embodiment of the present disclosure. As shown in Figure 7, the device includes:
接收模块701,用于基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;The receiving module 701 is configured to receive training data sets and test data sets from prediction devices of participating parties based on the joint learning architecture;
预处理模块702,用于对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;A preprocessing module 702, configured to preprocess the data in the training data set and the test data set of the prediction device, and obtain a preprocessed device data set;
计算模块703,用于根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;The calculation module 703 is used to calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group;
预测模块704,用于根据最小的所述评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。The prediction module 704 is configured to determine a flue gas oxygen content load prediction model suitable for the prediction device according to the smallest value of the evaluation index.
根据本公开实施例提供的技术方案,通过基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。以提高对能源设备的烟气含氧量的预测,并降低现有技术的测量成本。According to the technical solution provided by the embodiments of the present disclosure, based on the federated learning architecture, the training data set and the test data set of the prediction device from the participant are received; the data in the training data set and the test data set of the prediction device are preprocessed , and obtain the preprocessed equipment data set; according to the establishment of the prediction model group, calculate the evaluation index value of each piece of data in the preprocessed equipment data set; according to the minimum evaluation index value, determine the flue gas oxygen content suitable for predicting the equipment load forecasting model. In order to improve the prediction of the oxygen content of the flue gas of the energy equipment, and reduce the measurement cost of the existing technology.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the sequence numbers of the steps in the above embodiments do not mean the order of execution, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
在分布式能源领域,小型燃气锅炉为了节约成本,普遍放弃安装氧化锆测量仪,导致无法实现闭环控制和热效率最优运行,特别是当燃气热值不稳定的场景,无法准确测量烟气含氧量。图8是本公开实施例提供的一种烟气排放预测方法的流程示意图,该基于联合学习的烟气排放预测方法可以由图1的参与方执行。如图8所示,该方法包括:In the field of distributed energy, in order to save costs, small gas-fired boilers generally abandon the installation of zirconia measuring instruments, resulting in the inability to achieve closed-loop control and optimal thermal efficiency operation, especially when the calorific value of gas is unstable, and the oxygen content in flue gas cannot be accurately measured quantity. Fig. 8 is a schematic flowchart of a smoke emission prediction method provided by an embodiment of the present disclosure, and the smoke emission prediction method based on joint learning can be executed by the participants in Fig. 1 . As shown in Figure 8, the method includes:
S801,根据本地能源数据,训练本地能源数据测模型。S801. Train a local energy data measurement model according to the local energy data.
具体地,本地能源数据可以是指本地设备的烟气温度、烟气流速、设备入口压力等,例如蒸汽锅炉烟气温度,节能器出口温度,烟气流量瞬时值,蒸汽锅炉燃气温度,蒸汽锅炉烟 气标况流量,蒸汽锅炉天然气入口压力,蒸汽锅炉烟气流速,蒸汽锅炉冷凝器进口烟温,蒸汽锅炉排烟温度,蒸汽锅炉烟气压力,蒸汽锅炉冷凝器进口压力,蒸汽锅炉主蒸汽瞬时流量,蒸汽锅炉运行状态,蒸汽锅炉天然气入口瞬时流量等。Specifically, local energy data can refer to the flue gas temperature, flue gas flow rate, and equipment inlet pressure of local equipment, such as steam boiler flue gas temperature, economizer outlet temperature, instantaneous value of flue gas flow, steam boiler gas temperature, steam boiler Flue gas standard flow rate, steam boiler natural gas inlet pressure, steam boiler flue gas flow rate, steam boiler condenser inlet flue temperature, steam boiler exhaust gas temperature, steam boiler flue gas pressure, steam boiler condenser inlet pressure, steam boiler main steam instantaneous Flow rate, operating status of steam boiler, instantaneous flow rate of natural gas inlet of steam boiler, etc.
进一步地,在根据本地能源数据训练本地能源数据测模型之前,还可以利用下述方式对本地能源数据、测试数据和目标能源设备的能源数据进行整理或筛选:首先,选取样本数据集;对样本数据集中的数据进行标签化,并得到所述样本数据集中的数据对应的标签数据;分别确定本地能源数据、测试数据和目标能源设备的能源数据对应的所述标签数据。Furthermore, before training the local energy data measurement model according to the local energy data, the following methods can also be used to organize or filter the local energy data, test data and energy data of the target energy equipment: first, select the sample data set; Labeling the data in the data set, and obtaining the label data corresponding to the data in the sample data set; respectively determining the label data corresponding to the local energy data, test data, and energy data of the target energy equipment.
S802,基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型。S802. Based on the joint learning framework, train the local energy data measurement model according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model.
其中,测试数据可以是通过本地能源数据中筛选的,也可以是其他相关设备提取的能源数据;目标能源设备的能源数据可以是需要预测的设备的能源数据。Wherein, the test data may be selected from local energy data, or may be energy data extracted from other related equipment; the energy data of the target energy equipment may be the energy data of the equipment to be predicted.
具体地,可以通过下述方式实现:基于联合学习框架,参与方将本地能源数据预测模型发送给中心节点;响应于中心节点的反馈信息,根据测试数据和目标能源设备的能源数据训练本地能源数据测试模型,分别得到测试数据预测模型和目标能源数据预测模型。Specifically, it can be achieved in the following ways: Based on the federated learning framework, the participants send the local energy data prediction model to the central node; in response to the feedback information of the central node, train the local energy data according to the test data and the energy data of the target energy equipment The test model is used to obtain the test data prediction model and the target energy data prediction model respectively.
S803,基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重;其中,第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重。S803. Calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model, and the target energy data prediction model; wherein, the first sample migration weight is the local energy data for the target energy equipment The sample migration weight of the energy data, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device.
具体地,在基于联合学习框架中,首先,参与方将本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型发送给中心节点;中心节点可以根据参与方(或需求方)的目标需求本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型进行整理或调整后,现发送给相关的参与方;然后,响应于中心节点的反馈信息,基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,分别对所述本地能源数据、测试数据和目标能源设备的能源数据进行目标分类;最后,根据目标分类,计算第一样本迁移权重和第二样本迁移权重。Specifically, in the framework based on federated learning, first, the participants send the local energy data prediction model, test data prediction model and target energy data prediction model to the central node; the central node can After the local energy data prediction model, test data prediction model and target energy data prediction model are sorted or adjusted, they are sent to relevant participants; then, in response to the feedback information from the central node, based on the local energy data prediction model, test data prediction The model and the target energy data prediction model perform target classification on the local energy data, test data and energy data of the target energy equipment respectively; finally, calculate the first sample migration weight and the second sample migration weight according to the target classification.
S804,利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型。S804. Using the local energy data and the first sample transfer weight, the test data and the second sample transfer weight, respectively train the local energy data network model and the test data network model.
具体地,可优选通过利用本地能源数据和第一样本迁移权重,建立应用数据集;进而可根据建立的应用数据集,训练本地能源数据网络模型;再利用测试数据和第二样本迁移权重,建立期望数据集;进而可以根据期望数据集,训练测试数据网络模型。对于训练本地能源数据网络模型和测试数据网络模型可以并行训练,也可以先训练两个模型中的一种,对此本公开不限定。Specifically, it is preferable to establish an application data set by using the local energy data and the first sample migration weight; and then according to the established application data set, train the local energy data network model; and then use the test data and the second sample migration weight, Establish the expected data set; then, the test data network model can be trained according to the expected data set. The local energy data network model and the test data network model can be trained in parallel, or one of the two models can be trained first, which is not limited in this disclosure.
S805,接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型。S805. Receive a joint learning prediction model from the central node after aggregate training of the local energy data network model and the test data network model.
具体地,可以通过将本地能源数据网络模型和测试数据网络模型上传至中心节点;由中心节点对本地能源数据网络模型和测试数据网络模型进行聚合训练。然后,在接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型。Specifically, the local energy data network model and the test data network model can be uploaded to the central node; the central node performs aggregation training on the local energy data network model and the test data network model. Then, after receiving the joint learning prediction model from the central node, the local energy data network model and the test data network model are aggregated and trained.
进一步地,对联合学习的预测模型可以通过根据预测条件,对联合学习预测模型优化;其中预测条件包含:模型参数的预测值和模型参数的适应度的判断。Furthermore, the joint learning prediction model can be optimized according to the prediction conditions; wherein the prediction conditions include: the prediction value of the model parameters and the judgment of the fitness of the model parameters.
S806,根据联合学习预测模型对目标能源设备进行烟气排放的预测。S806. Predict the flue gas emission of the target energy equipment according to the joint learning prediction model.
根据本公开实施例提供的技术方案,通过根据本地能源数据,训练本地能源数据测模型;基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;利用本地能源数据和第一 样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;根据联合学习预测模型对目标能源设备进行烟气排放的预测。以解决现有技术中因设备的分布多不同,而对烟气排放测量不准的问题。进而节约了实际传感器安装的资源成本。According to the technical solution provided by the embodiments of the present disclosure, the local energy data measurement model is trained according to the local energy data; based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data predictions are respectively obtained model and target energy data prediction model; based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, the first sample migration weight and the second sample migration weight are calculated, wherein the first sample migration weight is the local The energy data is aimed at the sample migration weight of the energy data of the target energy device, and the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device; using the local energy data and the first sample migration weight, the test data and the second sample migration weight Two-sample migration weights to train the local energy data network model and the test data network model respectively; receive the joint learning prediction model after the aggregation training of the local energy data network model and the test data network model from the central node; according to the joint learning prediction model, the target energy Equipment for flue gas emission predictions. In order to solve the problem of inaccurate measurement of flue gas emission due to the different distribution of equipment in the prior art. This in turn saves resource costs for the actual sensor installation.
上述所有可选技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above optional technical solutions may be combined in any way to form optional embodiments of the present application, which will not be repeated here.
图9是本公开实施例提供的一种烟气排放预测装置的示意图。如图9所示,该烟气排放预测装置包括:Fig. 9 is a schematic diagram of a flue gas emission prediction device provided by an embodiment of the present disclosure. As shown in Figure 9, the flue gas emission prediction device includes:
第一训练模块901,用于根据本地能源数据,训练本地能源数据测模型;The first training module 901 is used to train the local energy data measurement model according to the local energy data;
第二训练模块902,用于基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;The second training module 902 is used to train the local energy data measurement model based on the joint learning framework according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model;
计算模块903,用于基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;Calculation module 903, configured to calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, wherein the first sample migration weight is the local energy data For the sample migration weight of the energy data of the target energy device, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
第三训练模块904,用于利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;The third training module 904 is used to train the local energy data network model and the test data network model respectively by using the local energy data and the first sample transfer weight, test data and the second sample transfer weight;
建立模块905,用于接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;Establishment module 905, used to receive the joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
预测模块906,用于根据联合学习预测模型对目标能源设备进行烟气排放的预测。The prediction module 906 is used to predict the smoke emission of the target energy equipment according to the joint learning prediction model.
根据本公开实施例提供的技术方案,通过根据本地能源数据,训练本地能源数据测模型;基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;根据联合学习预测模型对目标能源设备进行烟气排放的预测。以解决现有技术中因设备的分布多不同,而对烟气排放测量不准的问题。进而节约了实际传感器安装的资源成本。According to the technical solution provided by the embodiments of the present disclosure, the local energy data measurement model is trained according to the local energy data; based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data predictions are respectively obtained model and target energy data prediction model; based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, the first sample migration weight and the second sample migration weight are calculated, wherein the first sample migration weight is the local The energy data is aimed at the sample migration weight of the energy data of the target energy device, and the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device; using the local energy data and the first sample migration weight, the test data and the second sample migration weight Two-sample migration weights to train the local energy data network model and the test data network model respectively; receive the joint learning prediction model after the aggregation training of the local energy data network model and the test data network model from the central node; according to the joint learning prediction model, the target energy Equipment for flue gas emission predictions. In order to solve the problem of inaccurate measurement of flue gas emission due to the different distribution of equipment in the prior art. This in turn saves resource costs for the actual sensor installation.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本公开实施例的实施过程构成任何限定。It should be understood that the sequence numbers of the steps in the above embodiments do not mean the order of execution, and the execution order of each process should be determined by its functions and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present disclosure.
图10是本公开实施例提供的计算机设备10的示意图。如图10所示,该实施例的计算机设备10包括:处理器1001、存储器1002以及存储在该存储器1002中并且可以在处理器1001上运行的计算机程序1003。处理器1001执行计算机程序1003时实现上述各个方法实施例中的步骤。或者,处理器1001执行计算机程序1003时实现上述各装置实施例中各模块/单元的功能。FIG. 10 is a schematic diagram of a computer device 10 provided by an embodiment of the present disclosure. As shown in FIG. 10 , the computer device 10 of this embodiment includes: a processor 1001 , a memory 1002 , and a computer program 1003 stored in the memory 1002 and capable of running on the processor 1001 . When the processor 1001 executes the computer program 1003, the steps in the foregoing method embodiments are implemented. Alternatively, when the processor 1001 executes the computer program 1003, the functions of the modules/units in the foregoing device embodiments are implemented.
示例性地,计算机程序1003可以被分割成一个或多个模块/单元,一个或多个模块/单元被存储在存储器1002中,并由处理器1001执行,以完成本公开。一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述计算机程序1003在计算机设备10中的执行过程。Exemplarily, the computer program 1003 can be divided into one or more modules/units, and one or more modules/units are stored in the memory 1002 and executed by the processor 1001 to complete the present disclosure. One or more modules/units may be a series of computer program instruction segments capable of accomplishing specific functions, and the instruction segments are used to describe the execution process of the computer program 1003 in the computer device 10 .
计算机设备10可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算机设备。计算机设备10可以包括但不仅限于处理器1001和存储器1002。本领域技术人员可以理解,图4仅仅是计算机设备10的示例,并不构成对计算机设备10的限定,可以包括比图示更多 或更少的部件,或者组合某些部件,或者不同的部件,例如,计算机设备还可以包括输入输出设备、网络接入设备、总线等。The computer device 10 may be a computer device such as a desktop computer, a notebook, a palmtop computer, or a cloud server. The computer device 10 may include, but is not limited to, a processor 1001 and a memory 1002 . Those skilled in the art can understand that FIG. 4 is only an example of the computer device 10, and does not constitute a limitation to the computer device 10. It may include more or less components than those shown in the illustration, or combine certain components, or different components. , for example, computer equipment may also include input and output equipment, network access equipment, bus, and so on.
处理器1001可以是中央处理单元(Central Processing Unit,CPU),也可以是其它通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其它可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 1001 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), on-site Field-Programmable Gate Array (FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like.
存储器1002可以是计算机设备10的内部存储单元,例如,计算机设备10的硬盘或内存。存储器1002也可以是计算机设备10的外部存储设备,例如,计算机设备10上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器1002还可以既包括计算机设备10的内部存储单元也包括外部存储设备。存储器1002用于存储计算机程序以及计算机设备所需的其它程序和数据。存储器1002还可以用于暂时地存储已经输出或者将要输出的数据。The memory 1002 may be an internal storage unit of the computer device 10 , for example, a hard disk or a memory of the computer device 10 . The memory 1002 can also be an external storage device of the computer device 10, for example, a plug-in hard disk equipped on the computer device 10, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash memory card ( Flash Card), etc. Further, the storage 1002 may also include both an internal storage unit of the computer device 10 and an external storage device. The memory 1002 is used to store computer programs and other programs and data required by the computer equipment. The memory 1002 can also be used to temporarily store data that has been output or will be output.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of description, only the division of the above-mentioned functional units and modules is used for illustration. In practical applications, the above-mentioned functions can be assigned to different functional units, Completion of modules means that the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated into one processing unit, or each unit may exist separately physically, or two or more units may be integrated into one unit, and the above-mentioned integrated units may adopt hardware It can also be implemented in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing each other, and are not used to limit the protection scope of the present application. For the specific working processes of the units and modules in the above system, reference may be made to the corresponding processes in the aforementioned method embodiments, and details will not be repeated here.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the above-mentioned embodiments, the descriptions of each embodiment have their own emphases, and for parts that are not detailed or recorded in a certain embodiment, refer to the relevant descriptions of other embodiments.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本公开的范围。Those skilled in the art can appreciate that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementation should not be considered beyond the scope of the present disclosure.
在本公开所提供的实施例中,应该理解到,所揭露的装置/计算机设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/计算机设备实施例仅仅是示意性的,例如,模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided in the present disclosure, it should be understood that the disclosed apparatus/computer equipment and methods may be implemented in other ways. For example, the device/computer device embodiments described above are only illustrative, for example, the division of modules or units is only a logical function division, and there may be other division methods in actual implementation, and multiple units or components can be Incorporation may either be integrated into another system, or some features may be omitted, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。A unit described as a separate component may or may not be physically separated, and a component displayed as a unit may or may not be a physical unit, that is, it may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本公开各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present disclosure may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。基于这样的理解,本公开实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,计算机程序可以存储在计算机可读存储介质中,该计算机程序在被处理器执行时,可以实现上述各个方法实施例的步骤。计算机程序可以包括计算机程序代码,计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。计算机可读介质可以包括:能够携带计算机程序代码的 任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、电载波信号、电信信号以及软件分发介质等。需要说明的是,计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如,在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。If an integrated module/unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the present disclosure realizes all or part of the processes in the methods of the above embodiments, and can also be completed by instructing related hardware through computer programs. The computer programs can be stored in computer-readable storage media, and the computer programs can be processed. When executed by the controller, the steps in the above-mentioned method embodiments can be realized. A computer program may include computer program code, which may be in source code form, object code form, executable file, or some intermediate form or the like. The computer-readable medium may include: any entity or device capable of carrying computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (Read-Only Memory, ROM), random access Memory (Random Access Memory, RAM), electrical carrier signal, telecommunication signal and software distribution medium, etc. It should be noted that the content contained in computer readable media may be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction. For example, in some jurisdictions, computer readable media may not Including electrical carrier signals and telecommunication signals.
以上实施例仅用以说明本公开的技术方案,而非对其限制;尽管参照前述实施例对本公开进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本公开各实施例技术方案的精神和范围,均应包含在本公开的保护范围之内。The above embodiments are only used to illustrate the technical solutions of the present disclosure, rather than to limit them; although the present disclosure has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be described in the foregoing embodiments Modifications to the technical solutions recorded, or equivalent replacements for some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present disclosure, and should be included in this disclosure. within the scope of protection.

Claims (20)

  1. 一种烟气含氧量负荷预测方法,其特征在于,包括:A flue gas oxygen content load prediction method, characterized in that, comprising:
    参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;The participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets;
    根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;According to the data sets of multiple sets of local equipment and the corresponding sample weights, train the prediction neural network model of multiple sets of equipment;
    将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;Upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation to obtain the aggregated predictive neural network models;
    根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;Train the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model;
    根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。According to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted.
  2. 根据权利要求1所述的方法,其特征在于,参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重包括:The method according to claim 1, wherein the participants respectively determine the sample weights corresponding to the data sets of the local multiple groups of devices and the target device data sets include:
    参与方选取本地多组设备的数据集与目标设备的数据集;The participant selects the data sets of multiple sets of local devices and the data sets of the target device;
    合并本地多组的设备的数据集与目标设备的数据集,以得到合并数据;Merge the data sets of the local multiple groups of devices with the data sets of the target device to obtain the merged data;
    利用所述合并数据训练核密度估计模型;training a kernel density estimation model using the merged data;
    根据核密度估计模型,分别确定本地多组的设备的数据集和目标设备的数据集对应的样本权重。According to the kernel density estimation model, the sample weights corresponding to the data sets of the local multiple groups of devices and the data sets of the target device are respectively determined.
  3. 根据权利要求1所述的方法,其特征在于,根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值包括:The method according to claim 1, wherein, according to the joint prediction model and the sample weight corresponding to the target equipment data set, predicting the oxygen content value of the flue gas of the target equipment includes:
    参与方将联合预测模型上传至中心节点进行联合学习训练;Participants upload the joint prediction model to the central node for joint learning and training;
    响应与中收引擎反馈的对联合预测模型的联合学习训练;Joint learning and training of the joint prediction model in response to feedback from the receiving engine;
    将经过联合学习训练的联合预测模型发送至目标设备;Send the joint prediction model trained by joint learning to the target device;
    根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。According to the joint prediction model and the sample weight corresponding to the target equipment data set, the oxygen content value of the flue gas of the target equipment is predicted.
  4. 根据权利要求1所述的方法,其特征在于,还包括:The method according to claim 1, further comprising:
    参与方利用本地多组设备的数据集建立预测神经网络模型训练样本;Participants use the data sets of multiple sets of local devices to establish training samples for predictive neural network models;
    利用目标设备的数据集建立预测神经网络模型测试样本;Use the data set of the target device to establish a predictive neural network model test sample;
    根据所述模型训练样本和模型测试样本,得到样本预测值;Obtaining a sample prediction value according to the model training sample and the model testing sample;
    根据样本预测值与样本期望值的误差矩阵的范数,得到预测神经网络模型的适应度值;According to the norm of the error matrix between the sample predicted value and the sample expected value, the fitness value of the predicted neural network model is obtained;
    根据所述适应度值对预测神经网络模型中的种群中的粒子进行更新,以得到优化的预测神经网络模型。The particles in the population in the prediction neural network model are updated according to the fitness value to obtain an optimized prediction neural network model.
  5. 一种烟气含氧量负荷预测装置,其特征在于,包括:A flue gas oxygen content load prediction device, characterized in that it comprises:
    确定模块,用于参与方分别确定本地多组设备的数据集和目标设备数据集对应的样本权重;A determination module, used for the participants to respectively determine the sample weights corresponding to the data sets of multiple groups of local devices and the data sets of the target device;
    第一训练模块,用于根据本地多组设备的数据集和对应的样本权重,训练得到多组设备的预测神经网络模型;The first training module is used to train the prediction neural network models of multiple sets of equipment according to the data sets of multiple sets of local equipment and the corresponding sample weights;
    聚合模块,用于将本地多组设备的预测神经网络模型上传到中心节点进行模型聚合,以得到聚合后的预测神经网络模型;The aggregation module is used to upload the predictive neural network models of multiple groups of local devices to the central node for model aggregation, so as to obtain the aggregated predictive neural network models;
    第二训练模块,根据预设训练条件训练聚合后的预测神经网络模型,以得到联合预测模型;The second training module trains the aggregated prediction neural network model according to preset training conditions to obtain a joint prediction model;
    预测模块,用于根据联合预测模型和目标设备数据集对应的样本权重,预测目标设备烟气含氧量值。The prediction module is used to predict the oxygen content value of the flue gas of the target device according to the joint prediction model and the sample weight corresponding to the target device data set.
  6. 一种烟气含氧量负荷预测方法,其特征在于,所述方法应用于联合学习框架中,包括:A flue gas oxygen content load prediction method, characterized in that the method is applied in a joint learning framework, including:
    获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;Obtain the equipment data of the first participant and the equipment data of the second participant under the federated learning architecture; among them, the first participant is the participant who proposes the prediction demand, and the second participant is other participants except the first participant square;
    利用所述第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;using the device data of the first party and the device data of the second party to train a predictive classifier;
    根据所述预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;determining weight data of the device data of the first party with respect to the device data of the second party based on the predictive classifier;
    基于第二参与方的设备数据和所述权重数据,训练预测式梯度提升模型;training a predictive gradient boosting model based on the device data of the second participant and the weight data;
    利用所述预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。Using the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first party.
  7. 根据权利要求6所述的方法,其特征在于,获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数据包括:接收来自第一参与方的设备数据集与第二参与方的设备数据集;根据预设筛选特征,筛选第一参与方的设备数据集与第二参与方的设备数据集,以分别得到第一参与方的设备数据本样量和第二参与方的设备数据量;将所述第一参与方的设备数据本样量和第二参与方的设备数据量,分别确定为第一参与方的设备数据和第二参与方的设备数据;The method according to claim 6, wherein acquiring the equipment data of the first participant and the equipment data of the second participant under the joint learning framework comprises: receiving the equipment data set and the second participant's equipment data from the first participant According to the preset screening features, the equipment data set of the first party and the equipment data set of the second party are screened to obtain the sample size of the equipment data of the first party and the data set of the second party respectively. Amount of equipment data; determining the sample volume of equipment data of the first participant and the amount of equipment data of the second participant as the equipment data of the first participant and the equipment data of the second participant, respectively;
    或者,利用所述第一参与方的设备数据和第二参与方的设备数据,训练预测分类器包括:标签化处理第一参与方的设备数据和第二参与方的设备数据,得到第一参与方的设备数据的标签数据和第二参与方的设备数据的标签数据;合并第一参与方的设备数据的标签数据和第二参与方的设备数据的标签数据,以得到合并后的标签数据;根据合并后的标签数据,训练预测分类器。Alternatively, using the equipment data of the first participant and the equipment data of the second participant, training the predictive classifier includes: labeling and processing the equipment data of the first participant and the equipment data of the second participant to obtain the first participant The tag data of the device data of the first party and the tag data of the device data of the second party; the tag data of the device data of the first party and the tag data of the device data of the second party are merged to obtain the combined tag data; Train a predictive classifier on the combined labeled data.
  8. 根据权利要求6所述的方法,其特征在于,根据所述预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据包括:The method according to claim 6, wherein, according to the predictive classifier, determining the weight data of the equipment data of the first participant with respect to the equipment data of the second participant comprises:
    利用所述预测分类器,分别得到第一参与方的设备数据对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值;Using the predictive classifier, respectively obtain the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant;
    根据所述第一参与方的设备数据对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据。According to the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant, determine the weight data of the equipment data of the first participant with respect to the equipment data of the second participant .
  9. 根据权利要求8所述的方法,其特征在于,利用所述预测分类器,分别得到第一参与方的设备数据对应的设备故障概率值和第二参与方的设备数据对应的设备故障概率值包括:The method according to claim 8, characterized in that, using the predictive classifier, respectively obtaining the equipment failure probability value corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment data of the second participant include :
    利用所述预测分类器,分别对第一参与方的设备数据和第二参与方的设备数据进行分类,以得到第一参与方的设备数据对应的设备故障数据和第二参与方的设备数据对应的设备故障数据;Using the predictive classifier, respectively classify the equipment data of the first participant and the equipment data of the second participant, so as to obtain the equipment fault data corresponding to the equipment data of the first participant and the equipment data corresponding to the second participant equipment failure data;
    分别计算第一参与方的设备数据对应的设备故障数据和第二参与方的设备数据对应的设备故障数据对应设备故障概率值。The equipment failure data corresponding to the equipment data of the first participant and the equipment failure probability value corresponding to the equipment failure data corresponding to the equipment data of the second participant are respectively calculated.
  10. 根据权利要求6所述的方法,其特征在于,基于第二参与方的设备数据和所述权重数据,训练预测式梯度提升模型还包括:The method according to claim 6, wherein, based on the equipment data of the second participant and the weight data, training the predictive gradient boosting model further comprises:
    基于第二参与方的设备数据和所述权重数据,训练预测式梯度提升模型;training a predictive gradient boosting model based on the device data of the second participant and the weight data;
    获取第一参与方的测试数据训练预测式梯度提升模型,以得到第一参与方的设备预测值;Obtain the test data of the first participant to train the predictive gradient boosting model to obtain the device prediction value of the first participant;
    根据第一参与方的设备预测值与设备期望值的误差矩阵的范数,得到预测式梯度提升模型的适应度值;Obtain the fitness value of the predictive gradient boosting model according to the norm of the error matrix between the predicted value of the equipment of the first participant and the expected value of the equipment;
    根据所述适应度值,对所述预测式梯度提升模型中的种群中的粒子进行更新,以得到优化的预测式梯度提升模型。According to the fitness value, the particles in the population in the predictive gradient boosting model are updated to obtain an optimized predictive gradient boosting model.
  11. 根据权利要求10中所述的方法,其特征在于,根据所述适应度值对所述预测式梯度提升模型中的种群中的粒子进行更新,以得到优化的预测式梯度提升模型包括:The method according to claim 10, wherein the particles in the population in the predictive gradient boosting model are updated according to the fitness value, so as to obtain an optimized predictive gradient boosting model comprising:
    确定所述预测式梯度提升模型中的种群和所述种群中粒子;determining a population in the predictive gradient boosting model and particles in the population;
    判断当前的所述种群中粒所对应的适应度值是否大于之前的旧粒子的适应度值;Judging whether the fitness value corresponding to the particle in the current population is greater than the fitness value of the previous old particle;
    如果小于,则需要更新所述预测式梯度提升模型中的种群和所述种群中粒子。If it is less than, the population in the predictive gradient boosting model and the particles in the population need to be updated.
  12. 一种烟气含氧量负荷预测装置,其特征在于,所述装置应用于联合学习框架中包括:A flue gas oxygen content load forecasting device, characterized in that the application of the device in the joint learning framework includes:
    获取模块,用于获取联合学习架构下的第一参与方的设备数据和第二参与方的设备数 据;其中,第一参与方为提出预测需求的参与方,第二参与方为除第一参与方外的其他参与方;The acquisition module is used to acquire the equipment data of the first participant and the equipment data of the second participant under the joint learning architecture; wherein, the first participant is the participant who proposes the prediction demand, and the second participant is the participant other than the first participant parties other than the Party;
    第一训练模块利用所述第一参与方的设备数据和第二参与方的设备数据,训练预测分类器;The first training module uses the device data of the first participant and the device data of the second participant to train a predictive classifier;
    计算模块,用于根据所述预测分类器,确定第一参与方的设备数据关于第二参与方的设备数据的权重数据;A calculation module, configured to determine weight data of the equipment data of the first participant with respect to the equipment data of the second participant according to the predictive classifier;
    第二训练模块,用于基于第二参与方的设备数据和所述权重数据组,训练预测式梯度提升模型;A second training module, configured to train a predictive gradient boosting model based on the equipment data of the second participant and the weight data set;
    预测模块,用于利用所述预测式梯度提升模型预测第一参与方设备的烟气含氧量负荷。A prediction module, configured to use the predictive gradient boosting model to predict the flue gas oxygen content load of the equipment of the first participant.
  13. 一种烟气含氧量负荷预测模型选择方法,其特征在于,包括:A method for selecting a flue gas oxygen content load prediction model, characterized in that it includes:
    基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;Based on the federated learning architecture, receive the training data set and test data set from the prediction equipment of the participating parties;
    对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;Preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
    根据建立的预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;Calculate the evaluation index value of each piece of data in the preprocessed equipment data set according to the established prediction model group;
    根据最小的评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。According to the minimum evaluation index value, the flue gas oxygen content load prediction model suitable for the prediction equipment is determined.
  14. 根据权利要求13所述的方法,其特征在于,根据建立的预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值包括:根据预测设备的属性和预测预处理后的设备数据集中的每条数据,建立预测模型组;利用预测模型组,分别计算训练集和测试集的每条数据的均方根误差;将得到的训练集和测试集的每条数据的均方根误差,作为预处理后的设备数据集中的每条数据的评价指标值;The method according to claim 13, wherein, according to the established prediction model group, calculating the evaluation index value of each piece of data in the preprocessed equipment data set includes: according to the properties of the predicted equipment and the predicted preprocessed equipment For each piece of data in the data set, a prediction model group is established; using the prediction model group, the root mean square error of each piece of data in the training set and test set is calculated respectively; the root mean square error of each piece of data in the training set and test set is obtained Error, as the evaluation index value of each piece of data in the preprocessed device data set;
    或者,根据最小的所述评价指标值,确定适合预测设备的烟气含氧量负荷预测模型包括:对预处理后的设备数据集中的每条数据的评价指标值进行从小到大排序;根据排序结果,选取最小的所述评价指标值;调取预测模型组中的预测模型对应的标签值;当最小的所述评价指标值与预测模型组中的预测模型对应的标签值匹配时,确定所述标签值对应的预测模型为适合预测设备的烟气含氧量负荷预测模型。Alternatively, according to the smallest evaluation index value, determining a flue gas oxygen content load prediction model suitable for predicting equipment includes: sorting the evaluation index values of each piece of data in the preprocessed equipment data set from small to large; according to the sorting As a result, select the minimum evaluation index value; call the label value corresponding to the prediction model in the prediction model group; when the minimum evaluation index value matches the label value corresponding to the prediction model in the prediction model group, determine the The prediction model corresponding to the above label value is the flue gas oxygen content load prediction model suitable for the prediction equipment.
  15. 一种烟气含氧量负荷预测模型选择装置,其特征在于,包括:A flue gas oxygen content load prediction model selection device, characterized in that it includes:
    接收模块,用于基于联合学习架构,接收来自参与方的预测设备的训练数据集和测试数据集;The receiving module is used to receive training data sets and test data sets from prediction devices of participating parties based on the federated learning architecture;
    预处理模块,用于对预测设备的训练数据集中的数据和测试数据集中的数据进行预处理,并得到预处理后的设备数据集;The preprocessing module is used to preprocess the data in the training data set and the test data set of the prediction device, and obtain the preprocessed device data set;
    计算模块,用于根据建立预测模型组,计算预处理后的设备数据集中的每条数据的评价指标值;A calculation module, configured to calculate the evaluation index value of each piece of data in the preprocessed device data set according to the established prediction model group;
    预测模块,用于根据最小的所述评价指标值,确定适合预测设备的烟气含氧量负荷预测模型。The prediction module is used to determine a flue gas oxygen content load prediction model suitable for the prediction equipment according to the minimum value of the evaluation index.
  16. 一种烟气排放预测方法,其特征在于,包括:A flue gas emission prediction method, characterized in that it comprises:
    根据本地能源数据,训练本地能源数据测模型;According to the local energy data, train the local energy data measurement model;
    基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;Based on the joint learning framework, the local energy data measurement model is trained according to the test data and the energy data of the target energy equipment, and the test data prediction model and the target energy data prediction model are respectively obtained;
    基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,所述第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,所述第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;Based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, calculate the first sample migration weight and the second sample migration weight, wherein the first sample migration weight is the local energy data for the target energy equipment The sample migration weight of the energy data, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
    利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;Using the local energy data and the first sample migration weight, the test data and the second sample migration weight, respectively train the local energy data network model and the test data network model;
    接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;Receive the joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
    根据联合学习预测模型对目标能源设备进行烟气排放的预测。According to the joint learning prediction model, the flue gas emission of the target energy equipment is predicted.
  17. 根据权利要求16所述的方法,其特征在于,基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重包括:基于联合学习框架,参与方将所述本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型发送给中心节点;响应于中心节点的反馈信息,基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,分别对所述本地能源数据、测试数据和目标能源设备的能源数据进行目标分类;根据所述目标分类,计算第一样本迁移权重和第二样本迁移权重;The method according to claim 16, wherein, based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, calculating the first sample transfer weight and the second sample transfer weight comprises: based on a joint learning framework , the participant sends the local energy data prediction model, test data prediction model and target energy data prediction model to the central node; in response to the feedback information of the central node, based on the local energy data prediction model, test data prediction model and target energy data A predictive model, performing target classification on the local energy data, test data and energy data of the target energy equipment respectively; calculating the first sample migration weight and the second sample migration weight according to the target classification;
    或者,利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型包括:利用本地能源数据和第一样本迁移权重,建立应用数据集;根据所述应用数据集,训练本地能源数据网络模型;利用测试数据和第二样本迁移权重,建立期望数据集;Alternatively, using the local energy data and the first sample migration weights, the test data and the second sample migration weights, respectively training the local energy data network model and the test data network model includes: using the local energy data and the first sample migration weights, establishing Applying the data set; according to the application data set, training the local energy data network model; using the test data and the second sample transfer weight to establish the expected data set;
    根据所述期望数据集,训练测试数据网络模型。According to the expected data set, the test data network model is trained.
  18. 一种烟气排放预测装置,其特征在于,包括:A flue gas emission prediction device, characterized in that it comprises:
    第一训练模块,用于根据本地能源数据,训练本地能源数据测模型;The first training module is used to train the local energy data measurement model according to the local energy data;
    第二训练模块,用于基于联合学习框架,根据测试数据和目标能源设备的能源数据训练本地能源数据测模型,分别得到测试数据预测模型和目标能源数据预测模型;The second training module is used to train the local energy data measurement model based on the joint learning framework according to the test data and the energy data of the target energy equipment, and respectively obtain the test data prediction model and the target energy data prediction model;
    计算模块,用于基于本地能源数据预测模型、测试数据预测模型和目标能源数据预测模型,计算第一样本迁移权重和第二样本迁移权重,其中,所述第一样本迁移权重为本地能源数据针对目标能源设备的能源数据的样本迁移权重,所述第二样本迁移权重为测试数据针对目标能源设备的能源数据的样本迁移权重;A calculation module, configured to calculate the first sample migration weight and the second sample migration weight based on the local energy data prediction model, the test data prediction model and the target energy data prediction model, wherein the first sample migration weight is the local energy The sample migration weight of the data for the energy data of the target energy device, the second sample migration weight is the sample migration weight of the test data for the energy data of the target energy device;
    第三训练模块,用于利用本地能源数据和第一样本迁移权重、测试数据和第二样本迁移权重,分别训练本地能源数据网络模型和测试数据网络模型;The third training module is used to train the local energy data network model and the test data network model respectively by using the local energy data and the first sample transfer weight, the test data and the second sample transfer weight;
    建立模块,用于接收来自中心节点对本地能源数据网络模型和测试数据网络模型聚合训练后的联合学习预测模型;Establishing a module for receiving a joint learning prediction model from the central node after the aggregation and training of the local energy data network model and the test data network model;
    预测模块,用于根据联合学习预测模型对目标能源设备进行烟气排放的预测。The prediction module is used to predict the flue gas emission of the target energy equipment according to the joint learning prediction model.
  19. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并且可以在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如权利要求1所述方法的步骤。A computer device, comprising a memory, a processor, and a computer program stored in the memory and capable of running on the processor, characterized in that, when the processor executes the computer program, the computer program according to claim 1 is implemented. steps of the method described above.
  20. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1所述方法的步骤。A computer-readable storage medium storing a computer program, wherein the computer program implements the steps of the method according to claim 1 when the computer program is executed by a processor.
PCT/CN2022/116583 2021-11-11 2022-09-01 Method and apparatus for predicting oxygen content in flue gas and load, method and apparatus for selecting prediction model, and method and apparatus for predicting flue gas emission WO2023082788A1 (en)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
CN202111331379.0A CN114118543A (en) 2021-11-11 2021-11-11 Flue gas oxygen content load prediction method and device based on joint learning
CN202111331191.6A CN114118541A (en) 2021-11-11 2021-11-11 Flue gas emission prediction method and device based on joint learning
CN202111331191.6 2021-11-11
CN202111331186.5 2021-11-11
CN202111331379.0 2021-11-11
CN202111331355.5 2021-11-11
CN202111331186.5A CN114118540A (en) 2021-11-11 2021-11-11 Flue gas oxygen content load prediction method and device based on sample migration
CN202111331355.5A CN114118542A (en) 2021-11-11 2021-11-11 Method and device for selecting flue gas oxygen content load prediction model

Publications (1)

Publication Number Publication Date
WO2023082788A1 true WO2023082788A1 (en) 2023-05-19

Family

ID=86335057

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/116583 WO2023082788A1 (en) 2021-11-11 2022-09-01 Method and apparatus for predicting oxygen content in flue gas and load, method and apparatus for selecting prediction model, and method and apparatus for predicting flue gas emission

Country Status (1)

Country Link
WO (1) WO2023082788A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116523276A (en) * 2023-07-04 2023-08-01 天津市蓟州区民力新能源科技有限公司 High-efficiency energy utilization management platform based on intelligent control system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106164980A (en) * 2014-03-27 2016-11-23 佳能株式会社 Information processor and information processing method
CN109255498A (en) * 2018-10-22 2019-01-22 新智数字科技有限公司 Load forecasting method and device neural network based
CN112257579A (en) * 2020-10-21 2021-01-22 平安科技(深圳)有限公司 Model training method, action recognition method, device, equipment and storage medium
CN114118543A (en) * 2021-11-11 2022-03-01 新智我来网络科技有限公司 Flue gas oxygen content load prediction method and device based on joint learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106164980A (en) * 2014-03-27 2016-11-23 佳能株式会社 Information processor and information processing method
CN109255498A (en) * 2018-10-22 2019-01-22 新智数字科技有限公司 Load forecasting method and device neural network based
CN112257579A (en) * 2020-10-21 2021-01-22 平安科技(深圳)有限公司 Model training method, action recognition method, device, equipment and storage medium
CN114118543A (en) * 2021-11-11 2022-03-01 新智我来网络科技有限公司 Flue gas oxygen content load prediction method and device based on joint learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG YAN, WANG CUNSONG, LU NINGYUN, JIANG BIN: "Remaining Useful Life Prediction for Aero-Engine Based on the Similarity of Degradation Characteristics", SYSTEMS ENGINEERING AND ELECTRONICS, vol. 41, no. 6, 1 January 2019 (2019-01-01), pages 1414 - 1421, XP093066932, DOI: 10.3969/j.issn.1001-506X.2019.06.32 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116523276A (en) * 2023-07-04 2023-08-01 天津市蓟州区民力新能源科技有限公司 High-efficiency energy utilization management platform based on intelligent control system
CN116523276B (en) * 2023-07-04 2023-09-08 天津市蓟州区民力新能源科技有限公司 High-efficiency energy utilization management platform based on intelligent control system

Similar Documents

Publication Publication Date Title
Liu et al. Short‐term traffic speed forecasting based on attention convolutional neural network for arterials
WO2020077682A1 (en) Service quality evaluation model training method and device
Meng et al. Rating the crisis of online public opinion using a multi-level index system
WO2020077672A1 (en) Method and device for training service quality evaluation model
WO2021088499A1 (en) False invoice issuing identification method and system based on dynamic network representation
CN106022477A (en) Intelligent analysis decision system and method
CN110852881B (en) Risk account identification method and device, electronic equipment and medium
WO2023082788A1 (en) Method and apparatus for predicting oxygen content in flue gas and load, method and apparatus for selecting prediction model, and method and apparatus for predicting flue gas emission
WO2020259391A1 (en) Database script performance testing method and device
CN109919236A (en) A kind of BP neural network multi-tag classification method based on label correlation
CN106097094A (en) A kind of man-computer cooperation credit evaluation new model towards medium-sized and small enterprises
CN110289987B (en) Multi-agent system network anti-attack capability assessment method based on characterization learning
CN115099135A (en) Improved artificial neural network multi-type operation power consumption prediction method
CN115051929A (en) Network fault prediction method and device based on self-supervision target perception neural network
CN114584406A (en) Industrial big data privacy protection system and method for federated learning
CN104537383A (en) Massive organizational structure data classification method and system based on particle swarm
WO2023029944A1 (en) Federated learning method and device
CN116011633B (en) Regional gas consumption prediction method, regional gas consumption prediction system, regional gas consumption prediction equipment and Internet of things cloud platform
WO2022143431A1 (en) Method and apparatus for training anti-money laundering model
CN114118543A (en) Flue gas oxygen content load prediction method and device based on joint learning
CN113935819A (en) Method for extracting checking abnormal features
Xue et al. Tsc-gcn: A face clustering method based on gcn
Shao et al. Early Spatiotemporal Event Prediction via Adaptive Controller and Spatiotemporal Embedding
Li et al. A Study on House Price Prediction Based on Stacking-Sorted-Weighted-Ensemble Model
TWM568442U (en) Cash flow grouping system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22891593

Country of ref document: EP

Kind code of ref document: A1