CN113934158A - Electric arc furnace modeling method based on improved random forest - Google Patents

Electric arc furnace modeling method based on improved random forest Download PDF

Info

Publication number
CN113934158A
CN113934158A CN202111223042.8A CN202111223042A CN113934158A CN 113934158 A CN113934158 A CN 113934158A CN 202111223042 A CN202111223042 A CN 202111223042A CN 113934158 A CN113934158 A CN 113934158A
Authority
CN
China
Prior art keywords
arc furnace
electric arc
random forest
model
electric
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111223042.8A
Other languages
Chinese (zh)
Inventor
王青松
李思唯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southeast University
Original Assignee
Southeast University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southeast University filed Critical Southeast University
Priority to CN202111223042.8A priority Critical patent/CN113934158A/en
Publication of CN113934158A publication Critical patent/CN113934158A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B17/00Systems involving the use of models or simulators of said systems
    • G05B17/02Systems involving the use of models or simulators of said systems electric

Abstract

The invention relates to the field of electrical engineering, and discloses an electric arc furnace modeling method based on an improved random forest. The invention learns the Stacking algorithm and combines through a secondary learner. Compared with a model established by a single decision tree, the electric arc furnace model based on the improved random forest has extremely high accuracy, due to the introduction of randomness, the model is not easy to be over-fitted and has good anti-noise capability, the model can process high-dimensional data and is not used as feature selection, the workload of data preprocessing can be greatly reduced, and the robustness is further improved due to the introduction of a Stacking method in the model. In conclusion, the electric arc furnace modeling method based on the improved random forest has excellent generalization capability, can reflect the influence of various factors in electric arc furnace production application on the energy consumption, and can provide a good reference for modeling other types of industrial loads.

Description

Electric arc furnace modeling method based on improved random forest
Technical Field
The invention relates to the field of electrical engineering, in particular to an electric arc furnace modeling method based on improved random forest.
Background
With the rapid development of national economy and science and technology, novel loads such as electric vehicles and energy storage are continuously emerging, and great challenges are provided for the supply and demand balance of a power system. Faced with new development problems, the traditional "source-follow-up" regulation mode has not been able to satisfy the new power utilization form which is variable and increasing. In order to solve the problem, the nation emphasizes the enhancement of demand-side management and the improvement of load regulation and control capability. Among a plurality of power loads, the industrial load has large capacity, is distributed intensively and has larger regulation potential. Because a quantitative analysis model is not available for different types of industrial loads, the existing batch load control method is simple and rough and is easy to cause serious influence on the production society. Meanwhile, the lack of planning means for industrial load to participate in power grid interaction is caused, and the regulation and control potential is difficult to fully exploit.
With the development of artificial intelligence, machine learning and deep learning are widely combined in various fields, especially the electrical field, and are combined with the traditional technical method in the electrical field, so that some problems which are difficult to solve in the past are properly treated. The invention provides an electric arc furnace modeling method based on an improved random forest to reflect the relation between electric quantity characteristics in electric arc furnace production application and non-electric quantity production factors Shu electric arc furnace energy consumption by combining an ensemble learning method in machine learning. The modeling method has good generalization capability and can provide a good reference for modeling other types of industrial loads.
Disclosure of Invention
In order to solve the above mentioned disadvantages in the background art, the present invention provides an arc furnace modeling method based on improved random forest
The purpose of the invention can be realized by the following technical scheme:
an electric arc furnace modeling method based on improved random forests, comprising the following steps:
step 1: researching electric and non-electric influence factors of the electric arc furnace and collecting data of the factors;
step 2: cleaning and dividing the collected original data;
and step 3: and (4) building an electric arc furnace model based on the improved random forest.
Further, the electric and non-electric influencing factors of the electric arc furnace in the step 1 comprise: voltage and current of electric arc furnace load, capacity of electric furnace transformer, oxygen blowing amount, energy recovery rate and raw material structure.
Further, the method for cleaning and dividing the collected original data comprises the following steps:
step 2.1: adopting dropna in pandas to carry out deletion value filtration;
step 2.2: for abnormal values in the data, the collected raw data is preprocessed by Local Outlier Factor (LOF)
Further, the LOF method is a high-precision outlier detection method based on density, and each data is assigned with an outlier factor LOF depending on the neighborhood density, and if LOF >1, the data point is an abnormal point, and if LOF >1, the data point is a normal data point.
Further, the electric arc furnace model building method based on the improved random forest comprises the following steps:
step 3.1: randomly dividing an initial training set D into k sets D with the same size1、D2、.....DkLet DjFor the jth test set, the test set,
Figure BDA0003313333010000021
is a corresponding training set;
step 3.2: by using
Figure BDA0003313333010000022
Training T primary learners;
step 3.3: testing with jth test set, for test set DjEach sample x in (1)iVector (z) composed of output results of T primary learnersi1,zi2,...,ziT) Forms part of the secondary learner input;
step 3.4: repeating the step 3.2 and the step 3.3 to generate a secondary training set;
step 3.5: training a secondary learner by utilizing a secondary training set;
step 3.6: and (5) testing the model.
Further, the method for testing the model comprises the following steps:
step 3.6.1: testing the trained model on a test set, wherein the test set covers 120 groups of data, and the content comprises voltage and current of an electric arc furnace load, capacity of an electric furnace transformer, oxygen blowing amount, energy recovery rate and a raw material structure;
step 3.6.2: taking the test set as the input of the model obtained by training to obtain the energy consumption estimated value of the electric arc furnace as an output structure;
step 3.6.3: and measuring the difference between the output of the model and an actual value by adopting the root mean square error, and comparing the difference with the test results of a decision tree, a random forest and an extreme random forest.
The invention has the beneficial effects that:
1. the decision tree is used as the primary learner, and data sample disturbance and input attribute disturbance are introduced in the training of the primary learner, so that the diversity of the primary learner is greatly improved, and the integration effect of the model is improved. And due to the introduction of randomness, the model is not easy to over-fit and has good anti-noise capability.
2. The invention adopts an integrated learning method, and compared with a model formed by a single decision tree, the proposed model has higher accuracy.
3. The combination module of the invention is a secondary learner, the learning algorithm is multi-response linear regression, and meanwhile, in order to prevent overfitting, a cross validation mode is adopted in the training stage of the primary learner, and the design enables the robustness of the model to be improved.
4. The electric arc furnace modeling method based on the improved random forest has excellent generalization capability and can provide a good reference for modeling other types of industrial loads.
Drawings
The invention will be further described with reference to the accompanying drawings.
FIG. 1 is a schematic diagram of the electric arc furnace model learning based on the improved random forest according to the present invention;
FIG. 2 is a flow chart of an electric arc furnace modeling method based on improved random forest according to the present invention;
FIG. 3 is an ensemble learning algorithm of the proposed model of the present invention;
FIG. 4 is a test result of the improved random forest model in the test set, where the solid line part is the actual value and the dotted line part is the model result;
FIG. 5 is a test result of a decision tree on a test set.
FIG. 6 is a test result of a random forest on a test set.
FIG. 7 is the test results of an extreme random forest on the test set.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As shown in fig. 1 to 3, an electric arc furnace modeling method based on improved random forest includes the following steps:
step 1: researching electric and non-electric influence factors of the electric arc furnace and collecting data of the factors;
the electric arc furnace is mainly applied to steel smelting production, and steel and alloy with various components are smelted by melting ores and metals at high temperature generated by electrode arc. The power consumption is huge and generally accounts for more than 50% of the total power consumption of an enterprise. In steel smelting, there are many factors affecting the energy consumption of the electric arc furnace, and the main electrical and non-electrical affecting factors include the voltage and current of the electric arc furnace load, the capacity of the electric furnace transformer, the oxygen blowing amount, the energy recovery rate and the raw material structure without considering human factors and production schemes. The electric arc furnace steel making uses scrap steel as main raw material and pig iron, direct reduced iron and molten iron as auxiliary materials, and the proportion of different raw materials has different influences on the power consumption. In which the ratio of molten iron to pig iron is inversely related to energy consumption, and an increase in the proportion of direct reduced iron leads to an increase in the energy consumption of the electric arc furnace.
Step 2: cleaning and dividing the collected original data;
because the initial data has many attributes and large sample size, including a considerable number of missing values and abnormal values, data cleaning is required before modeling. Aiming at the missing value, due to the large sample amount, dropna is directly adopted for missing value filtering, and abnormal points in the data set are removed by a local outlier detection method. The final training set D contained 1300 sets of data and the test set T contained 120 sets of data.
And step 3: and (4) building an electric arc furnace model based on the improved random forest.
The electric and non-electric influencing factors of the electric arc furnace in the step 1 comprise: voltage and current of electric arc furnace load, capacity of electric furnace transformer, oxygen blowing amount, energy recovery rate and raw material structure.
The method for cleaning and dividing the collected original data comprises the following steps:
step 2.1: adopting dropna in pandas to carry out deletion value filtration;
step 2.2: for abnormal values in the data, the collected raw data is preprocessed by Local Outlier Factor (LOF)
The LOF method is a high-precision outlier detection method based on density, each data is allocated with an outlier LOF depending on neighborhood density, if LOF >1, the data point is an abnormal point, and if LOF is close to 1, the data point is a normal data point.
According to the method, an electric arc furnace model is improved and constructed on the basis of a common random forest, and in order to enhance the diversity of the primary learner, when each node selects the division attribute, a random selection method is adopted, so that compared with the traditional random forest, the diversity of the obtained primary learner is stronger, and the final integration effect is better. In the combination strategy of the model, the invention learns the Stacking algorithm, the output of the primary learner forms a secondary training set, the secondary learner is used for combination, and the secondary learner adopts multi-response linear regression as the learning algorithm. Therefore, the model building flow is as follows:
step 3.1: randomly dividing an initial training set D into k sets D with the same size1、D2、.....DkLet DjFor the jth test set, the test set,
Figure BDA0003313333010000051
is a corresponding training set;
step 3.2: by using
Figure BDA0003313333010000061
Training T primary learners;
step 3.3: testing with jth test set, for test set DjEach sample x in (1)iVector (z) composed of output results of T primary learnersi1,zi2,...,ziT) Forms part of the secondary learner input;
step 3.4: repeating the step 3.2 and the step 3.3 to generate a secondary training set;
step 3.5: training a secondary learner by utilizing a secondary training set;
step 3.6: and (5) testing the model.
The model testing method comprises the following steps:
step 3.6.1: testing the trained model on a test set, wherein the test set covers 120 groups of data, and the content comprises voltage and current of an electric arc furnace load, capacity of an electric furnace transformer, oxygen blowing amount, energy recovery rate and a raw material structure;
step 3.6.2: taking the test set as the input of the model obtained by training to obtain the energy consumption estimated value of the electric arc furnace as an output structure;
step 3.6.3: and measuring the difference between the output of the model and an actual value by adopting the root mean square error, and comparing the difference with the test results of a decision tree, a random forest and an extreme random forest.
The test results are shown in FIGS. 4-7, in which the solid line part is the actual value and the dashed line part is the model result. The following table shows the root mean square error of the test results of each model, and the model provided by the invention has the highest accuracy and is superior to a single decision tree, a traditional random forest and an extreme random forest by comparison.
Figure BDA0003313333010000062
The foregoing shows and describes the general principles, essential features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed.

Claims (6)

1. An electric arc furnace modeling method based on improved random forests is characterized by comprising the following steps:
step 1: researching electric and non-electric influence factors of the electric arc furnace and collecting data of the factors;
step 2: cleaning and dividing the collected original data;
and step 3: and (4) building an electric arc furnace model based on the improved random forest.
2. The method as claimed in claim 1, wherein the electric arc furnace modeling method based on the improved random forest is characterized in that the electric and non-electric influence factors of the electric arc furnace in the step 1 comprise: voltage and current of electric arc furnace load, capacity of electric furnace transformer, oxygen blowing amount, energy recovery rate and raw material structure.
3. The electric arc furnace modeling method based on the improved random forest as claimed in claim 1, wherein the method for cleaning and dividing the collected raw data comprises the following steps:
step 2.1: adopting dropna in pandas to carry out deletion value filtration;
step 2.2: for abnormal values in the data, the collected raw data is preprocessed by Local Outlier Factor detection (LOF).
4. The method of claim 3, wherein the LOF method is a density-based high-precision outlier detection method, and each datum is assigned an outlier LOF that depends on the neighborhood density if LOF >1 and a normal datum if LOF is close to 1.
5. The electric arc furnace modeling method based on the improved random forest as claimed in claim 1, wherein the electric arc furnace model building step based on the improved random forest is as follows:
step 3.1: randomly dividing an initial training set D into k sets D with the same size1、D2、.....DkLet DjFor the jth test set, the test set,
Figure FDA0003313332000000011
is a corresponding training set;
step 3.2: by using
Figure FDA0003313332000000012
Training T primary learners;
step 3.3: testing with jth test set, for test set DjEach sample x in (1)iVector (z) composed of output results of T primary learnersi1,zi2,...,ziT) Forms part of the secondary learner input;
step 3.4: repeating the step 3.2 and the step 3.3 to generate a secondary training set;
step 3.5: training a secondary learner by utilizing a secondary training set;
step 3.6: and (5) testing the model.
6. The method for electric arc furnace modeling based on improved random forest as claimed in claim 5, wherein the method for model testing comprises the steps of:
step 3.6.1: testing the trained model on a test set, wherein the test set covers 120 groups of data, and the content comprises voltage and current of an electric arc furnace load, capacity of an electric furnace transformer, oxygen blowing amount, energy recovery rate and a raw material structure;
step 3.6.2: taking the test set as the input of the model obtained by training to obtain the energy consumption estimated value of the electric arc furnace as an output structure;
step 3.6.3: and measuring the difference between the output of the model and an actual value by adopting the root mean square error, and comparing the difference with the test results of a decision tree, a random forest and an extreme random forest.
CN202111223042.8A 2021-10-20 2021-10-20 Electric arc furnace modeling method based on improved random forest Pending CN113934158A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111223042.8A CN113934158A (en) 2021-10-20 2021-10-20 Electric arc furnace modeling method based on improved random forest

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111223042.8A CN113934158A (en) 2021-10-20 2021-10-20 Electric arc furnace modeling method based on improved random forest

Publications (1)

Publication Number Publication Date
CN113934158A true CN113934158A (en) 2022-01-14

Family

ID=79280882

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111223042.8A Pending CN113934158A (en) 2021-10-20 2021-10-20 Electric arc furnace modeling method based on improved random forest

Country Status (1)

Country Link
CN (1) CN113934158A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106019093A (en) * 2016-05-15 2016-10-12 北华大学 Online soft measurement method for three-phase arc furnace arc length
CN109446189A (en) * 2018-10-31 2019-03-08 成都天衡智造科技有限公司 A kind of technological parameter outlier detection system and method
CN110503251A (en) * 2019-08-12 2019-11-26 江苏方天电力技术有限公司 A kind of non-festivals or holidays load forecasting method based on Stacking algorithm
CN111126423A (en) * 2018-11-01 2020-05-08 百度在线网络技术(北京)有限公司 Feature set acquisition method and device, computer equipment and medium
CN111157899A (en) * 2020-01-20 2020-05-15 南京邮电大学 Method for estimating SOC of battery based on model fusion idea
US20200279025A1 (en) * 2019-02-28 2020-09-03 Kalibrate Technologies Limited Machine-learned model selection network planning

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106019093A (en) * 2016-05-15 2016-10-12 北华大学 Online soft measurement method for three-phase arc furnace arc length
CN109446189A (en) * 2018-10-31 2019-03-08 成都天衡智造科技有限公司 A kind of technological parameter outlier detection system and method
CN111126423A (en) * 2018-11-01 2020-05-08 百度在线网络技术(北京)有限公司 Feature set acquisition method and device, computer equipment and medium
US20200279025A1 (en) * 2019-02-28 2020-09-03 Kalibrate Technologies Limited Machine-learned model selection network planning
CN110503251A (en) * 2019-08-12 2019-11-26 江苏方天电力技术有限公司 A kind of non-festivals or holidays load forecasting method based on Stacking algorithm
CN111157899A (en) * 2020-01-20 2020-05-15 南京邮电大学 Method for estimating SOC of battery based on model fusion idea

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李正光 等: "基于多维特征量融合的配电网拓扑异常溯源与应用模型研究", 浙江电力, vol. 39, no. 7 *

Similar Documents

Publication Publication Date Title
Zhang et al. Multi-objective optimal reactive power dispatch of power systems by combining classification-based multi-objective evolutionary algorithm and integrated decision making
Arya et al. Fuzzy gain scheduling controllers for automatic generation control of two-area interconnected electrical power systems
CN110112767B (en) Load source optimization control method for peak regulation of wide-area polymorphic demand side load participation system
Wu et al. Source-network-storage joint planning considering energy storage systems and wind power integration
Zhang et al. MOEA/D with many-stage dynamical resource allocation strategy to solution of many-objective OPF problems
CN114065483A (en) Hydrogen production system efficiency optimization method and device, computer equipment and storage medium
Sahoo et al. Self-adaptive fuzzy-PID controller for AGC study in deregulated Power System
CN109934476B (en) Micro-grid source-storage joint planning multi-strategy evolution game analysis method based on subject limited rational decision
He et al. Biobjective optimization-based frequency regulation of power grids with high-participated renewable energy and energy storage systems
Cao et al. Variable universe fuzzy expert system for aluminum electrolysis
CN113934158A (en) Electric arc furnace modeling method based on improved random forest
Agrawal et al. Optimal location of static VAR compensator using evolutionary optimization techniques
CN108734349A (en) Distributed generation resource addressing constant volume optimization method based on improved adaptive GA-IAGA and system
CN108718084B (en) Power supply and power grid coordination planning method suitable for electric power market reformation
CN111049151A (en) NSGA 2-based two-stage optimization algorithm for power distribution network voltage
Ageev et al. Application of the genetic algorithm during electric network mode optimization
Qin et al. Coot algorithm for optimal carbon–energy combined flow of power grid with aluminum plants
CN116526496B (en) Novel auxiliary decision-making method for power system load control
CN116485042B (en) Method and device for optimizing park energy system operation based on load clustering
Zhu et al. Many-objective power flow optimization problems based on an improved MOEA/D with dynamical resource allocation strategy
Liu et al. Enterprise financial risk prevention and control and data analysis method based on blockchain technology
CN116127345B (en) Converter steelmaking process mode design method based on deep clustering generation countermeasure network
Borujeni et al. Fuel cell voltage control using neural network based on model predictive control
CN111666666A (en) Multi-objective optimized scheduling method for thermal load of biomass thermal power plant unit
Sun et al. A Bcs-Gde Multi-Objective Optimization Algorithm for Cchp Model with Decision Strategies

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination