CN115034437A - Hot rolled plate convexity prediction method based on improved XGboost - Google Patents

Hot rolled plate convexity prediction method based on improved XGboost Download PDF

Info

Publication number
CN115034437A
CN115034437A CN202210515319.2A CN202210515319A CN115034437A CN 115034437 A CN115034437 A CN 115034437A CN 202210515319 A CN202210515319 A CN 202210515319A CN 115034437 A CN115034437 A CN 115034437A
Authority
CN
China
Prior art keywords
convexity
strip steel
data
value
xgboost
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210515319.2A
Other languages
Chinese (zh)
Inventor
陈树宗
白佳丽
华长春
李旭
孙杰
姬亚锋
王鹏飞
张欣
张殿华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yanshan University
Original Assignee
Yanshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yanshan University filed Critical Yanshan University
Priority to CN202210515319.2A priority Critical patent/CN115034437A/en
Publication of CN115034437A publication Critical patent/CN115034437A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B21MECHANICAL METAL-WORKING WITHOUT ESSENTIALLY REMOVING MATERIAL; PUNCHING METAL
    • B21BROLLING OF METAL
    • B21B37/00Control devices or methods specially adapted for metal-rolling mills or the work produced thereby
    • B21B37/28Control of flatness or profile during rolling of strip, sheets or plates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • General Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Primary Health Care (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Manufacturing & Machinery (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mechanical Engineering (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a hot rolled plate convexity prediction method based on improved XGboost, which comprises the following steps: step 1: collecting production data of strip steel; step 2: preprocessing the collected production data, including removing abnormal values of the convexity of the rolled strip steel of F6, and carrying out normalization and Kmeans clustering on all the strip steel production data; and step 3: establishing an XGboost model by taking the convexity of the rolled strip steel of F6 as an output value and the production data of other strip steels as input values; and 4, step 4: the XGboost model is subjected to super-parameter optimization by adopting a whale optimization algorithm to obtain the optimal parameters, and the final XGboost model is determined.

Description

Hot rolled plate convexity prediction method based on improved XGboost
Technical Field
The invention relates to a hot rolled plate convexity prediction method based on improved XGboost, and belongs to the technical field of automatic control of a rolling process.
Background
As one of the important indexes of the strip shape quality of hot rolled strip steel products, the risk of the subsequent processing process is caused by the unsatisfactory plate convexity precision, so that the final products are unqualified and the raw materials are wasted. The plate convexity, also called as the transverse thickness difference of the strip steel, refers to the thickness difference of the plate strip along the width direction, reflects the section shape after the strip steel is rolled, and the calculation formula can be expressed as follows:
Figure BDA0003639280140000011
wherein C' represents the strip crown, h c Represents the thickness h at the center point of the cross section of the strip width L The thickness h of a left reference point on the cross section of the strip width R The thickness of the reference point on the right side of the cross section of the strip width is shown.
For hot continuous rolling strip steel, factors influencing the strip steel outlet plate convexity comprise the grinding convexity of a roller, the abrasion convexity and thermal expansion convexity of the roller in the rolling process, the roll shifting amount, the roll bending force and the like, which are mutually coupled, and the traditional hot rolling plate convexity prediction is that the establishment of a mechanism model in the hot rolling process often needs to meet certain assumed conditions due to the complexity of the process flow, for example, the boundary conditions established by a differential equation in an analytic model and the determination of material parameters in the finite element model establishing process are simplified, so that the model has certain limitations, the improvement of the precision of a plate convexity prediction model is very unfavorable, and a method for predicting the plate convexity in the hot continuous rolling process by using industrial large data is urgently needed.
In recent years, an artificial neural network has been widely used in many fields as a machine learning method which is widely accepted and used, and among them, it is also used for predicting the plate crown to obtain a good prediction effect.
Disclosure of Invention
The invention provides a hot rolled plate convexity prediction method based on improved XGboost, wherein in the rolling process, according to actually measured rolling data of a production line, an XGboost algorithm with high model prediction accuracy and high prediction efficiency is adopted to construct and optimize a model, so that the problems of low efficiency and large deviation of the existing plate convexity prediction technology are solved, and important support is provided for implementation of a subsequent plate shape control strategy.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
a hot rolled plate convexity prediction method based on improved XGboost comprises the following steps:
step 1: collecting production data of strip steel, including the thickness of an intermediate blank, the width of the rolled strip steel, the thickness of the rolled strip steel, the convexity of the rolled strip steel from F1 to F6, the thickness of a finish rolling inlet, the finish rolling temperature, the rolling force of F1 to F6 stands, the bending force of F1 to F6 stands and the roll shifting amount of F1 to F6 stands;
and 2, step: preprocessing the collected production data, including removing abnormal values of the convexity of the rolled strip steel of F6, and carrying out normalization and Kmeans clustering on all the strip steel production data;
and 3, step 3: establishing an XGboost model by taking the convexity of the rolled strip steel of F6 as an output value and the production data of other strip steels as input values;
and 4, step 4: and (4) carrying out super-parameter optimization on the XGboost model by adopting a whale optimization algorithm to obtain the optimal parameters, and determining the final XGboost model.
The technical scheme of the invention is further improved as follows: the specific method for preprocessing in the step 2 comprises the following steps:
step 2.1: calculating the standard deviation S of the convexity of the F6 rolled strip steel y And average value
Figure BDA0003639280140000021
Step 2.2: screening abnormal values of the rolled strip steel convexity data of F6 by adopting a Laplace criterion, and removing the abnormal values, wherein the Laplace criterion calculation formula is as follows:
Figure BDA0003639280140000022
wherein, y i The data of the convexity of the rolled strip steel of the ith F6,
Figure BDA0003639280140000031
is the average value of the convexity S of the rolled strip steel of F6 y The standard deviation of the convexity of the rolled strip steel is F6;
step 2.3: normalization processing is carried out on the rolled strip steel convexity and other strip steel production data of the F6 after the abnormal value is removed, and the calculation formula of the normalization processing is as follows:
Figure BDA0003639280140000032
wherein, x' i Is a normalized value, x i Is an initial value, x max 、x min Respectively, a maximum value and a minimum value in the data set;
step 2.4: performing Kmeans clustering on the data after the normalization processing, wherein the Kmeans clustering comprises the following steps:
step 2.4.1: selecting cluster center, μ 1 (0) ,μ 2 (0) ,μ 3 (0) ,···,μ k (0)
Step 2.4.2: defining a loss function:
Figure BDA0003639280140000033
wherein, c i Represents x' i In cluster, μ ci Representing the center of the cluster, and M represents the number of data after removing the abnormal value;
step 2.4.3: setting an iteration step number t;
step 2.4.4: for each set of data x' i It is assigned to the point closest to it:
c i t =argmin k ||x′ ik t ||;
wherein, c i t Denotes x 'in the t iteration process' i In cluster, μ k t Representing the cluster center of the t-th iteration process;
step 2.4.5: and updating the cluster center for each cluster of data:
Figure BDA0003639280140000034
where b represents the data amount of k clusters, μ k t+1 Representing the updated cluster center;
step 2.4.6: repeating steps 2.4.4 and 2.4.5 until J converges to a minimum value;
step 2.5: and (3) carrying out clustering processing on the data according to the following steps of 7: a ratio of 3 is divided into a training set and a test set.
The technical scheme of the invention is further improved as follows: average value in said step 2.1
Figure BDA0003639280140000043
The calculation formula of (2) is as follows:
Figure BDA0003639280140000041
where m is the number of production data, y i The value of the i-th rolled strip convexity is F6.
The technical scheme of the invention is further improved as follows: standard deviation S in said step 2.1 y The calculation formula of (c) is:
Figure BDA0003639280140000042
where m is the number of production data, y i The value of the i-th rolled strip convexity is F6.
The technical scheme of the invention is further improved as follows: after the XGboost model is established in the step 3, model prediction is carried out by adopting default parameter combination; the default parameters include the maximum depth max _ depth of the tree, the minimum loss function reduction value gamma of the leaf node creation branch, the feature sampling proportion colsample _ byte of the tree, the sum min _ child _ weight of the minimum weight of the leaf node, the learning rate eta, and the subsampling proportion subsample of the training sample.
The technical scheme of the invention is further improved as follows: the specific method of the whale optimization algorithm in the step 4 comprises the following steps:
step 4.1: setting related parameters of a whale optimization algorithm, setting a constant b for adjusting the logarithmic spiral shape to be 1, and setting l as a random number between intervals of < -1,1 >;
step 4.2: setting the number N of whales and the maximum iteration number t max Initializing position information, wherein the position of each whale individual represents a parameter combination of an XGboost model;
step 4.3: designing an adaptability function, adopting MSE as a performance index, substituting the performance index into the XGboost model, calculating the adaptability of each whale individual in the population, and reserving the current optimal whale individual;
step 4.4: calculating parameters a and p and coefficient vectors A and C, judging whether p is less than 0.5, if so, performing the step 4.5, otherwise, adopting the bubble net to prey, and updating the trapping position of the bubble net as follows:
X(t+1)=D′×e bl ×cos(2πl)+X * (t)
D′=|X * (t)-X(t)|
wherein X * (t) is the position of the current optimal solution, X (t) is the current search individual position, b is the spiral shape parameter;
step 4.5: judging whether the absolute value of the coefficient vector A is smaller than 1, if so, surrounding a prey, otherwise, carrying out global search, and updating the position as follows:
D=|C·X′(t)-X(t)|
X(t+1)=X′(t)-A·D
wherein A is 2a × r 1 -a,C=2×r 2 A is linearly reduced from 2 to 0, r in the iterative process 1 And r 2 Is [0,1]]X' (t) represents the position vector of the current optimal solution when surrounding a prey, and represents the position vector of a random individual when searching globally;
step 4.6: after the position updating is finished, calculating the fitness of each whale individual, and updating the optimal whale individual;
step 4.7: judging whether the maximum iteration times is reached, if so, ending to obtain an optimal solution, otherwise, returning to the step 4.3;
step 4.8: outputting an optimal parameter combination to obtain optimal parameters of the XGboost model;
step 4.9: and substituting the test sample into the determined model, and evaluating the accuracy of the model.
Due to the adoption of the technical scheme, the invention has the technical progress that:
the invention provides a hot rolled plate convexity prediction method based on improved XGboost, which is characterized in that a whale optimization algorithm is utilized to perform super-parameter optimization on an XGboost model, plate convexity prediction is realized, prediction accuracy is improved, and the defects that the traditional hot rolled plate convexity prediction depends on the principle that the assumed conditions of a model are easy to change and the prediction accuracy is low are overcome.
Meanwhile, the convexity standards of the plates produced on the current hot rolling site are numerous, and if the plates are put into the same model to be trained, a model with accurate data is difficult to obtain, so that the data is mined according to different target convexities, the data is processed by adopting a Kmeans method, each type of the data is modeled, the prediction precision of the model is improved, and the method is effectively applied to the hot rolling site.
In addition, the method provided by the invention adopts hot rolling production data, effectively processes the data, can be put into use through computer programming and has low cost.
Drawings
FIG. 1 is a schematic diagram of a hot rolling line serviced by the present invention;
FIG. 2 is a flow chart of a hot rolled plate convexity prediction method based on the improved XGboost of the invention;
fig. 3a and 3b are scatter-fit plots of cluster 0 and cluster 1 model predicted and measured values, respectively.
Detailed Description
The present invention will be described in further detail with reference to the following examples:
as shown in FIG. 1, the present invention is applied to a hot rolling line for predicting the crown of a sheet (i.e., the crown of a rolled strip of F6), and combines the prediction method with the hot rolling line. As shown in fig. 2, a flow chart of a hot-rolled plate convexity prediction method based on improved XGBoost includes the following specific steps:
step 1: the production data of the strip steel are collected, and the production data comprises coil number, production time, intermediate billet thickness, rolled strip width, rolled strip convexity of F1-F6, rolled strip convexity, finish rolling inlet temperature, finish rolling temperature, rolling force of F1-F6 stands, bending force of F1-F6 stands, roll shifting amount of F1-F6 stands and rolling speed of F1-F6 stands.
In this example, production data of a certain steel mill are collected, 2797 groups of data are counted, and part of sample data is shown in table 1. And selecting 34 characteristics with the largest influence on the crown of the plate, wherein the characteristics comprise the thickness of an intermediate blank, the width of the rolled strip steel, the thickness of the rolled strip steel, the crown of the rolled strip steel of F1-F5, the inlet temperature of finish rolling, the finish rolling temperature, the rolling force of F1-F6 stands, the rolling speed of F1-F6 stands, the bending force of F1-F6 stands and the roll shifting amount of F1-F6 stands. And taking the convexity of the rolled strip steel of F6 as an output value.
TABLE 1 partial sample data of a certain rolling mill
Figure BDA0003639280140000071
Step 2: preprocessing the acquired data, including removing abnormal values of the convexity of the rolled strip steel of F6, and normalizing and Kmeans clustering all the strip steel production data, and the specific steps are as follows:
step 2.1: and calculating the standard deviation and the average value of the convexity of the rolled strip steel of F6.
In this embodiment, the formula for calculating the mean and standard deviation is as follows:
Figure BDA0003639280140000072
Figure BDA0003639280140000073
wherein m is the number of production data, y i Is the ith production data.
Step 2.2: the method comprises the following steps of collecting rolled strip convexity data of F6 at a hot rolling site, screening abnormal values (abnormal values meeting the Lauder criterion) of the rolled strip convexity data of F6 by adopting the Lauder criterion, and removing the abnormal values, wherein the Lauder criterion calculation formula is as follows:
Figure BDA0003639280140000081
wherein x is i For the (i) -th production data,
Figure BDA0003639280140000085
average value of production data, S y Standard deviation of production data.
In this example, a total of 12 data points are culled.
Step 2.3: normalization processing is carried out on the rolled strip steel convexity and other strip steel production data of the F6 after the abnormal value is removed, and the calculation formula of the normalization processing is as follows:
Figure BDA0003639280140000082
wherein, x' i Is a normalized value, x i Is an initial value, x max 、x min Respectively, a maximum value and a minimum value in the data set;
step 2.4: performing Kmeans clustering processing on the data after the normalization processing, wherein the Kmeans clustering step comprises the following steps:
step 2.4.1: determining the number of clusters by combining an elbow method and a contour coefficient method, wherein the number of the clusters is 2 in the embodiment;
step 2.4.2: selecting cluster center, μ 1 (0) ,μ 2 (0)
Step 2.4.3: defining a loss function:
Figure BDA0003639280140000083
wherein, c i Represents x' i The cluster is located in the position of the cluster,
Figure BDA0003639280140000084
representing the cluster center;
step 2.4.4: setting the maximum iteration step number T equal to 20;
step 2.4.5: for each set of data x' i It is assigned to the point closest to it:
c i t =argmin k ||x′ ik t ||;
wherein, c i t Denotes x 'in the t iteration process' i In cluster, μ k t Representing the cluster center of the t-th iteration process; step 2.4.6: and updating the cluster center for each cluster of data:
Figure BDA0003639280140000091
where b represents the data amount of k clusters, μ k t+1 Representing the updated cluster center;
step 2.4.7: repeating steps 2.4.5 and 2.4.6 until J converges to a minimum value;
in this embodiment, after the Kmeans clustering process, the original data set is divided into two data sets of 0 class and 1 class, which respectively include 1527 groups and 1258 groups of data.
Step 2.5: and (3) respectively carrying out 7: a ratio of 3 is divided into a training set and a test set.
And step 3: based on the preprocessed data, an XGboost model is established, and model prediction is carried out by adopting default parameter combination, wherein the default parameters comprise the maximum depth max _ depth of the tree, the minimum loss function descending value gamma of a leaf node creation branch, the feature sampling proportion colsample _ byte of the tree, the sum min _ child _ weight of the minimum weight of the leaf node, the learning rate eta and the sub-sampling proportion subsample of a training sample.
And 4, step 4: optimizing the XGboost model by adopting a whale optimization algorithm, and determining the hyper-parameters of the model, wherein the specific method comprises the following steps:
step 4.1: setting related parameters of a whale optimization algorithm, setting a constant b of a logarithmic spiral shape to be 1, and setting l as a random number between intervals of < -1 > and 1;
step 4.2: generating a population with the whale number N of 30, wherein the position of each whale individual represents a parameter combination of an XGboost model, and the maximum iteration time t max Initializing position information to 10, and determining the maximum depth max _ depth of the parameter value range tree to be [5, 50%]The leaf node creates a branch minimum loss function with a reduced value gamma of 0.001,100]The tree's characteristic sample ratio colsample _ byte is [0.5,0.9 ]]The sum of the minimum weights of the leaf nodes min _ child _ weight is [5, 50%]And the learning rate eta is [0.01,0.2 ]]The subsample ratio of the training sample is [0.5,0.9 ]]。
Step 4.3: designing an adaptability function, substituting a prediction Mean Square Error (MSE) of the convexity of the hot rolled plate as a performance index into an XGboost model, calculating the adaptability of each individual whale in the population, and reserving the current optimal individual whale, wherein the adaptability function calculation formula is as follows:
Figure BDA0003639280140000101
step 4.4: judging whether p is less than 0.5, wherein p is the probability of a predation mechanism and is a random number of a value range [0,1], if the p is less than 0.5, performing the step 4.5, otherwise, predating by adopting a bubble net, and updating the predation position of the bubble net as follows:
X(t+1)=D′×e bl ×cos(2πl)+X * (t)
D′=|X * (t)-X(t)|
wherein X * (t) is the current best solution position, X (t) is the current search individual position, b is the spiral shape parameter, l is [ -1,1]A random number in between;
step 4.5: and gradually reducing the parameter A and the convergence factor a along with the increase of the iteration times t, and if the absolute value of A is less than 1, gradually surrounding the current optimal solution by each whale individual and entering a local optimization stage. Judging whether the absolute value of the coefficient vector A is smaller than 1, if so, surrounding the prey, otherwise, searching the individual to be close to the random individual for global search, and updating the position as follows:
D=|C·X′(t)-X(t)|
X(t+1)=X′(t)-A·D
wherein A is 2a × r 1 -a,C=2×r 2 A is linearly reduced from 2 to 0, r in the iterative process 1 And r 2 Is [0,1]]X' (t) represents the position vector of the current optimal solution when surrounding a prey, and represents the position vector of a random individual when searching globally;
step 4.6: after the position updating is finished, calculating the fitness of each whale individual, and updating the optimal whale individual;
step 4.7: and judging whether the maximum iteration times is reached, if so, ending to obtain the optimal solution, otherwise, returning to the step 4.3.
Step 4.8: and outputting an optimal parameter combination, wherein the maximum depth max _ depth of an optimal parameter tree of the XGboost model is 29, the drop value gamma of a leaf node creation branch minimum loss function is 20.0, the feature sampling proportion colsample _ byte of the tree is 0.9, the sum min _ child _ weight of the leaf node minimum weight is 17.64, the learning rate eta is 0.2, and the training sample sub-sampling proportion subsample is 0.9.
Step 4.9: and substituting the parameter values into the XGboost model for training, traversing the whale population scale and the maximum iteration times within [10,50], and finally determining the whale population scale N to be 25 and the maximum iteration times to be 10.
According to the requirements of the current industrial production, the actual convexity value is regarded as qualified within the range of the set value +/-10 mu m, and more than 98.45 percent of products meet the standard through a scattered point fitting graph of the model predicted value and the measured value shown in figure 3, so that the method can be effectively applied to the current industrial production.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, and not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those skilled in the art; the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; such modifications and substitutions do not depart from the spirit and scope of the corresponding technical solutions as defined in the appended claims.

Claims (6)

1. A hot rolled plate convexity prediction method based on improved XGboost is characterized by comprising the following steps: the method comprises the following steps:
step 1: collecting production data of strip steel, including the thickness of an intermediate blank, the width of the rolled strip steel, the thickness of the rolled strip steel, the convexity of the rolled strip steel from F1 to F6, the thickness of a finish rolling inlet, the finish rolling temperature, the rolling force of F1 to F6 stands, the bending force of F1 to F6 stands and the roll shifting amount of F1 to F6 stands;
and 2, step: preprocessing the collected production data, including removing abnormal values of the convexity of the rolled strip steel of F6, and carrying out normalization and Kmeans clustering on all the strip steel production data;
and step 3: establishing an XGboost model by taking the convexity of the rolled strip steel of F6 as an output value and the production data of other strip steels as input values;
and 4, step 4: and (4) carrying out super-parameter optimization on the XGboost model by adopting a whale optimization algorithm to obtain the optimal parameters, and determining the final XGboost model.
2. The hot-rolled plate convexity prediction method based on the improved XGboost as claimed in claim 1, wherein the method comprises the following steps: the specific method for preprocessing in the step 2 comprises the following steps:
step 2.1: calculating the standard deviation S of the convexity of the rolled strip steel of F6 y And average value
Figure FDA0003639280130000011
Step 2.2: screening abnormal values of the rolled strip steel convexity data of F6 by adopting a Laplace criterion, and removing the abnormal values, wherein the Laplace criterion calculation formula is as follows:
Figure FDA0003639280130000012
wherein, y i The data of the convexity of the rolled strip steel of the ith F6,
Figure FDA0003639280130000013
is the average convexity of the rolled strip steel of F6S y The standard deviation of the convexity of the rolled strip steel is F6;
step 2.3: normalization processing is carried out on the rolled strip steel convexity and other strip steel production data of the F6 after the abnormal value is removed, and the calculation formula of the normalization processing is as follows:
Figure FDA0003639280130000014
wherein x is i ' is a normalized value, x i Is an initial value, x max 、x min Respectively, the maximum value in the data set anda minimum value;
step 2.4: performing Kmeans clustering processing on the data after the normalization processing, wherein the Kmeans clustering step comprises the following steps:
step 2.4.1: selecting cluster center, μ 1 (0) ,μ 2 (0) ,μ 3 (0) ,···,μ k (0)
Step 2.4.2: defining a loss function:
Figure FDA0003639280130000021
wherein, c i Represents x' i The cluster is located in the position of the cluster,
Figure FDA0003639280130000023
representing the center of the cluster, and M represents the number of data after removing the abnormal value;
step 2.4.3: setting iteration step number t;
step 2.4.4: for each set of data x' i It is assigned to the point closest to it:
c i t =arg min k ||x′ ik t ||;
wherein, c i t Denotes x 'in the process of the t iteration' i In cluster, μ k t Representing the cluster center of the t-th iteration process;
step 2.4.5: and updating the cluster center for each cluster of data:
Figure FDA0003639280130000022
where b represents the data amount of k clusters, μ k t+1 Representing the updated cluster center;
step 2.4.6: repeating steps 2.4.4 and 2.4.5 until J converges to a minimum value;
step 2.5: and (4) clustering the processed data according to the following steps of 7: a ratio of 3 is divided into a training set and a test set.
3. The hot-rolled plate convexity prediction method based on the improved XGboost as claimed in claim 2, characterized in that: the calculation formula of the average value y in the step 2.1 is as follows:
Figure FDA0003639280130000031
where m is the number of production data, y i The rolled strip crown value is the ith F6 rolled strip crown value.
4. The hot-rolled plate convexity prediction method based on the improved XGboost as claimed in claim 2, characterized in that: standard deviation S in said step 2.1 y The calculation formula of (2) is as follows:
Figure FDA0003639280130000032
where m is the number of production data, y i The value of the i-th rolled strip convexity is F6.
5. The hot-rolled plate convexity prediction method based on the improved XGboost as claimed in claim 1, wherein the method comprises the following steps: after the XGboost model is established in the step 3, model prediction is carried out by adopting default parameter combination; the default parameters include the maximum depth max _ depth of the tree, the minimum loss function reduction value gamma of the leaf node creation branch, the feature sampling proportion colsample _ byte of the tree, the sum min _ child _ weight of the minimum weight of the leaf node, the learning rate eta, and the subsampling proportion subsample of the training sample.
6. The hot-rolled plate convexity prediction method based on the improved XGboost as claimed in claim 1, wherein the method comprises the following steps: the specific method of the whale optimization algorithm in the step 4 comprises the following steps:
step 4.1: setting related parameters of a whale optimization algorithm, setting a constant b for adjusting the logarithmic spiral shape to be 1, and setting l as a random number between intervals of < -1,1 >;
step 4.2: setting the whale number N and the maximum iteration number t max Initializing position information, wherein the position of each whale individual represents a parameter combination of an XGboost model;
step 4.3: designing an adaptability function, adopting MSE as a performance index, substituting the performance index into the XGboost model, calculating the adaptability of each individual whale in the population, and reserving the current optimal individual whale;
step 4.4: calculating parameters a and p and coefficient vectors A and C, judging whether p is less than 0.5, if so, performing the step 4.5, otherwise, adopting the bubble net to prey, and updating the trapping position of the bubble net as follows:
X(t+1)=D′×e bl ×cos(2πl)+X * (t)
D′=|X * (t)-X(t)|
wherein X * (t) is the position of the current optimal solution, X (t) is the current search individual position, b is the spiral shape parameter;
step 4.5: and judging whether the absolute value of the coefficient vector A is less than 1, if so, enclosing a prey, otherwise, carrying out global search, and updating the position as follows:
D=|C·X′(t)-X(t)|
X(t+1)=X′(t)-A·D
wherein A is 2a × r 1 -a,C=2×r 2 A is linearly reduced from 2 to 0, r in the iterative process 1 And r 2 Is [0,1]]X' (t) represents the position vector of the current optimal solution when surrounding a prey, and represents the position vector of a random individual when searching globally;
step 4.6: after the position updating is finished, calculating the fitness of each whale individual, and updating the optimal whale individual;
step 4.7: judging whether the maximum iteration times is reached, if so, ending to obtain an optimal solution, otherwise, returning to the step 4.3;
step 4.8: outputting an optimal parameter combination to obtain optimal parameters of the XGboost model;
step 4.9: and substituting the test sample into the determined model, and evaluating the accuracy of the model.
CN202210515319.2A 2022-05-11 2022-05-11 Hot rolled plate convexity prediction method based on improved XGboost Pending CN115034437A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210515319.2A CN115034437A (en) 2022-05-11 2022-05-11 Hot rolled plate convexity prediction method based on improved XGboost

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210515319.2A CN115034437A (en) 2022-05-11 2022-05-11 Hot rolled plate convexity prediction method based on improved XGboost

Publications (1)

Publication Number Publication Date
CN115034437A true CN115034437A (en) 2022-09-09

Family

ID=83120582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210515319.2A Pending CN115034437A (en) 2022-05-11 2022-05-11 Hot rolled plate convexity prediction method based on improved XGboost

Country Status (1)

Country Link
CN (1) CN115034437A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117753795A (en) * 2024-02-07 2024-03-26 东北大学 feedforward control method for hot rolled products with multiple steel types and specifications
CN117840232A (en) * 2024-03-05 2024-04-09 东北大学 Hot rolling process width prediction method based on incremental learning

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117753795A (en) * 2024-02-07 2024-03-26 东北大学 feedforward control method for hot rolled products with multiple steel types and specifications
CN117753795B (en) * 2024-02-07 2024-05-31 东北大学 Feedforward control method for hot rolled products with multiple steel types and specifications
CN117840232A (en) * 2024-03-05 2024-04-09 东北大学 Hot rolling process width prediction method based on incremental learning
CN117840232B (en) * 2024-03-05 2024-05-31 东北大学 Hot rolling process width prediction method based on incremental learning

Similar Documents

Publication Publication Date Title
CN115034437A (en) Hot rolled plate convexity prediction method based on improved XGboost
CN107967542B (en) Long-short term memory network-based electricity sales amount prediction method
CN111241657B (en) Cold-rolled strip steel plate shape prediction method based on deep learning
CN109902861B (en) Order production progress real-time prediction method based on double-layer transfer learning
CN112100745B (en) Automobile girder steel mechanical property prediction method based on LDA theory
CN114897227B (en) Multi-steel mechanical property forecasting method based on improved random forest algorithm
CN110929347A (en) Hot continuous rolling strip steel convexity prediction method based on gradient lifting tree model
CN103745273A (en) Semiconductor fabrication process multi-performance prediction method
CN108817103B (en) Steel rolling model steel family layer classification optimization method
CN109558677A (en) A kind of hot rolling strip crown prediction technique based on data-driven
CN109492748B (en) Method for establishing medium-and-long-term load prediction model of power system based on convolutional neural network
CN108805257A (en) A kind of neural network quantization method based on parameter norm
CN113128124B (en) Multi-grade C-Mn steel mechanical property prediction method based on improved neural network
CN111250548B (en) Board convexity prediction method based on kernel partial least square combined support vector machine
CN114548592A (en) Non-stationary time series data prediction method based on CEMD and LSTM
CN115470595A (en) Hot-rolled strip steel convexity prediction method with data and mechanism fused
CN111882114B (en) Short-time traffic flow prediction model construction method and prediction method
CN117840232B (en) Hot rolling process width prediction method based on incremental learning
CN113102516A (en) Hot continuous rolling strip steel head width prediction method integrating rolling mechanism and deep learning
CN115169453A (en) Hot continuous rolling width prediction method based on density clustering and depth residual error network
CN112308298A (en) Multi-scenario performance index prediction method and system for semiconductor production line
CN110910528B (en) Method and device for predicting tensile strength of paper sheet
CN116826745A (en) Layered and partitioned short-term load prediction method and system in power system background
CN116565850A (en) Wind power ultra-short-term prediction method based on QR-BLSTM
CN112966870B (en) Cold rolled silicon steel quality defect prediction method based on neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination