Disclosure of Invention
The invention aims to provide a crystallizer bleed-out forecasting method for generating a countermeasure network based on a characteristic vector and SWGAN-GP, which can timely and accurately detect and forecast the bonded bleed-out and provides a reliable means for monitoring the abnormity of the continuous casting process.
In order to achieve the purpose, the technical scheme of the invention is as follows:
a crystallizer breakout prediction method for generating a countermeasure network based on a feature vector and SWGAN-GP is characterized in that visual feature vectors are extracted from an abnormal region of a crystallizer copper plate temperature rate, and the countermeasure network is generated by the SWGAN-GP to classify the feature vectors, so that the crystallizer breakout is detected and predicted, and the method specifically comprises the following steps:
first, extracting the characteristics of the abnormal area of the temperature rate of the copper plate of the crystallizer
(1) 3 rows of 19 rows of thermocouples are arranged on the wide-surface copper plates of the inner arc and the outer arc of the crystallizer, and 3 rows of 1 row of thermocouples are arranged on the narrow-surface copper plates at the left side and the right side. And detecting the temperature of the thermocouple of the crystallizer copper plate on line, and calculating the temperature value of the crystallizer copper plate at the position of the non-thermocouple measuring point by an interpolation algorithm.
(2) And calculating the temperature change rate of each point of the copper plate through an interframe difference algorithm, and mapping the temperature rate of the copper plate to a two-dimensional plane by using computer graphics and OpenGL technology to obtain a two-dimensional temperature rate thermal image corresponding to the temperature of the copper plate.
(3) After counting and summarizing temperature rate data of a plurality of breakout samples, setting a temperature rate threshold value as TzUsing threshold segmentationThe method comprises the steps of removing a normal temperature fluctuation area with the temperature rate smaller than a threshold value from a two-dimensional temperature rate thermal image, and performing connectivity search on temperature rate abnormal points by using a run-length recursive algorithm to obtain a temperature rate abnormal area.
(4) Extracting the height H, width W, area S and transverse moving speed V of the abnormal temperature speed regionyLongitudinal moving speed VxAnd (6) visualizing the features.
Second step, abnormal region feature vector construction and processing
(1) Combining the abnormal region features extracted in the first step into a feature vector XBSimultaneously combining the normal working condition region features into a feature vector XN:
XB=[HB,WB,SB,VBx,VBy]
XN=[HN,WN,SN,VNx,VNy]
(2) Pulling speed V under combined abnormal and normal working conditionscAnd Vc', for feature vector XBAnd XNPerforming continuous processing to construct continuous feature vector ZBAnd ZN:
(3) For continuous feature vector ZBAnd ZNAnd (3) carrying out normalization treatment:
in the formula, Zmin、ZmaxRespectively representing the minimum and maximum values of the continuous-type eigenvector Z, FiNormalized pair of ith dimension feature representing continuous feature vector ZThe corresponding values.
(4) Respectively obtaining m abnormal area bleed-out feature vector samples F according to the feature vector construction and processing modes of the steps (1), (2) and (3)BAnd n normal working condition area non-breakout eigenvector samples FNAnd establishing a sample set Q:
Q={(FB1,1),(FB2,1),…,(FBm,1),(FN1,0),(FN2,0),…,(FNn,0)}
in the formula, m and n respectively represent the number of the breakout sample and the non-breakout sample. 1 and 0 represent class labels for the breakout and non-breakout specimens, respectively, as (1, 0, 0) and (0, 1, 0) in the form of one-hot codes.
Thirdly, constructing an SWGAN-GP generation confrontation network model
(1) Constructing a generative model G comprising 1 noise input layer and 2 full-connection layer neural networks, wherein the concrete structure of the generative model G sequentially comprises the following steps: noise input layer → first fully connected layer → second fully connected layer. The input noise generates pseudo samples with the same dimension as the original feature vector after G.
(2) Constructing a discrimination model D containing 1 sample input layer and 3 full-connection layer neural networks, wherein the specific structure of the discrimination model D is as follows in sequence: sample input layer → first fully connected layer → parallel branch second fully connected layer → parallel branch third fully connected layer. And finally D, judging the authenticity of the input sample and classifying the sample.
(3) And combining the generation model G with the discrimination model D to construct a SWGAN-GP generation confrontation network model.
Fourthly, training SWGAN-GP to generate a confrontation network model
(1) And (3) training the discrimination model D constructed in the step (2) in the third step. And freezing the generated model when the discriminant model is trained, namely setting the parameters of the generated model to be not updatable.
1.1) randomly obtaining a breakout sample F from a sample set QBAnd non-breakout specimen FNEach train _ samples has a single sample denoted x, and x belongs to the real sample set PrI.e. x to Pr(ii) a Simultaneous acquisition of Gaussian distributed noiseSet of samples z, i.e. z-Pz。
1.2) inputting the noise sample z into the generation model G constructed in the step (1) in the third step to generate a pseudo sample
The label of (a) is represented in a one-hot coded form as (0, 0, 1):
in the formula, a pseudo sample
Belonging to a set of dummy samples P
GI.e. by
1.3) obtaining P
rSample x and P in (1)
GSample of (1)
And interpolation is carried out to obtain a new sample
In which ε follows a uniform distribution over [0,1 ].
1.4) calculating the Total loss L of the discriminant model DD:
In the formula, m is the batch sample number batch _ size, i is 1, and 2 … m is the sample number index.
D (x) are respectively
x corresponds to the output value of the discriminator D. λ represents a gradient penalty coefficient.
Is a gradient penalty term. C denotes the number of sample classes, j is 1, and 2 … C is the class index. y is
jFor the true label corresponding to the jth category, f
j(x) In order for the arbiter to predict the value for that sample,
representing the loss of cross-entropy part of the multivariate classification.
(2) Training the generative model G constructed in the step (1) in the third step. And (4) freezing the discrimination model when the generation model is trained, namely setting the parameters of the discrimination model to be not updatable.
2.1) obtaining a set of noise samples z' -P obeying Gaussian distributionz′。
2.2) inputting the noise sample z' into the generation model G constructed in the step (1) in the third step to generate a pseudo sample
Then will be
Calculating the model loss L of the discriminant model D
G:
(3) Training discriminant model n in each round of trainingcriticThen, generating a model ngenRepeating the training to judge model D and generate model G, and observing LGAnd LDAs a function of the number of training rounds, up to LGAnd LDThe loss curve gradually flattens and fluctuates steadily. At this time, the generative model G and the discriminant model D can be determinedNash balance is achieved, and the training is finished.
Fifthly, detecting and forecasting bleed-out on line based on SWGAN-GP model
(1) Extracting typical visual characteristics of the crystallizer copper plate temperature rate abnormal area in real time, preprocessing the typical visual characteristics, and constructing to obtain an abnormal area characteristic vector Ffv;
(2) Abnormal region feature vector FfvInputting the data into a discrimination model D of the SWGAN-GP generated countermeasure network to obtain a predicted value y of the model:
y=D(Ffv)
(3) and forecasting the crystallizer bleed-out according to the output result y of the discrimination model.
Expressing y as (y) in a form of one-hot coding
1,y
2,y
3): if y
1=max(y
1,y
2,y
3) If the data label corresponding to y is (1, 0, 0), the alarm is given and the casting machine pulling speed is rapidly reduced; if y
2=max(y
1,y
2,y
3) And if the detected steel leakage is normal, continuing to detect and forecast the steel leakage at the next moment corresponding to the data label with the value of y being (0, 1, 0). Feature vector x of abnormal region
fvAll derived from real samples detected on-line rather than pseudo-samples generated by generative model G
So that y cannot occur
3The case of the maximum value.
The method for forecasting the breakout is suitable for forecasting the breakout of continuous casting billets such as plate blanks, square blanks, round blanks, special blanks and the like.
The invention has the beneficial effects that: the method constructs a characteristic vector containing static and dynamic characteristics of a bonding area through a visualized thermograph of the temperature rate of the crystallizer copper plate, and classifies the characteristic vector through a discrimination model of a countermeasure network generated by SWGAN-GP so as to realize detection and forecast of the breakout of the crystallizer. The method is used for detecting and forecasting the crystallizer bleed-out in real time based on the SWGAN-GP model, so that the false alarm rate can be obviously reduced and the forecasting accuracy rate can be effectively improved on the premise of ensuring that the bleed-out is completely reported.
Detailed Description
The invention will be further elucidated by means of specific embodiments, in conjunction with the drawing
Fig. 1 shows a flow chart of a method for predicting breakout of a crystallizer. Firstly, extracting visual characteristics of a crystallizer copper plate temperature rate abnormal area and preprocessing the visual characteristics to construct and obtain a five-dimensional characteristic vector; secondly, constructing and training a SWGAN-GP model; and finally, classifying the feature vectors and forecasting the breakout through a discrimination model of SWGAN-GP.
First step, visualization of temperature rate of crystallizer copper plate and extraction of abnormal area characteristics
(1) Fig. 2 shows the distribution diagram of the copper plate of the mold and its thermocouple. The crystallizer is formed by combining four copper plates, the total height is 900mm, and the effective height during casting is 800 mm. 3 rows of 19 rows of thermocouples 1 are arranged on the inner and outer arc wide- surface copper plates 5 and 2 of the crystallizer, 3 rows of 1 row of thermocouples are arranged on the left and right narrow-surface copper plates 3 and 4, and the total number of the thermocouples is 120. The distance between the upper openings of the first row of thermocouple data crystallizers is 210mm, the distance between the first row of thermocouple data crystallizers and the second row of thermocouple data crystallizers is 115mm, the distance between the second row of thermocouple data crystallizers and the third row of thermocouple data crystallizers is 120mm, and the distance between two adjacent rows of thermocouples is 150 mm. And (3) detecting the temperature of all the thermocouples (1) of the crystallizer copper plate on line, and calculating the temperature value of the crystallizer copper plate at the position of the non-thermocouple measuring point by an interpolation algorithm.
(2) And calculating the temperature change rate of each point of the copper plate through an interframe difference algorithm, and mapping the temperature rate of the copper plate to a two-dimensional plane by using computer graphics and OpenGL technology to obtain a two-dimensional temperature rate thermal image corresponding to the temperature of the copper plate.
(3) Setting a temperature rate threshold value to be 0.3 ℃/s, removing a normal temperature fluctuation area with the temperature rate smaller than the threshold value from the two-dimensional temperature rate thermal image by using a threshold segmentation algorithm, and performing connectivity search on temperature rate abnormal points by using a run recursion algorithm to obtain a temperature rate abnormal area.
(4) Fig. 3 is a graph showing the visualization characteristic of the abnormal temperature rate region. The simulation interval from the first row to the third row along the casting direction is distributed to 100 pixel points; the horizontal simulation interval is from the first row to the nineteenth row of thermocouples distributed to 300 pixel points. T is1~T4Representing 4 moments in time corresponding to the bond region from initial formation to the appearance of a distinct "V" shaped feature, each moment being 3s apart. Extraction of T4Height H of time anomaly regionB20.45cm wide WB56.11cm, shaded area SB=607.08cm2And calculating to obtain the transverse movement velocity V according to the change of the barycentric coordinates of the abnormal region along with the timeByLongitudinal moving speed V of-0.06 m/minBx=0.25m/min。
Fig. 4 is a normal condition area visualization characteristic diagram. The visual characteristics of the region, height H, can be obtained by the same methodN5.64cm, width WN21.5cm, shaded area SN=92.36cm2Transverse moving velocity VNy0m/min, longitudinal moving speed VNx=-0.39m/min。
Second step, abnormal region feature vector construction and processing
(1) Combining the abnormal and normal working condition region characteristics extracted in the first step into a characteristic vector XBAnd XN:
XB=[HB,WB,SB,VBx,VBy]=[20.45,56.11,607.08,0.25,-0.06]
XN=[HN,WN,SN,VNx,VNy]=[5.64,21.5,92.36,-0.39,0]
(2) Pulling speed V under combined abnormal and normal working conditionsc0.9m/min and Vc' 0.65m/min, for feature vector XBAnd XNPerforming continuous processing to construct continuous feature vector ZBAnd ZN:
(3) For continuous feature vector ZBAnd ZNAnd (3) carrying out normalization treatment:
in the formula, Zmin、ZmaxRespectively representing the minimum and maximum values of the continuous-type eigenvector Z, FiAnd representing the corresponding numerical value after the ith dimension feature normalization of the continuous feature vector Z. Normalized feature vector FB=[0.84,0.71,0.86,0.73,0.13],FN=[0.19,0.18,0.02,0,0.25]。
Respectively obtaining 50 bleed-out eigenvector samples F according to the above eigenvector construction and processing modeBAnd 50 non-breakout eigenvector samples FNAnd establishing a sample set Q:
Q={(FB1,1),(FB2,1),…,(FB50,1),(FN1,0),(FN2,0),…,(FN50,0)}
in the formula, 1 and 0 represent class labels of the breakout sample and the non-breakout sample, respectively, and are expressed as (1, 0, 0) and (0, 1, 0) in the form of one-hot codes.
Thirdly, constructing an SWGAN-GP generation confrontation network model
(1) As shown in fig. 5, a generative model including a noise input layer and a fully connected layer neural network is constructed for generating a pseudo sample, denoted as generative model G.
(2) As shown in fig. 6, a discrimination model including a sample input layer and a full connection layer neural network is constructed, and is used for discriminating an authentic sample and classifying the sample, and the discrimination model is recorded as a discrimination model D.
(3) And combining the generation model G with the discrimination model D to construct a SWGAN-GP generation confrontation network model.
Fourthly, training SWGAN-GP to generate a confrontation network model
(1) And (5) training a discrimination model. And freezing the generated model when the discriminant model is trained, namely setting the parameters of the generated model to be not updatable.
1.1) randomly obtaining a breakout sample F from a sample set QBAnd non-breakout specimen FNEach 30, a single sample is denoted x, and x belongs to the set of true samples PrI.e. x to Pr(ii) a Simultaneously obtaining a set of noise samples z, i.e. z-P, which obey a Gaussian distributionz。
1.2) input of noise samples z into the generative model G to generate pseudo samples
The label of (a) is represented in a one-hot coded form as (0, 0, 1):
in the formula, a pseudo sample
Belonging to a set of dummy samples P
GI.e. by
1.3) obtaining P
rSample x and P in (1)
GSample of (1)
And interpolation is carried out to obtain a new sample set
In which ε follows a uniform distribution over [0,1 ].
1.4) calculating the Total loss L of the discriminant model DD:
In the formula, m is the
batch sample number 32, i is 1, and 2 … m is the sample number index.
D (x) are respectively
x corresponds to the output value of the discriminator D. The gradient penalty factor lambda is 10,
is a gradient penalty term. The number of sample types C is 3, and represents the breakout sample F
BNon-bleed-out sample F
NAnd a dummy sample
j is 1,2, 3 is a category index. y is
jFor the true label corresponding to the jth category, f
j(x) In order for the arbiter to predict the value for that sample,
representing the loss of cross-entropy part of the multivariate classification.
(2) And training the generated model. And (4) freezing the discrimination model when the generation model is trained, namely setting the parameters of the discrimination model to be not updatable.
2.1) obtaining a set of noise samples z' -P obeying Gaussian distributionz′。
2.2) inputting the noise sample z' into the generative model G to generate a pseudo sample
Then will be
Calculating the model loss L of the discriminant model D
G:
Training the discriminant model 5 times and generating the model 1 time in each round of training, repeating the training of discriminant model D and generating model G, and observing LGAnd LDWhen the number of training rounds reaches 20000 times as the number of training rounds changes, L is observedGAnd LDThe loss curve gradually flattens and fluctuates steadily. At this time, it can be judged that nash balance is achieved between the generated model G and the discriminant model D, and the training is completed.
Fifthly, detecting and forecasting bleed-out on line based on SWGAN-GP model
(1) Extracting typical visual characteristics of the crystallizer copper plate temperature rate abnormal area in real time, as shown in fig. 7, preprocessing the typical visual characteristics, and constructing to obtain an abnormal area characteristic vector:
Ffv1=[0.78,0.30,0.46,0.84,0.73];Ffv2=[0.06,0.23,0.08,0.63,0.56]
(2) will be abnormalRegion feature vector Ffv1And Ffv2Inputting the predicted value y of the model into a discrimination model D of the SWGAN-GP generated countermeasure network1And y2:
y1=D(Ffv1)=D([0.78,0.30,0.46,0.84,0.73])=(0.81,0.19,0)
y2=D(Ffv2)=D([0.06,0.23,0.08,0.63,0.56])=(0.12,0.88,0)
(3) And forecasting the crystallizer bleed-out according to the output result y of the discrimination model. y is
1If the data label corresponding to y is (1, 0, 0) is breakout, (0.81,0.19,0) gives an alarm and rapidly reduces the casting machine pulling speed; y is
2And if the value is equal to (0.12,0.88 and 0), the data label corresponding to (0, 1 and 0) is a normal working condition, and the steel leakage detection and prediction at the next moment are continued. Feature vector x of abnormal region
fvAll derived from real samples detected on-line rather than pseudo-samples generated by generative model G
So that y cannot occur
3The case of the maximum value.
The above-mentioned embodiments only express the embodiments of the present invention, but not should be understood as the limitation of the scope of the invention patent, it should be noted that, for those skilled in the art, many variations and modifications can be made without departing from the concept of the present invention, and these all fall into the protection scope of the present invention.