CN112308298B

CN112308298B - Multi-scenario performance index prediction method and system for semiconductor production line

Info

Publication number: CN112308298B
Application number: CN202011108406.3A
Authority: CN
Inventors: 乔非; 高陈媛; 刘鹃
Original assignee: Tongji University
Current assignee: Tongji University
Priority date: 2020-10-16
Filing date: 2020-10-16
Publication date: 2022-06-14
Anticipated expiration: 2040-10-16
Also published as: CN112308298A

Abstract

The invention relates to a multi-scenario performance index prediction method and a multi-scenario performance index prediction system for a semiconductor production line, wherein the method comprises the following steps: the production scene quantitative division module is driven by data, quantitatively maps the product value, the average product processing period and the utilization rate of each device of the production line, and divides the production line into three scenes of light load, normal load and heavy load; the main prediction network construction module is used for combining a deep neural network algorithm and semiconductor production line performance prediction by taking the divided normal load data as sample data to construct a prediction network under a normal load scene; and the multi-scene prediction model building module is used for applying the idea of transfer learning to production line prediction and building networks under light load and heavy load scenes according to the built main prediction network so as to form the multi-scene prediction model. Compared with the prior art, the method quantitatively divides the production line scene, can more accurately predict the performance indexes of a plurality of production lines under different load scenes, and can be used on line.

Description

Multi-scenario performance index prediction method and system for semiconductor production line

Technical Field

The invention relates to the technical field of intelligent manufacturing, in particular to a multi-scenario performance index prediction method and system for a semiconductor production line.

Background

The manufacturing industry is the main body of the national economy, is an engine for the high-speed growth of the economy and is the important embodiment of the comprehensive national force. In order to ensure competitiveness, manufacturing enterprises need to perform dynamic scheduling management quickly and reasonably allocate resources. The real-time performance index prediction result can provide a basis for production decision and evaluation, so that the performance index of the production line which is achieved under a certain scheduling mode is required to be known quickly.

At present, three methods, namely mathematical modeling, simulation modeling and machine learning modeling, are mainly used as prediction model modeling methods. Firstly, the semiconductor manufacturing is a complex process with high nonlinearity, multivariable coupling and random uncertainty, and the establishment of a mathematical model is extremely difficult; secondly, the traditional simulation modeling method needs to invest a large amount of capital, is long in prediction time, and is poor in capability of adapting to a dynamic environment. In order to overcome the limitations of mathematical models and traditional simulation prediction models, expert scholars introduce machine learning techniques to study data-driven production line prediction methods.

Chinese patent "Performance prediction method for semiconductor production line dynamic scheduling" (patent publication No. CN 103310285A) invented a semiconductor production line scheduling system based on extreme learning machine, the method collects semiconductor production line historical data to establish sample set and test sample set; constructing a prediction model by adopting an extreme learning machine method; and testing the network performance of the prediction model by using the test sample, and outputting the prediction result after normalizing. Chinese patent "a production line spare part damage rate prediction system based on LR" (patent publication No. CN 108898254A) invented a production line spare part damage rate prediction system, and this method collects the operation record of the equipment through the sensor, the current per hour, the average voltage value, the length of operation and other data; manually acquiring the time for installation of the spare part for replacement; obtaining a model through LR training; and predicting the current loss rate of the equipment in the corresponding category through the model.

It can be seen that most of the existing methods only aim at a single production scene, a semiconductor production line has a plurality of uncertain dynamic production factors such as feeding change, production line load change and the like, the existing identification and prediction methods are difficult to adapt to a dynamic production environment, most production line performance predictions focus on single performance indexes such as productivity and the like, and meanwhile, the prediction of multiple performance indexes for predicting equipment utilization rate and productivity is less. Therefore, a timely performance index prediction method capable of adapting to various production line states is needed, and the method comprises three components of scene division, construction of a main prediction network and construction of each scene network to form a new model, so that the accuracy and adaptability of production line performance prediction are improved, and more powerful support is provided for scheduling decisions. There are no documents and patents related to the above-mentioned production line performance prediction method.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provide a multi-scenario performance index prediction method and system for a semiconductor production line.

The purpose of the invention can be realized by the following technical scheme:

a multi-scenario performance index prediction method for a semiconductor production line comprises the following steps:

step 1: carrying out quantitative mapping on historical data of a production line, and carrying out data division according to scene quantification respectively corresponding to light load scenes, normal load scenes and heavy load scenes;

step 2: taking the divided data under the normal load scene as sample data, and combining a deep neural network algorithm with the performance prediction of a semiconductor production line to construct a prediction network under the normal load scene;

and step 3: based on the prediction network under the normal load scene, the prediction networks under the light load scene and the heavy load scene are further constructed, and the prediction networks under the light load scene, the normal load scene and the heavy load scene form a multi-scene prediction model;

and 4, step 4: and after the online data of the production line are divided into different scene results according to the threshold value, selecting a corresponding network from the multi-scene prediction model for prediction to obtain a performance index prediction result.

Further, the production line historical data includes input quantity and output quantity, wherein the input quantity includes: sampling days, average processing period of products, average daily moving steps, total number of products in process, queue length of each buffer area, total queue length of the buffer areas and total moving steps; the output quantity comprises: average processing period of products and utilization rate of each device.

Further, the process of constructing the prediction network under the normal load scenario in step 2 includes the following sub-steps:

step 201: coding the scheduling rule symbols in the sample data, taking the coding as input to carry out normalization processing, and limiting the number to [0, 1%]Within the interval, the normalization formula is [ x-min (x)_i)]/[max(x_i)-min(x_i)]Wherein x refers to an input variable, and i is a sample variable;

step 202: adopting a deep neural network to construct a prediction network under a normal load scene; combining a deep learning algorithm with the performance prediction of a production line, and obtaining the number of proper hidden layers, the number of neurons of each hidden layer and excitation functions of each layer by adopting a grid search method;

step 203: testing the network performance of the prediction network by adopting a test sample, comparing an output value corresponding to a prediction result obtained by the test sample after reverse normalization processing with an output value of the test sample, and judging whether the accuracy requirement is met;

step 204: if the prediction accuracy of the test result can meet the accuracy requirement, the prediction network under the normal load scene is successfully established, if the prediction accuracy of the test result cannot meet the accuracy requirement, the step 202 is returned, the number of hidden layers, the number of neurons of each hidden layer and the excitation function of each layer are reselected, and the model is trained again.

Further, the scheduling rule symbol in step 201 is encoded in a one-hot encoding manner, so that it can be received by the network.

Further, the inputs of the deep learning algorithm in step 202 are:

for a given training set of k different samples:

X^k＝{S^k,D^k,Ru^k,P^k|S^k∈R^∝,D^k∈R^β,Ru^k∈R^λ,P^k∈R^m}

wherein the content of the first and second substances,

the state is the state of an intelligent workshop system;

feeding information for an intelligent workshop;

scheduling rules for the intelligent workshop;

for the current state S of the intelligent workshop system^kCurrent feeding information D^kCurrent scheduling rule Ru^kPerformance index after 1 day under the circumstances;

the output of the deep learning algorithm in step 202 is o_iWhere i is 1, …, m1, m1 indicates the dimension of the output value o, i.e. there are m1 outputs.

Further, the accuracy requirement in step 204 is based on the root mean square error, which is described by the formula:

where m is the number of output variables, P is the input performance index, Y is the predicted performance index,

is the output variable sample number.

Further, the process of constructing the prediction network under the light load and heavy load scenarios in step 3 includes the following sub-steps:

step 301: assigning the number of neurons of an input layer of the prediction network under a normal load scene to the prediction network under light load and heavy load scenes;

step 302: assigning the number of neurons of each hidden layer except the last hidden layer of the prediction network under the normal load scene to the prediction network under the light load scene and the heavy load scene as the number of neurons of the corresponding front n-1 layer;

step 303: assigning the weight threshold information of each hidden layer except the last hidden layer of the prediction network under the normal load scene to the prediction network under the light load scene and the heavy load scene as initialization data;

step 304: assigning excitation functions of all hidden layers except the last hidden layer of the prediction network under a normal load scene to the prediction network under light load and heavy load scenes;

step 305: and (3) adjusting the number of neurons of the last layer of hidden layer by a network search method, inputting the data under the light load and heavy load scene divided in the step (1) into a prediction network under the light load and heavy load scene for training, and obtaining the prediction network under the light load and heavy load scene after the training is finished.

Further, the multi-scenario prediction model building module comprises:

constructing a light-load scene prediction network, namely changing the number of neurons of the last layer of hidden layer on the basis of a main prediction network by taking light-load scene data as a sample, and constructing the light-load scene prediction network;

constructing a heavy-load scene prediction network, namely changing the number of neurons of a last layer of hidden layer on the basis of a main prediction network by taking heavy-load scene data as a sample, and constructing the heavy-load scene prediction network;

and (4) network combination, namely combining the three prediction networks. And forming a multi-scene prediction model.

Further, the step 1 comprises the following sub-steps:

step 101: collecting historical data of a production line for quantitative mapping under the same sampling days, and drawing a line graph;

step 102: and carrying out data division according to curve change rules in each line graph and scene quantification corresponding to the light load scene, the normal load scene and the heavy load scene respectively.

Further, in the light load scenario in step 102, in the corresponding curve, the starting point is a point corresponding to the total in-process value when the equipment utilization rate starts to enter the stationary stage, and the end point is a point corresponding to the total in-process value when the slope of the average processing cycle curve of the product is maximum;

under a normal load scene, in the corresponding curve, the starting point is the point corresponding to the product value when the slope of the average processing period curve of the product is maximum, and the end point is the point corresponding to the product value when the slope of the average processing period curve of the product is minimum;

under a heavy load scene, the starting point in the corresponding curve is the point corresponding to the total product value when the slope of the average processing cycle curve of the product is minimum.

The invention also provides a system adopting the semiconductor production line-oriented multi-scenario performance index prediction method, which comprises the following steps:

the main prediction network construction module is used for combining a deep neural network algorithm and semiconductor production line performance prediction by taking the divided data under the normal load scene as sample data to construct a prediction network under the normal load scene;

the multi-scene prediction model construction module is used for further constructing prediction networks under light load and heavy load scenes based on the prediction networks under normal load scenes, and forming the prediction networks under the light load scenes, the normal load scenes and the heavy load scenes into a multi-scene prediction model;

and the production scene quantitative dividing module is used for carrying out quantitative mapping on the production line historical data and carrying out data division according to the scene quantification respectively corresponding to the light load scene, the normal load scene and the heavy load scene.

Compared with the prior art, the invention has the following advantages:

1. the invention divides the production environment into three scenes of light load, normal load and heavy load by quantitatively dividing the production scene, so that the prediction method is suitable for the dynamic production environment;

2. the invention combines the deep learning algorithm with the semiconductor production line prediction, and establishes the neural network prediction model by respectively taking the divided light load, normal load and heavy load data as sample data, thereby being capable of responding to the real-time change in the system in time and reducing the need of rescheduling. The system is regarded as a black box, and the provided modeling method excavates useful knowledge in historical data and off-line data which can be obtained by using a production line, so that real-time on-line optimization control is realized;

3. the invention applies the transfer learning idea to the training process of the multi-scenario network, and constructs the network under the light load and heavy load scenarios according to the constructed main prediction network, thereby forming a multi-scenario prediction model, reducing the network training time and improving the accuracy of the production linear performance prediction;

4. the method can predict the performance indexes of a plurality of production lines one day later and provide more data support for production decision.

Drawings

FIG. 1 is a schematic structural view of the present invention;

FIG. 2 is a diagram of a MiniFAB model structure according to an embodiment of the present invention;

fig. 3 is a schematic diagram of the average processing cycles of EDD, SRPT, and EDD products respectively according to the scheduling rules of the sampling Ma to Me devices on the 26 th day in the embodiment of the present invention;

fig. 4 is a diagram illustrating quantities of work in process of EDD, SRPT, EDD, respectively, of the device scheduling rules from Ma to Me sampled on day 26 in the embodiment of the present invention;

fig. 5 is a schematic diagram illustrating utilization rates of EDD, SRPT, EDD, and EDD of the scheduling rules of the equipment from Ma to Me sampled on the 26 th day in the embodiment of the present invention;

FIG. 6 is a schematic diagram illustrating a comparison of root mean square errors of scene models, non-divided scene models, and multi-scene multi-performance index prediction models in terms of productivity according to an embodiment of the present invention;

FIG. 7 is a schematic diagram illustrating a comparison of root mean square errors of scene models, non-divided scene models, and multi-scene multi-performance index prediction models in the Ma device utilization rate according to an embodiment of the present invention;

FIG. 8 is a schematic diagram illustrating a comparison of root mean square errors of scene models, non-divided scene models, and multi-scene multi-performance index prediction models in the Mb device utilization according to an embodiment of the present invention;

FIG. 9 is a schematic diagram illustrating a comparison of root mean square errors of each scene model, an undivided scene model and a multi-scene multi-performance index prediction model in the embodiment of the present invention in the utilization rate of the Mc device;

FIG. 10 is a schematic diagram illustrating a comparison of root mean square errors of scene models, non-divided scene models, and multi-scene multi-performance index prediction models in the Md device utilization according to an embodiment of the present invention;

fig. 11 is a schematic diagram of comparing root mean square errors of each scene model, an undivided scene model and a multi-scene multi-performance index prediction model in the Me device utilization rate in the embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, shall fall within the scope of protection of the present invention.

DETAILED DESCRIPTION OF EMBODIMENT (S) OF INVENTION

As shown in FIG. 1, the invention provides a multi-scenario multi-performance index prediction method for a semiconductor production line, which comprises a production scenario quantitative division module, a main prediction network construction module and a multi-scenario prediction model construction module. The production scene quantitative dividing module is driven by data, quantitatively maps the product value of the production line, the average processing period of products and the utilization rate of each device, and divides the production line into three scenes of light load, normal load and heavy load; the main prediction network construction module is used for combining a deep neural network algorithm and semiconductor production line performance prediction by taking the divided normal load data as sample data to construct a prediction network under a normal load scene; the multi-scene prediction model building module comprises: and applying the idea of transfer learning to production line prediction, and constructing a network under light load and heavy load scenes according to the constructed main prediction network so as to form a multi-scene prediction model.

This example is described by taking the MiniFab model as an example. The MiniFab is an experimental simulation model of a semiconductor production process proposed by doctor Karl Kempf, a chief scientist of the manufacturing system of Intel corporation, and consists of 5 devices and 6 procedures, and the model structure is shown in fig. 2.

The method for predicting the multi-scene multi-performance index for the semiconductor production line applies a MiniFab model as an implementation object, and comprises the following working processes:

step 1, defining sample information X^k＝{S^k,D^k,Ru^k,P^k|S^k∈R^∝,D^k∈R^β,Ru^k∈R^λ,P^k∈R^mTherein of

Describing the state of the intelligent workshop system;

describing feeding information of the intelligent workshop;

describing an intelligent workshop scheduling rule;

described in the current state S^kCurrent feeding mode D^kCurrent scheduling rule Ru^kPerformance index after the next 1 day. Selecting workshop productivity PROD and equipment Ma utilization rate U_aMb utilization U of equipment_bUtilization rate U of Mc of equipment_cMd utilization rate U of equipment_dMe utilization rate U of equipment_eThe six performance indexes are used as prediction targets, and the production line system state S in the sample information^kAnd feeding information D^kScheduling rules Ru^kThe definitions are given in tables 1, 2 and 3. Selecting 10% as a test sample set;

TABLE 1 System State S in MiniFab sample information^kDefinition of

Name of field	Description of the invention	Data type
			Day	Number of days of sampling	Shaping machine
MCT	Average processing cycle of product	Floating point type
			MDayMOV	Average daily moving step number	Shaping machine
WIP	Number of products being processed	Shaping machine
			Mab_Queue	MaMb buffer Captain	Shaping machine
Mcd_Queue	McMd buffer Captain	Shaping machine
			Me_Queue	Me buffer Captain	Shaping machine
Total_Queue	Total queue length of buffer	Shaping machine
			Total_MOV	Total number of moving steps	Shaping machine

TABLE 2 feeding information D in MiniFab sample information^kDefinition of

Name of field	Description of the invention	Data type
			x_Release	Daily average charge of product x	Shaping machine
y_Release	Daily average charge of product y	Shaping machine
			z_Release	Daily average charge of product z	Shaping machine

TABLE 3 scheduling rules Ru in MiniFab sample information^kDefinition of

Name of field	Description of the invention	Data type
			Ma_Rule	Scheduling rules for device Ma	Character type
Mb_Rule	Scheduling rules for device Mb	Character type
			Mc_Rule	Scheduling rules for device Mc	Character type
Md_Rule	Scheduling rules for device Md	Character type
			Me_Rule	Scheduling rules of device Me	Character type

In this embodiment, the number of sampling days is 1 to 30, the daily average feeding of the product a, the product b, and the product c is increased from 1 to 19, the Ma and Mb scheduling rules select EDD or SRPT, the Mc and Md scheduling rules select EDD or SRPT or CR, the Me scheduling rules select EDD or SRPT, and the utilization rate and the productivity performance index of each device after 1 day are obtained.

Step 2, averaging the product processing period MCT, the product number WIP and the equipment Ma utilization rate U_aMb utilization U of equipment_bUtilization rate U of Mc of equipment_cMd utilization rate U of equipment_dMe utilization rate U of equipment_eThe average total material feeding amount of the product day is mapped through the sampling days, and each material feeding amount in the graph is observedAnd quantitatively dividing light load scenes, normal load scenes and heavy load scenes of the production line according to the corresponding relation of curve change rules.

Scene 1: and (5) carrying light scenes. Under the scene, the utilization rate U of the equipment Ma_aMb utilization U of equipment_bUtilization rate U of Mc of equipment_cMd utilization rate U of equipment_dMe utilization rate U of equipment_eAnd the light load scene is quantitatively divided according to the WIP curve corresponding to the starting point and the end point of the scene.

Scene 2: and (5) normal scenes. Under the scene, the utilization rate of each device is stable, the MCT fluctuates, the starting point of the scene is the maximum slope point of the MCT curve, the end point of the scene is the minimum slope point of the MCT curve, and the normal scene is divided quantitatively according to the WIP curve corresponding to the starting point and the end point of the scene.

Scene 3: and (5) overloading the scene. Under the scene, the utilization rate of each device is stable, the MCT curve is always raised without descending, the WIP curve is raised, the starting point of the scene is the minimum point of the slope of the MCT curve, and the heavy load scene is divided quantitatively according to the WIP curve corresponding to the starting point and the end point of the scene.

And quantitatively obtaining WIP values of the three scene partitions according to the initial abscissa corresponding to the three scenes, and laying a foundation for the subsequent online data to partition the scenes according to the specific WIP values.

1a) Drawing a mapping chart: under the same sampling days, selecting the average processing period MCT, the WIP and the equipment Ma utilization rate U of the products under a plurality of scheduling rules_aMb utilization U of equipment_bUtilization rate U of Mc of equipment_cMd utilization rate U of equipment_dMe utilization rate U of equipment_eDrawing a line graph. Here, the EDD, SRPT, EDD, and EDD are drawn by taking the sampling days of 26, Ma, Mb, Mc, Md, and Me device scheduling rules as examples, respectively, fig. 3, 4, and 5;

1b) and (3) quantitative scene division: when the sample id is 6, the equipment utilization rate of fig. 5 tends to be stable, and the WIP value of fig. 4 is approximately 60; when the sample id is 8, the slope of the MCT waving phase of fig. 3 is maximum, when the WIP of fig. 4 waves up and down at 200; when the sample id is 16, MCT only rises and does not fall, and WIP fluctuates at 750. The scene division definition index is WIP, and therefore, it is determined as a light-load scene when WIP is [60,200 ], as a normal scene when WIP is [200,750), and as a heavy-load scene when WIP is [750, ∞).

And 3, coding the scheduling rule symbol to enable the scheduling rule symbol to be received by the network, carrying out normalization processing on the original data through min-max, removing the dimension of the original data, and changing the original data into a decimal between [0,1 ]. The coding table is shown in table 4.

TABLE 4 scheduling rules coding Table

Scheduling rules	Encoding
		EDD	[0,1]
SRPT	[1,0]
		CR	[1,1]

And 4, adopting a DNN algorithm to input the data divided into the normal load scene, taking the feeding information D, MaMb scheduling rule, the McMd scheduling rule, the Me scheduling rule, the number of products in process in the system state S, the MaMb buffer area queue length, the McMd buffer area queue length, the Me buffer area queue length, the total buffer area queue length and the total moving step number as input, taking P representing the performance index as output, and obtaining the matching relation between the feeding information, the scheduling rule and the system state and the performance index, namely a main prediction model.

The specific process of learning and obtaining the main prediction model by adopting the deep neural network algorithm comprises the following steps:

2a) weighting matrix W of network^ηV and threshold matrix theta^qGamma initialization, with the value of [ -1,1 [ ]]A random number in between. A training number counter h is set.

2b) Selecting the kth pair of data samples, and calculating the output result of each hidden layer:

2c) determining an output result of an output layer:

2d) calculating correction error of each neuron of output layer and hidden layer

2e) Calculating layer errors of the eta layer hidden layer

2f) According to the selected gradient descent Optimizer as Adadelta Optimizer

2g) Setting a loss function:

is the output variable sample number.

2h) Setting the learning rate to be 0.05, and setting the network learning times to be 50000;

2i) and calculating an output total error according to a loss function defined in the network, jumping out of a training program if the loss function value is smaller than a set numerical value, and adjusting the weight and the threshold value of each layer to continue training if the loss function value is not satisfied.

2j) Checking whether a training frequency counter h reaches a network set value, if not, judging that the training frequency counter h + 1; and if so, jumping out of the program, and resetting the number of the hidden layers, the number of the neurons and the excitation function.

And finally, determining the number of the network layers of the network A to be 6 layers by a grid search method, wherein the network structure is 10 x 9 x 8 x 6, the excitation function of the hidden layers 1-4 is tanh, and the excitation function of the output layer is sigmoid.

And 5, constructing a light-load scene network and a heavy-load scene network by a transfer learning method according to the network information of the main prediction network, wherein the steps are as follows:

step K1: assigning the number of neurons in the input layer of the main prediction network to a new light load/heavy load network;

step K2: assigning the number of neurons of each hidden layer of the main prediction network except the last hidden layer to a new light load/heavy load network as the number of neurons of the first n-1 layers;

step K3: assigning the weight threshold information of each hidden layer of the main prediction network except the last hidden layer to a new light load/heavy load network as initialization data;

step K4: assigning the excitation function of each hidden layer of the main prediction network except the last hidden layer to a new light load/heavy load network;

step K5: and adjusting the number of neurons in the last layer of hidden layer by a grid search method, and inputting light load/heavy load scene data for training.

In the embodiment, the main prediction network A has 4 hidden layers, the first 3 hidden layers of the network A are transferred to the first 3 hidden layers of the network B and the network C, the network B and the network C are obtained through training by adjusting the number of neurons of the last hidden layer, the number of the network layers of the network B is finally determined to be 6, the network structure is 10 × 9 × 6, the excitation functions of the 1-4 hidden layers are tanh, and the excitation function of the output layer is sigmoid; the number of the network layers of the network C is 6, the network structure is 10 x 9 x 7 x 6, the excitation functions of the 1-4 layers of the hidden layers are tan h, and the excitation function of the output layer is sigmoid. The three networks are combined into a whole to form a multi-scene multi-index prediction model, and the model structure is shown in table 5.

TABLE 5 Multi-scenario Multi-index prediction model Structure

Before the online data enter the model, scene recognition is carried out according to a threshold value, and if the scene is recognized as a normal scene, a main prediction network in the model is selected for prediction; if the light-load scene is identified, selecting a light-load scene prediction network in the model for prediction; and if the model is identified as the heavy-load scene, selecting a heavy-load scene prediction network in the model for prediction.

And respectively applying the non-scene-division model, the single-scene model and the multi-scene multi-performance index prediction model to the MiniFAB model, and comparing the prediction results of the non-scene-division network, the single-scene model and the multi-scene multi-performance index prediction model.

The table for comparing the outputs of the non-scene model and the multi-scene multi-index prediction model with the average value of the test sample is shown in table 6.

TABLE 6 comparison table of average values of test samples and outputs of non-scene model and multi-scene multi-index prediction model

In the case of an undifferentiated scene model, namely, a whole network model is obtained by mixed training of data, the highest relative error reaches 3.4%, and the lowest error is 1.0%, in the case of a multi-scene multi-index prediction model, the highest relative error reaches 1.5%, and the lowest relative error is 0.15%, and from the perspective of the relative error of the output average value, the multi-scene multi-index prediction model is superior to the undifferentiated scene model.

The mean root mean square error output values of the single scene model, the non-divided scene model and the multi-scene multi-index prediction model are shown in table 7.

TABLE 7 RMS error mean value table for each model

On the whole, the multi-scene multi-index prediction model is higher in precision and better in quality.

The root mean square error of the productivity of the single scene model, the non-divided scene model and the multi-scene multi-index prediction model and the root mean square error of the utilization rate of each device are shown in fig. 6 to 11.

As can be seen from fig. 6 to 11, the light-load single scene model has the largest prediction error for the heavy-load scene and the smallest prediction error for the light-load scene; the prediction error of the heavy-load single scene model to the heavy-load scene is minimum, and the prediction error to the light-load scene is maximum; the prediction error of the normal single scene to the normal scene is minimum, and the prediction error to the heavy load scene is maximum; the non-divided scene model has the largest error for the heavy-load scene and the smallest prediction error for the normal scene; the multi-scene multi-index prediction model provided by the invention almost keeps the same prediction error for three scenes and is positioned at the optimal position of the four models.

While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A multi-scenario performance index prediction method for a semiconductor production line is characterized by comprising the following steps:

and 4, step 4: after dividing the online data of the production line into different scene results according to a threshold value, selecting a corresponding network from the multi-scene prediction model for prediction to obtain a performance index prediction result;

the process of constructing the prediction network under the normal load scene in the step 2 comprises the following sub-steps:

step 201: coding a scheduling rule symbol in the sample data, and performing normalization processing by taking the scheduling rule symbol as input;

step 204: if the prediction precision of the test result can meet the precision requirement, the prediction network under the normal load scene is successfully established, if the prediction precision of the test result cannot meet the precision requirement, the step 202 is returned, the number of hidden layers, the number of neurons of each hidden layer and excitation functions of each layer are selected again, and the model is trained again;

the deep learning algorithm in step 202 has the following inputs:

for a given training set of k different samples:

X^k＝{S^k，D^k，Ru^k，P^k|S^k∈R^∝，D^k∈R^β，Ru^k∈R^λ，P^k∈R^m}

wherein the content of the first and second substances,

the state is the state of an intelligent workshop system;

feeding information for an intelligent workshop;

scheduling rules for the intelligent workshop;

the output of the deep learning algorithm in step 202 is o_iWhere i is 1, …, m1, m1 indicates the dimension of the output value o, i.e. there are m1 outputs;

the accuracy requirement in step 204 is based on the root mean square error, and the description formula is as follows:

is the output variable sample number;

the process of constructing the prediction network under the light load scene and the heavy load scene in the step 3 comprises the following sub-steps:

step 305: adjusting the number of neurons in the last hidden layer by a network search method, inputting the data under the light load and heavy load scenes divided in the step 1 into a prediction network under the light load and heavy load scenes for training, and obtaining the prediction network under the light load and heavy load scenes after the training is finished;

the step 1 comprises the following sub-steps:

step 102: dividing data according to curve change rules in each line graph and scene quantification corresponding to light load scenes, normal load scenes and heavy load scenes respectively, wherein in the corresponding curves, a starting point is a point corresponding to a total product value when the equipment utilization rate starts to enter a stable stage, and an end point is a point corresponding to the total product value when the slope of the curve of the average processing period of the product is maximum;

2. The method as claimed in claim 1, wherein the production line history data includes input quantity and output quantity, wherein the input quantity includes: sampling days, average processing period of products, average daily moving steps, total number of products in process, queue length of each buffer area, total queue length of the buffer areas and total moving steps; the output quantity comprises: average processing period of products and utilization rate of each device.

3. The method as claimed in claim 1, wherein the scheduling rule symbol in step 201 is encoded by using a one-hot encoding method so that it can be received by a network.

4. A system using the semiconductor production line oriented multi-scenario performance index prediction method of claim 1, characterized in that the system comprises:

and the production scene quantitative division module is used for carrying out quantitative mapping on the historical data of the production line and carrying out data division according to the scene quantification respectively corresponding to the light load scene, the normal load scene and the heavy load scene.