CN112257337B - Method for predicting wafer CMP (chemical mechanical polishing) material removal rate using a GMDH (group method of data handling) neural network - Google Patents
- Publication number
- CN112257337B (CN202011094499.9A)
- Authority
- CN
- China
- Prior art keywords
- hidden layer
- data set
- consumption
- training
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2111/00—Details relating to CAD techniques
- G06F2111/10—Numerical modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2113/00—Details relating to the application field
- G06F2113/18—Chip packaging
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2119/00—Details relating to the type or aim of the analysis or the optimisation
- G06F2119/06—Power analysis or power optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2119/00—Details relating to the type or aim of the analysis or the optimisation
- G06F2119/14—Force analysis or force optimisation, e.g. static or dynamic forces
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Mechanical Treatment Of Semiconductor (AREA)
Abstract
The invention relates to a method for predicting the wafer CMP material removal rate using a GMDH neural network, comprising the following steps: (1) acquiring a polishing sample data set from which abnormal values have been removed; (2) analyzing the samples in the polishing sample data set to determine b effective process variables; (3) extracting the mean, standard deviation, skewness, and kurtosis of each effective process variable to obtain 4 × b feature vectors; (4) screening the 4 × b feature vectors by their correlation with the corresponding MRR values, and determining m feature vectors as the input feature vectors of the GMDH neural network model; (5) normalizing the data set formed by the m feature vectors to obtain a training feature set; (6) training a GMDH network model using a two-variable quadratic Volterra polynomial regression model, with the input feature values in the training feature set as the input layer and the corresponding MRR values as the output layer; (7) inputting the m feature values of a sample to be predicted into the trained GMDH network model, and outputting the predicted MRR value.
Description
Technical Field
The invention belongs to the technical field of prediction methods for semiconductor materials, and relates to a method for predicting the wafer CMP material removal rate using a GMDH neural network.
Background
Chemical mechanical polishing (CMP) is a mainstream process in the back end of wafer fabrication; its purpose is to overcome the problems caused by multilayer metallization of the wafer. CMP planarizes the wafer surface by passivating and etching the wafer material with the slurry chemistry while the wafer is pressed downward so that its surface slides over the slurry particles. Wafer CMP processes are very complex and involve a variety of chemical and mechanical phenomena, such as surface dynamics, electrochemical interfaces, contact mechanics, stress mechanics, fluid dynamics, and tribochemistry.
In wafer CMP, the material removal rate (MRR) is an important index of process performance. However, controlling the quality of all wafers based on measurements requires expensive metrology tools and production cycle time, and rigorous experimental procedures are required in the laboratory. The main factors affecting MRR are the down pressure, the rotational speeds and temperatures of the polishing platen and the polishing head, and the type and flow rate of the slurry. Several models have been proposed to simulate the physical mechanism of CMP. A typical model is the Preston equation, which describes MRR as a function of the pressure P and the relative velocity V between the pad and the wafer; the prediction accuracy of this physical model is a mean square error (MSE) of 870.25. There are also improved models that add the slurry flow rate, contact stress, and chemical reaction rate to the original Preston equation. A deep belief network (DBN) that considers only the general process parameters achieves a test-set MSE of 7.29. Research has focused on developing both physics-based and model-based predictive modeling techniques for CMP.
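For reference, the Preston equation mentioned above is commonly written in the form below. The symbol $K_p$ (the Preston coefficient, an empirical constant) follows common usage and is not notation taken from this document; $P$ is the applied pressure and $V$ the relative velocity between pad and wafer.

```latex
\mathrm{MRR} = K_p \, P \, V
```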
In addition to the factors covered by the above models, wafer MRR is affected by many others, including the rotational speed and temperature of the polishing platen and head and the type of slurry. The above methods consider only general process parameters. The optimal values of these general process parameters have largely been established through research and experiment on the factors influencing MRR in existing CMP processes: pressure, rotational speed, and slurry flow rate are all tightly controlled. In actual polishing, however, wear (i.e., consumption) of consumable components such as the polishing pad and the dresser also has an irreversible effect on the MRR value over time. Existing methods therefore have the defect of considering only a few general process variables and neglecting the critical influence of important consumption variables on the MRR value.
In a CMP process system, a large number of process variables are collected for process control purposes. Thus, the input dimensions of a typical modeling approach are very high, sometimes involving hundreds of input variables, where the presence of redundant features can significantly impact the performance of the virtual metrology model.
Therefore, developing a model that incorporates the important consumption variables, simplifies feature selection and model selection, and achieves high measurement accuracy is of great significance.
Disclosure of Invention
To solve the prior-art problem that the material removal rate in CMP cannot be obtained accurately, the invention provides a method for predicting the wafer CMP material removal rate using a GMDH neural network. It is a complex-system modeling method that combines physical knowledge with statistics. The adaptive modeling approach of the GMDH neural network is used to predict the MRR value accurately and to keep it within the normal range, thereby improving removal rate accuracy. For example, in the wafer rough polishing and fine polishing modes, the MRR value is controlled within 140-170 nm/min and 50-110 nm/min, respectively; if a predicted value falls outside the range, the process parameters are adjusted in time, for example by promptly replacing worn consumables such as the dresser and the polishing pad. Accurate model predictions provide a basis for decision analysis when evaluating the health state of each CMP component.
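The range check described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `mrr_in_range` and the mode labels `"rough"` and `"fine"` are hypothetical, while the numeric limits are the ones quoted in the text.

```python
# MRR limits in nm/min, as quoted in the text for each polishing mode.
MRR_RANGES = {"rough": (140.0, 170.0), "fine": (50.0, 110.0)}


def mrr_in_range(predicted_mrr: float, mode: str) -> bool:
    """Return True if the predicted MRR lies inside the normal range for the mode."""
    low, high = MRR_RANGES[mode]
    return low <= predicted_mrr <= high
```

A prediction for which `mrr_in_range` returns `False` would be the trigger for adjusting process parameters or replacing worn consumables.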
In order to achieve the purpose, the invention adopts the following scheme:
a wafer CMP material removal rate prediction method of a GMDH neural network comprises the following steps:
(1) acquiring a polishing sample data set from which abnormal values have been removed; where the number of samples is n, and each sample contains a process variables (a denotes their number) and the corresponding MRR value (i.e., material removal rate);
(2) performing statistical analysis on the main process variables generated when polishing a plurality of wafers in the polishing sample data set, taking into account both the uniformity and the range of each variable's distribution, and determining b effective process variables that are uniformly distributed over a wide range; wherein b < a;
(3) extracting the mean, standard deviation, skewness, and kurtosis of each effective process variable to obtain 4 × b feature vectors;
(4) screening the 4 × b feature vectors against the corresponding MRR values by regression correlation analysis and, after setting a correlation coefficient threshold, determining m feature vectors as the input feature vectors of the GMDH neural network model; wherein m < (4 × b);
the input dimension of the GMDH neural network model is determined to be m, and y is the corresponding output MRR value (the input feature dimension m scales with the total number of samples n, and its specific value is chosen to improve training accuracy; more features generally give higher accuracy, but accuracy levels off as the feature dimension grows and may even decrease);
(5) normalizing the data set A formed by the m feature vectors to obtain a training feature set A' and a test feature set, wherein the sample size of the training feature set is $n_1$, the sample size of the test feature set is $n_2$, and $n = n_1 + n_2$. Normalization is intended to reduce the influence of inconsistent units and scales of the input data on the prediction accuracy.
(6) using a two-variable quadratic Volterra polynomial regression model, training the GMDH neural network model with the m feature vectors of the training feature set A' (sample size $n_1$) as the input layer and the corresponding MRR values in the training feature set as the output layer; namely:
$$(X'_1, X'_2, \ldots, X'_m) \rightarrow Y, \qquad X'_b = (x'_{1,b}, x'_{2,b}, \ldots, x'_{n_1,b})^T, \qquad Y = (y_1, y_2, \ldots, y_{n_1})^T$$
wherein $X'_b$ is the b-th feature vector in the training feature set A', $x'_{a,b}$ is the feature value of the a-th sample in the b-th feature vector of the training feature set A', $y_a$ is the actual MRR value corresponding to the a-th sample in the training feature set A', $n_1$ is the training sample size, $a \in \{1, 2, \ldots, n_1\}$, and $b \in \{1, 2, \ldots, m\}$;
(7) inputting the m feature values of each sample in the test feature set (the sample set to be tested) into the trained GMDH network model, and outputting the predicted MRR value;
(8) comparing the predicted MRR values with the actual MRR values corresponding to the m input feature values in the test feature set to obtain the prediction accuracy of the model.
In the rough polishing mode, the accuracy of the prediction result is: mean square error MSE of 3.95 and root mean square error RMSE of 1.99;
in the fine polishing mode, the accuracy of the prediction result is: mean square error MSE of 9.82 and root mean square error RMSE of 3.31.
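The two error metrics used in step (8) can be computed as in the following sketch. The function names `mse` and `rmse` are illustrative choices; the definitions themselves are the standard ones, with RMSE being the square root of MSE.

```python
import math


def mse(actual, predicted):
    """Mean square error between actual and predicted MRR values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error, i.e. the square root of the MSE."""
    return math.sqrt(mse(actual, predicted))
```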
As a preferred technical scheme:
In the method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (1), the polishing sample data set is acquired by sensors on the CMP equipment; when the MRR value is 140-170 nm/min, the polishing sample data set is a rough polishing sample data set, and when the MRR value is 50-110 nm/min, it is a fine polishing sample data set;
in step (1), a is 25, and the process variables include chamber pressure, main outer pressure, center pressure, retaining ring pressure, ripple pressure, edge pressure, dresser rotational speed, wafer rotational speed, polishing table rotational speed, type A slurry flow rate, type B slurry flow rate, type C slurry flow rate, polishing table backing film consumption, polishing pad consumption, wafer carrier flexible plate consumption, partition film consumption, dresser table consumption, dressing liquid state, wafer processing chamber, processing stage, wafer identifier, timestamp, wafer ring position identifier, and polishing machine identifier. The first 18 main process variables are selected for statistical distribution analysis. The remaining identifier variables have little influence on the target output and are therefore not analyzed as predictors.
In step (1), abnormal values are removed as follows: Grubbs' test is used to detect abnormal values in order to improve prediction accuracy, where an abnormal value is a maximum or minimum value caused by a sensor measurement failure or by random errors in the process parameters.
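The outlier removal step can be sketched as follows, assuming the usual form of Grubbs' test in which the statistic G is the largest absolute deviation from the sample mean divided by the sample standard deviation. The function names are hypothetical. The critical value depends on the sample size and the chosen significance level, neither of which is given in the text, so the sketch takes it as a caller-supplied function `critical_value(n)` (for example, a lookup in a published Grubbs table).

```python
import statistics


def grubbs_statistic(values):
    """Return (G, index) for the value farthest from the sample mean."""
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (n - 1)
    if std == 0:
        raise ValueError("all values are identical")
    index = max(range(len(values)), key=lambda i: abs(values[i] - mean))
    return abs(values[index] - mean) / std, index


def remove_grubbs_outliers(values, critical_value):
    """Drop the most extreme value repeatedly while G exceeds critical_value(n).

    critical_value is an assumed, caller-supplied function mapping the current
    sample size n to the Grubbs critical value for the chosen significance level.
    """
    kept = list(values)
    while len(kept) > 2 and statistics.stdev(kept) > 0:
        g, index = grubbs_statistic(kept)
        if g <= critical_value(len(kept)):
            break
        del kept[index]
    return kept
```

Because abnormal values are described as maxima or minima, testing only the single most extreme point in each pass matches the description.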
In the method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (2), the effective process variables are those whose statistical width lies in the range 0.12 to 11 (the width is the difference between the maximum and minimum values of the variable). Narrowly distributed variables do not improve the accuracy of the predicted MRR values: the rotational speed, pressure, and flow rate data fall within a narrow range throughout the CMP process, because pressure, rotational speed, and slurry flow rate are all tightly controlled in actual wafer CMP. These process parameters are essentially fixed values that ensure wafer material removal accuracy and removal uniformity. From a physics perspective, pressure and rotational speed are key factors affecting MRR, but from a computational perspective, including them in the GMDH neural network model does not improve MRR prediction accuracy.
When the polishing sample data set is a rough polishing sample data set, the effective process variables are: backing film consumption, polishing pad consumption, partition film consumption, and flexible plate consumption; the other variables take only a few discrete values and are essentially constant, and therefore cannot be used as predictors for the model;
or, when the polishing sample data set is a fine polishing sample data set, the effective process variables are: backing film consumption, polishing pad consumption, and partition film consumption.
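The width criterion can be illustrated with the sketch below. It assumes that "a statistical width range of 0.12 to 11" means the width (maximum minus minimum) must fall between those two bounds; the function name and the dictionary layout of the data are hypothetical.

```python
def select_effective_variables(variables, min_width=0.12, max_width=11.0):
    """Return names of variables whose width (max - min) lies in [min_width, max_width].

    variables maps a variable name to the list of values observed for it.
    """
    selected = []
    for name, values in variables.items():
        width = max(values) - min(values)
        if min_width <= width <= max_width:
            selected.append(name)
    return selected
```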
In the method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (4), when the polishing sample data set is the rough polishing sample data set, m is 8, and the input process variable features corresponding to the input feature vectors are: the mean of backing film consumption, the skewness of backing film consumption, the mean of polishing pad consumption, the standard deviation of polishing pad consumption, the skewness of polishing pad consumption, the mean of partition film consumption, the skewness of partition film consumption, and the mean of flexible plate consumption;
or, when the polishing sample data set is the fine polishing sample data set, m is 8, and the input process variable features corresponding to the input feature vectors are: the skewness of backing film consumption, the mean of polishing pad consumption, the standard deviation of polishing pad consumption, the skewness of polishing pad consumption, the kurtosis of polishing pad consumption, the mean of partition film consumption, the standard deviation of partition film consumption, and the skewness of partition film consumption.
The manner of obtaining the characteristic values is well known.
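A minimal sketch of steps (3) and (4) follows. Several details are assumptions rather than statements from the text: the moments use the population (divide-by-n) definitions, kurtosis is not excess-corrected, the correlation measure is the Pearson coefficient, and the threshold is a free parameter because no value is given. All function names are hypothetical.

```python
import math
import statistics


def summary_features(series):
    """Mean, standard deviation, skewness and kurtosis of one variable for one wafer."""
    n = len(series)
    mean = statistics.fmean(series)
    std = statistics.pstdev(series)  # population standard deviation (assumption)
    if std == 0:
        return {"mean": mean, "std": 0.0, "skewness": 0.0, "kurtosis": 0.0}
    skewness = sum(((x - mean) / std) ** 3 for x in series) / n
    kurtosis = sum(((x - mean) / std) ** 4 for x in series) / n
    return {"mean": mean, "std": std, "skewness": skewness, "kurtosis": kurtosis}


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def screen_features(feature_columns, mrr, threshold):
    """Keep features whose absolute correlation with MRR reaches the threshold."""
    return [
        name
        for name, column in feature_columns.items()
        if abs(pearson(column, mrr)) >= threshold
    ]
```

Here `feature_columns` maps a feature name (for example, the mean polishing pad consumption) to its values across wafers, and `mrr` holds the MRR value of each wafer in the same order.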
In the method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (5), the data set A formed by the m feature vectors is as follows:
$$A = \begin{bmatrix} x_{1,1} & x_{1,2} & \cdots & x_{1,m} \\ x_{2,1} & x_{2,2} & \cdots & x_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n,1} & x_{n,2} & \cdots & x_{n,m} \end{bmatrix}$$
wherein $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^T$ is the b-th feature vector in data set A, $x_{a,b}$ is the feature value of the a-th sample in the b-th feature vector of data set A, m = 8, $a \in \{1, 2, \ldots, n\}$, and $b \in \{1, 2, \ldots, m\}$;
normalization means that the feature vectors in data set A are normalized one by one to obtain a normalized data set, wherein the feature vectors are $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^T$, $a \in \{1, 2, \ldots, n\}$, $b \in \{1, 2, \ldots, m\}$, and the normalization formula is:
$$x_{normalized} = \frac{x_{actual} - x_{min}}{x_{max} - x_{min}}$$
wherein $x_{normalized}$ is the normalized feature value in the b-th feature vector, $x_{actual}$ is the feature value in the b-th feature vector, $x_{max}$ is the largest feature value in the b-th feature vector, $x_{min}$ is the smallest feature value in the b-th feature vector, and $b \in \{1, 2, \ldots, m\}$;
in the normalized data set, $n_1$ samples are randomly selected as the training feature set A', recorded as:
$$A' = (X'_1, X'_2, \ldots, X'_m) = \begin{bmatrix} x'_{1,1} & x'_{1,2} & \cdots & x'_{1,m} \\ x'_{2,1} & x'_{2,2} & \cdots & x'_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x'_{n_1,1} & x'_{n_1,2} & \cdots & x'_{n_1,m} \end{bmatrix}$$
wherein $X'_b = (x'_{1,b}, x'_{2,b}, \ldots, x'_{n_1,b})^T$ is the b-th feature vector in the training feature set A', $x'_{a,b}$ is the feature value of the a-th sample in the b-th feature vector of the training feature set A', $n_1$ is the training sample size, m = 8, $a \in \{1, 2, \ldots, n_1\}$, and $b \in \{1, 2, \ldots, m\}$.
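The normalization and random split of step (5) can be sketched as below. The function names, the fixed random seed, and the handling of a constant feature vector (mapped to zeros to avoid division by zero) are assumptions made for illustration.

```python
import random


def min_max_normalize(feature_vectors):
    """Scale each feature vector to [0, 1] using its own minimum and maximum."""
    normalized = []
    for vector in feature_vectors:
        low, high = min(vector), max(vector)
        span = high - low
        # A constant vector has zero span; map it to zeros (assumption).
        normalized.append([(x - low) / span if span else 0.0 for x in vector])
    return normalized


def split_samples(n, n_train, seed=0):
    """Randomly choose n_train of the n sample indices for training; the rest are for testing."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return sorted(indices[:n_train]), sorted(indices[n_train:])
```

Each feature vector is normalized independently, which matches the "one by one" wording of the step.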
In the method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (6), training the GMDH neural network model specifically comprises:
(61) establishing the 1st hidden layer:
(611) taking any two feature vectors $X'_i$ and $X'_j$ from the 8 feature vectors in the training feature set A' and creating $G_1$ second-order polynomial equations as basic neurons, i.e., the total number of basic neurons in the 1st hidden layer is
$$G_1 = \binom{m}{2} = \frac{m(m-1)}{2} > P$$
wherein m = 8, and P is the threshold for the maximum total number of neurons in each hidden layer;
(612) computing the model corresponding to each basic neuron in the 1st hidden layer by Volterra quadratic polynomial regression:
$$\hat{Y}_r^1 = (\hat{y}_{1,r}^1, \hat{y}_{2,r}^1, \ldots, \hat{y}_{n_1,r}^1)^T, \qquad \hat{y}_{a,r}^1 = w_0 + w_1 x'_{a,i} + w_2 x'_{a,j} + w_3 x'_{a,i} x'_{a,j} + w_4 {x'_{a,i}}^2 + w_5 {x'_{a,j}}^2$$
wherein $\hat{Y}_r^1$ is the predicted target output vector of the quadratic polynomial equation, $\hat{y}_{a,r}^1$ is the predicted target output of the a-th sample in the r-th basic neuron model of the 1st hidden layer, $x'_{a,i}$ and $x'_{a,j}$ are the i-th and j-th input feature values of the a-th sample, selected from the inputs as the elements connected to form the r-th basic neuron model of the 1st hidden layer, $a \in \{1, 2, \ldots, n_1\}$, and $i, j \in \{1, 2, \ldots, m\}$;
$\{w_0, w_1, w_2, w_3, w_4, w_5\}$ are the coefficients of the quadratic polynomial equation, obtained by the least squares method with the goal of minimizing the difference between $\hat{Y}_r^1$ and $Y_r^1$; wherein $Y_r^1$ is the vector of actual MRR values of the $n_1$ training samples in the training feature set A';
(613) computing the output root mean square error $RMSE_r^1$ of each neuron in the 1st hidden layer, namely:
$$RMSE_r^1 = \sqrt{\frac{1}{n_1} \sum_{a=1}^{n_1} \left( \hat{y}_{a,r}^1 - y_{a,r}^1 \right)^2}$$
wherein $\hat{y}_{a,r}^1$ is the predicted target output of the a-th sample in the r-th basic neuron model of the 1st hidden layer, and $y_{a,r}^1$ is the actual target output of the a-th sample in the r-th basic neuron model of the 1st hidden layer;
(614) sorting all neurons in the 1st hidden layer by output root mean square error in ascending order, and taking the first P neurons as the effective neurons that form the 1st hidden layer;
(615) taking each output of the P neurons in the 1st hidden layer as an input feature vector of the 2nd hidden layer; the 2nd hidden layer forms $G_2$ basic neurons with $G_2 > P$, and steps (611) to (614) are repeated to obtain a 2nd hidden layer containing P neurons, wherein $U_1$ effective neurons of the 1st hidden layer participate in the combination and connection that forms the 2nd hidden layer;
(616) computing the mean output RMSE, $E_1$, of the $U_1$ effective neurons of the 1st hidden layer that participate in the combination and connection forming the 2nd hidden layer, namely:
$$E_1 = \frac{1}{U_1} \sum_{r=1}^{U_1} RMSE_r^1$$
(62) establishing the intermediate hidden layers: taking each output of the P neurons in the (k-1)-th hidden layer (corresponding to the 1st hidden layer above) as an input feature vector of the k-th hidden layer (corresponding to the 2nd hidden layer above), and repeating steps (611) to (616) to obtain the intermediate hidden layers;
the mean output RMSE, $E_{k-1}$, of the $U_{k-1}$ effective neurons of the (k-1)-th hidden layer that participate in the combination and connection forming the k-th hidden layer is:
$$E_{k-1} = \frac{1}{U_{k-1}} \sum_{r=1}^{U_{k-1}} RMSE_r^{k-1}$$
(63) establishing the output layer: training stops when $E_{k-1} - E_k \le 0.3$ (i.e., when $E_k$ of the k-th hidden layer no longer decreases significantly as the number of hidden layers increases); the 2 neurons with the smallest output RMSE in the k-th hidden layer are then used as new input vectors, together with the corresponding target output MRR vector, to construct a quadratic polynomial equation, and the output of this equation is the final predicted output of the GMDH neural network model.
The method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, where P is 12.
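The layer-by-layer training described in step (6) can be sketched with NumPy as follows. This is a simplified illustration under stated assumptions, not the patented implementation: all function names are hypothetical, the layer error is taken as the mean RMSE of all P retained neurons (rather than only those later connected to the next layer), errors are evaluated on the training data, and `max_layers` is an added safety cap. With m = 8 inputs the first layer has 28 candidate neurons, of which P = 12 are kept.

```python
from itertools import combinations

import numpy as np


def design_matrix(xi, xj):
    """Terms of the two-variable quadratic polynomial: 1, xi, xj, xi*xj, xi^2, xj^2."""
    return np.column_stack([np.ones_like(xi), xi, xj, xi * xj, xi**2, xj**2])


def fit_layer(inputs, y, max_neurons):
    """Fit one neuron per input pair by least squares; keep those with the smallest RMSE."""
    neurons = []
    for i, j in combinations(range(inputs.shape[1]), 2):
        phi = design_matrix(inputs[:, i], inputs[:, j])
        w, *_ = np.linalg.lstsq(phi, y, rcond=None)
        rmse = float(np.sqrt(np.mean((phi @ w - y) ** 2)))
        neurons.append((rmse, i, j, w))
    neurons.sort(key=lambda neuron: neuron[0])
    return neurons[:max_neurons]


def layer_outputs(inputs, neurons):
    """Evaluate every neuron of a layer; columns are ordered by ascending RMSE."""
    return np.column_stack(
        [design_matrix(inputs[:, i], inputs[:, j]) @ w for _, i, j, w in neurons]
    )


def train_gmdh(x, y, max_neurons=12, tol=0.3, max_layers=10):
    """Add hidden layers until the mean RMSE improves by at most tol, then fit the output neuron."""
    if x.shape[1] < 3 or max_neurons < 3:
        raise ValueError("need at least 3 input features and max_neurons >= 3")
    layers, inputs, previous_error = [], x, None
    for _ in range(max_layers):
        neurons = fit_layer(inputs, y, max_neurons)
        error = float(np.mean([neuron[0] for neuron in neurons]))
        layers.append(neurons)
        inputs = layer_outputs(inputs, neurons)
        if previous_error is not None and previous_error - error <= tol:
            break
        previous_error = error
    # Output layer: one quadratic neuron on the two best neurons of the last hidden layer.
    layers.append(fit_layer(inputs[:, :2], y, 1))
    return layers


def predict_gmdh(layers, x):
    """Propagate inputs through the hidden layers and the final output neuron."""
    outputs = x
    for neurons in layers[:-1]:
        outputs = layer_outputs(outputs, neurons)
    return layer_outputs(outputs[:, :2], layers[-1])[:, 0]
```

Using `np.linalg.lstsq` rather than solving the normal equations directly is a design choice for this sketch: outputs of deeper layers tend to be nearly collinear, and the SVD-based solver tolerates the resulting rank deficiency.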
Advantageous effects
(1) The virtual prediction method for the wafer CMP material removal rate of the GMDH neural network exploits the single-cycle prediction capability and short run time of the GMDH network, and self-organizes the selection of the optimal feature set to build the wafer MRR prediction model. Combining physical knowledge with statistics overcomes the defect of traditional prediction methods, which consider only a few variables and neglect the influence of important consumption variables on MRR. This solves the problem that the removal rate in the wafer CMP process cannot be obtained quickly and accurately.
(2) The virtual prediction method for the wafer CMP material removal rate of the GMDH neural network takes into account error sources such as drift in the polishing process and differences between wafer lots; to improve the accuracy of the prediction model, the rough polishing and fine polishing modes are modeled separately.
(3) In the virtual prediction method for the wafer CMP material removal rate of the GMDH neural network, the network structure trained for rough polishing selects the mean values of the polishing pad, backing film, and flexible plate consumption variables as the most effective predictors. The variable used most widely in the input-layer polynomial equations is the mean polishing pad consumption.
(4) In the virtual prediction method for the wafer CMP material removal rate of the GMDH neural network, the network structure trained for fine polishing selects the mean, standard deviation, and skewness of polishing pad consumption, the skewness of backing film consumption, and the skewness of partition film consumption as the most effective predictors. The variable used most widely in the input-layer polynomial equations is the standard deviation of polishing pad consumption.
(5) In the virtual prediction method for the wafer CMP material removal rate of the GMDH neural network, the network model quickly yields an accurate prediction of the MRR value. In both the fine polishing and rough polishing modes, wear of the polishing pad has the largest influence on the target output MRR. Therefore, if a predicted value falls outside the MRR range, the process parameters are adjusted in time, for example by promptly replacing worn consumables such as the polishing pad and the dresser; the polishing pad in particular is treated as a key maintenance component of the wafer CMP process. According to the 2016 PHM data and the computed prediction error metrics, the proposed MRR prediction method outperforms both the physical model and conventional neural networks. The method is suitable for modeling nonlinear, complex CMP processes.
Drawings
FIGS. 1 and 2 are flow charts of wafer removal rate prediction according to the present invention;
FIG. 3(a) shows the detection results of 4 MRR abnormal values according to the present invention;
FIG. 3(b) is a graph of MRR value distribution for two modes of operation in a CMP process of the present invention;
FIG. 4 is a schematic diagram of a GMDH network structure trained during a wafer rough polishing stage according to the present invention;
FIGS. 5 and 6 are predicted results of the rough polishing stage and the fine polishing stage, respectively, of the present invention;
FIG. 7 is a graph of a statistical analysis of process variables in the rough polishing mode of the present invention.
Detailed Description
The invention will be further illustrated with reference to specific embodiments. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and such equivalents may fall within the scope of the present invention as defined in the appended claims.
A method for predicting a wafer CMP material removal rate of a GMDH neural network, the flow diagram of which is shown in FIGS. 1-2, comprises the following steps:
(1) acquiring a polishing sample data set after removing the abnormal value; where the number of samples is n, each sample contains 25 process variables and corresponding MRR values (i.e., material removal rates);
the process variables include chamber pressure, main and external pressure, center pressure, retaining ring pressure, ripple pressure, edge pressure, dresser rotational speed, wafer rotational speed, polishing table rotational speed, type a slurry flow rate, type B slurry flow rate, type C slurry flow rate, consumption of polishing table backing film, consumption of polishing pad, consumption of wafer carrier flex, consumption of zoning film, consumption of dresser table, dressing solution state, chamber for wafer processing, process treatment stage, wafer identifier, time cut, wafer ring position identifier, and polishing machine identifier. The first 18 main process variables were selected for statistical distribution analysis. Other identifier variables have less impact on the target output and therefore are not analyzed as predictors.
The method for removing the abnormal value comprises the following steps: and detecting an abnormal value by using Grubbs to improve the prediction precision, wherein the abnormal value is generated by the measurement failure of a sensor and the occurrence of random errors of process parameters and is a maximum value or a minimum value.
(2) Performing statistical analysis on main process variables generated by polishing a plurality of wafers in the polishing sample data set, and determining b effective process variables with statistical width ranges of 0.12-11 (the width is the difference between the maximum value and the minimum value in the variables); wherein b is less than a;
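A minimal sketch of the width-based selection in step (2), assuming each process variable is available as a list of readings. The variable names and numbers are hypothetical; only the 0.12-11 width range comes from the text.

```python
def select_effective_variables(readings, low=0.12, high=11.0):
    """Keep variables whose width (max - min) lies within [low, high]."""
    effective = {}
    for name, values in readings.items():
        width = max(values) - min(values)
        if low <= width <= high:
            effective[name] = width
    return effective

# Hypothetical readings for four process variables.
readings = {
    "backing_film_usage": [0.0, 4.2, 10.83],   # width 10.83, kept
    "pad_usage": [1.0, 5.0, 10.63],            # width about 9.63, kept
    "stage_rotation": [0.0, 150.0, 300.0],     # width 300, too wide
    "chamber_pressure": [5.0, 5.0, 5.01],      # width 0.01, nearly constant
}
effective = select_effective_variables(readings)
```

Variables that are essentially constant or spread over a very wide range fall outside the interval and are dropped as predictors.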
(3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors;
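The feature extraction in step (3) might look like the sketch below. It assumes each wafer contributes one series of readings per effective variable and uses population (biased) moment definitions; the text does not say which estimator is used, so that choice and all names here are assumptions.

```python
import math

def four_moments(values):
    """Return (mean, standard deviation, skewness, kurtosis) of a series."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0:  # constant series: skewness and kurtosis are undefined
        return mean, 0.0, 0.0, 0.0
    return mean, math.sqrt(m2), m3 / m2 ** 1.5, m4 / m2 ** 2

def build_features(variables):
    """Map b variables to 4 * b named statistical features."""
    features = {}
    for name, series in variables.items():
        stats = four_moments(series)
        for label, value in zip(("mean", "std", "skew", "kurt"), stats):
            features[f"{name}_{label}"] = value
    return features

# Hypothetical readings for two effective variables of one wafer.
wafer = {"pad_usage": [1.0, 2.0, 3.0, 4.0, 5.0],
         "film_usage": [0.5, 0.5, 0.7, 0.9, 1.4]}
features = build_features(wafer)
```

With b effective variables this yields 4 × b features per wafer, which is the pool that the correlation screening in step (4) starts from.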
(4) screening the 4 × b feature vectors against the corresponding MRR values by a regression correlation analysis method, and, after setting a correlation coefficient threshold, determining m feature vectors as the input feature vectors of the GMDH neural network model; wherein m = 8 and m < 4 × b;
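One way to read the screening in step (4) is a threshold on the correlation between each candidate feature and the MRR. The sketch below assumes the Pearson coefficient and compares its absolute value with the threshold; the text only says "regression correlation analysis", so both choices, and the example data, are assumptions.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:  # a constant column carries no information
        return 0.0
    return cov / (sx * sy)

def screen_features(features, target, threshold):
    """Return names of features whose |r| with the target meets the threshold."""
    return [name for name, column in features.items()
            if abs(pearson(column, target)) >= threshold]

# Hypothetical feature columns and MRR targets.
mrr = [1.0, 2.0, 3.0, 4.0, 5.0]
candidates = {
    "pad_mean": [2.0, 4.0, 6.0, 8.0, 10.5],   # strongly correlated
    "film_skew": [5.0, 4.0, 3.0, 2.0, 1.0],   # strongly anti-correlated
    "noise": [3.0, 1.0, 4.0, 1.0, 5.0],       # weakly correlated
}
selected = screen_features(candidates, mrr, threshold=0.65)
```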
(5) normalizing the data set A formed by the m feature vectors to obtain a training feature set A' and a test feature set, wherein the sample size of the training feature set is $n_1$, the sample size of the test feature set is $n_2$, and $n = n_1 + n_2$; the specific process is as follows:
the data set formed by the m feature vectors is A, as follows:

$$A = \begin{pmatrix} x_{1,1} & x_{1,2} & \cdots & x_{1,m} \\ x_{2,1} & x_{2,2} & \cdots & x_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n,1} & x_{n,2} & \cdots & x_{n,m} \end{pmatrix}$$

wherein $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^{T}$ is the b-th feature vector in data set A, $x_{a,b}$ is the feature value of the b-th feature vector for the a-th sample in data set A, m = 8, a ∈ {1, 2, …, n}, b ∈ {1, 2, …, m};
the normalization processing means that the feature vectors $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^{T}$ in data set A, a ∈ {1, 2, …, n}, b ∈ {1, 2, …, m}, are normalized one by one to obtain a normalized data set, and the normalization formula is as follows:

$$x_{normalized} = \frac{x_{actual} - x_{min}}{x_{max} - x_{min}}$$

wherein $x_{normalized}$ is the normalized feature value in the b-th feature vector, $x_{actual}$ is the feature value in the b-th feature vector, $x_{max}$ is the largest feature value in the b-th feature vector, $x_{min}$ is the smallest feature value in the b-th feature vector, and b ∈ {1, 2, …, m};
in the normalized data set, $n_1$ samples are randomly selected as the training feature set A', denoted:

$$A' = \begin{pmatrix} x'_{1,1} & x'_{1,2} & \cdots & x'_{1,m} \\ x'_{2,1} & x'_{2,2} & \cdots & x'_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x'_{n_1,1} & x'_{n_1,2} & \cdots & x'_{n_1,m} \end{pmatrix}$$

wherein $X'_b = (x'_{1,b}, x'_{2,b}, \ldots, x'_{n_1,b})^{T}$ is the b-th feature vector in the training feature set A', $x'_{a,b}$ is the feature value of the b-th feature vector for the a-th sample in the training feature set A', $n_1$ is the training sample size, m = 8, a ∈ {1, 2, …, $n_1$}, b ∈ {1, 2, …, m};
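The normalization and random split of step (5) can be sketched as below, assuming min-max scaling applied column by column and a seeded shuffle for the split. The function names, the seed, and the split sizes are illustrative assumptions.

```python
import random

def min_max_normalize(column):
    """Scale one feature column to [0, 1] using its own min and max."""
    lo, hi = min(column), max(column)
    if hi == lo:  # constant column: map everything to 0.0
        return [0.0] * len(column)
    return [(v - lo) / (hi - lo) for v in column]

def split_indices(n, n_train, seed=0):
    """Randomly choose n_train of n sample indices for training."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return sorted(indices[:n_train]), sorted(indices[n_train:])

normalized = min_max_normalize([2.0, 4.0, 6.0])
train_idx, test_idx = split_indices(n=10, n_train=7)
```

Each feature vector is scaled independently, so features with large raw ranges (such as consumption counters) do not dominate the polynomial fits that follow.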
(6) adopting a bivariate quadratic Volterra polynomial regression model, taking the m feature vectors in the training feature set A' of sample size $n_1$ as the input layer and the corresponding MRR values in the training feature set as the output layer, and training a GMDH neural network model; namely:

$$(X'_1, X'_2, \ldots, X'_m) \longmapsto Y = (y_1, y_2, \ldots, y_{n_1})^{T}$$

wherein $X'_b = (x'_{1,b}, x'_{2,b}, \ldots, x'_{n_1,b})^{T}$ is the b-th feature vector in the training feature set A', $x'_{a,b}$ is the feature value of the b-th feature vector for the a-th sample in the training feature set A', $y_a$ is the actual MRR value corresponding to the a-th sample in the training feature set A', $n_1$ is the training sample size, a ∈ {1, 2, …, $n_1$}, b ∈ {1, 2, …, m};
The specific steps for training the GMDH neural network model are as follows:
(61) establishing a 1 st hidden layer:
(611) arbitrarily taking two feature vectors $X_i$ and $X_j$ from the 8 feature vectors in the training feature set A' and creating $G_1$ second-order polynomial equations as basic neurons, i.e. the total number of basic neurons in the 1st hidden layer is

$$G_1 = \binom{m}{2} = \frac{m(m-1)}{2}$$

wherein m = 8, and P is the threshold on the maximum total number of neurons in each hidden layer; P = 12;
(612) calculating, according to Volterra quadratic polynomial regression, the model corresponding to each basic neuron in the 1st hidden layer:

$$\hat{y}^{1}_{a,r} = w_0 + w_1 x'_{a,i} + w_2 x'_{a,j} + w_3 x'_{a,i} x'_{a,j} + w_4 (x'_{a,i})^2 + w_5 (x'_{a,j})^2$$

wherein $\hat{Y}^{1}_{r} = (\hat{y}^{1}_{1,r}, \hat{y}^{1}_{2,r}, \ldots, \hat{y}^{1}_{n_1,r})^{T}$ is the predicted target output vector of the quadratic polynomial equation, $\hat{y}^{1}_{a,r}$ is the predicted target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer, r is the serial number of the basic neuron in the 1st hidden layer, $x'_{a,j}$ is the j-th input feature value of the a-th sample, selected from the inputs as an element connected to form the r-th basic neuron model of the 1st hidden layer, a ∈ {1, 2, …, $n_1$}, and i, j ∈ {1, 2, …, m}; P = 12.
$\{w_0, w_1, w_2, w_3, w_4, w_5\}$ are the coefficients of the quadratic polynomial equation, obtained by the least squares method with the objective of minimizing the difference between $\hat{Y}^{1}_{r}$ and $Y^{1}_{r}$; wherein $Y^{1}_{r}$ is the vector of actual MRR values of the $n_1$ training samples in the training feature set A';
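Fitting one basic neuron, as in steps (611) and (612), can be sketched as follows. The sketch assumes the usual six-term quadratic form in two inputs (constant, two linear terms, the cross term, two squares) and solves the least squares problem through the normal equations with plain Gaussian elimination. The term ordering, the function names, and the synthetic data are assumptions, and a production version would guard against singular systems.

```python
from itertools import combinations

def design_row(xi, xj):
    """Terms of the quadratic polynomial in two inputs."""
    return [1.0, xi, xj, xi * xj, xi * xi, xj * xj]

def solve_linear(a, b):
    """Solve a * w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (aug[r][n] - tail) / aug[r][r]
    return w

def fit_neuron(xi, xj, y):
    """Least squares coefficients of one basic neuron (normal equations)."""
    rows = [design_row(a, b) for a, b in zip(xi, xj)]
    ata = [[sum(r[p] * r[q] for r in rows) for q in range(6)] for p in range(6)]
    aty = [sum(r[p] * t for r, t in zip(rows, y)) for p in range(6)]
    return solve_linear(ata, aty)

def predict_neuron(w, xi, xj):
    """Evaluate a fitted neuron on paired inputs."""
    return [sum(c * v for c, v in zip(w, design_row(a, b)))
            for a, b in zip(xi, xj)]

# All candidate input pairs for m = 8 features: one basic neuron per pair.
pairs = list(combinations(range(8), 2))

# Synthetic check: recover known coefficients from noise-free data.
true_w = [1.0, 2.0, 3.0, 0.5, -1.0, 0.25]
grid = [(a / 4, b / 2) for a in range(5) for b in range(3)]
xi = [p[0] for p in grid]
xj = [p[1] for p in grid]
y = predict_neuron(true_w, xi, xj)
fitted_w = fit_neuron(xi, xj, y)
```

For m = 8 the pair enumeration gives 28 candidate neurons, each fitted independently against the same MRR target vector.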
(613) calculating the output root mean square error $RMSE^{1}_{r}$ of each neuron in the 1st hidden layer, namely:

$$RMSE^{1}_{r} = \sqrt{\frac{1}{n_1} \sum_{a=1}^{n_1} \left( \hat{y}^{1}_{a,r} - y^{1}_{a,r} \right)^2}$$

wherein $\hat{y}^{1}_{a,r}$ is the predicted target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer, $y^{1}_{a,r}$ is the true target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer, and r is the serial number of the basic neuron in the 1st hidden layer;
(614) sorting all neurons in the 1st hidden layer by output root mean square error from small to large, and taking the first P neurons as effective neurons to form the 1st hidden layer;
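The ranking in steps (613) and (614) amounts to scoring each candidate neuron by RMSE and keeping the best P. A minimal sketch, with hypothetical neuron labels and outputs:

```python
import math

def rmse(predicted, actual):
    """Root mean square error between two equal-length sequences."""
    n = len(actual)
    return math.sqrt(sum((p - a) ** 2 for p, a in zip(predicted, actual)) / n)

def select_effective_neurons(outputs, actual, p_max=12):
    """Sort neurons by ascending RMSE and keep at most p_max of them."""
    scored = sorted((rmse(out, actual), name) for name, out in outputs.items())
    return scored[:p_max]

# Hypothetical outputs of three candidate neurons on three training samples.
actual = [1.0, 2.0, 3.0]
outputs = {
    "n(1,2)": [1.0, 2.0, 3.0],
    "n(1,3)": [2.0, 3.0, 4.0],
    "n(2,3)": [1.0, 2.0, 5.0],
}
kept = select_effective_neurons(outputs, actual, p_max=2)
```

The outputs of the kept neurons then serve as the input feature vectors of the next hidden layer.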
(615) taking the outputs of the P neurons in the 1st hidden layer as the input feature vectors of the 2nd hidden layer, wherein P = 12; the 2nd hidden layer forms $G_2$ basic neurons, with

$$G_2 = \binom{P}{2} = \frac{P(P-1)}{2}$$

and steps (611) to (614) are repeated to obtain a 2nd hidden layer containing P neurons, wherein P = 12 and $U_1$ effective neurons of the 1st hidden layer participate in the combinations and connections that form the 2nd hidden layer;
(616) calculating the mean value $E_1$ of the output RMSE of the $U_1$ effective neurons of the 1st hidden layer that participate in the combinations and connections forming the 2nd hidden layer, namely:

$$E_1 = \frac{1}{U_1} \sum_{r=1}^{U_1} RMSE^{1}_{r}$$

wherein r is the serial number of each of the $U_1$ effective neurons in the 1st hidden layer;
(62) establishing the middle hidden layers: taking the outputs of the P neurons in the (k-1)-th hidden layer as the input feature vectors of the k-th hidden layer, and repeating steps (611) to (616) to obtain the middle hidden layers;
for the k-th hidden layer, the mean value $E_k$ of the output RMSE of its $U_k$ effective neurons participating in combination and connection is calculated in the same way, namely:

$$E_k = \frac{1}{U_k} \sum_{r=1}^{U_k} RMSE^{k}_{r}$$
(63) establishing the output layer: when $E_{k-1} - E_k \le 0.3$ (i.e., $E_k$ of the k-th hidden layer no longer decreases significantly as the number of hidden layers increases), training stops; the 2 neurons with the smallest output RMSE in the k-th hidden layer are taken as new input vectors and, together with the corresponding target output MRR vector, are used to construct a quadratic polynomial equation, whose output is the final output prediction value of the GMDH neural network model.
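The stopping rule of step (63) can be isolated in a few lines. The sketch below assumes the per-layer mean RMSE values have already been computed and only decides at which layer growth stops; the function name and the example values are hypothetical, while the 0.3 tolerance is the one stated above.

```python
def stopping_layer(mean_rmse_per_layer, tol=0.3):
    """Return the 1-based layer index k at which E_{k-1} - E_k <= tol.

    Returns None if the improvement never drops to tol or below.
    """
    for k in range(2, len(mean_rmse_per_layer) + 1):
        e_prev = mean_rmse_per_layer[k - 2]
        e_curr = mean_rmse_per_layer[k - 1]
        if e_prev - e_curr <= tol:
            return k
    return None

# Hypothetical mean RMSE values E_1 .. E_4 for successive hidden layers.
mean_rmse = [5.0, 3.8, 3.1, 3.0]
k_stop = stopping_layer(mean_rmse)
```

With these illustrative values the improvement from the 3rd to the 4th layer is only 0.1, so growth stops at k = 4 and the two best neurons of that layer would feed the output polynomial.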
(7) Inputting m characteristic values serving as input in a test characteristic set (a sample set to be tested) into a trained GMDH network model, and outputting a predicted MRR value;
(8) comparing the predicted MRR value with the MRR values corresponding to the m input feature values in the test feature set to obtain the accuracy of the model prediction.
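The comparison in step (8) is reported through MSE and RMSE in the embodiments. A minimal sketch of both metrics, with hypothetical predictions and measurements:

```python
import math

def mse(predicted, actual):
    """Mean square error between predictions and measurements."""
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)

def rmse(predicted, actual):
    """Root mean square error, i.e. the square root of the MSE."""
    return math.sqrt(mse(predicted, actual))

# Hypothetical predicted and measured MRR values in nm/min.
predicted = [150.0, 160.0, 155.0]
measured = [152.0, 158.0, 155.0]
test_mse = mse(predicted, measured)
test_rmse = rmse(predicted, measured)
```

Since RMSE is the square root of MSE, the reported pairs can be cross-checked: an MSE of 3.95 corresponds to an RMSE of about 1.99, and an MSE of 9.82 to about 3.13.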
The above method for predicting the wafer CMP material removal rate with a GMDH neural network was applied to a rough polishing sample data set with MRR values of 140-170 nm/min, acquired by sensors on the CMP equipment (the MRR value distribution is shown in FIG. 3(b)). The data come from the 2016 PHM data challenge. The Grubbs test was used to detect abnormal values, and 4 abnormal values with MRR far greater than 170 nm/min were found (the abnormal value detection result is shown in FIG. 3(a)); after removing them, the rough polishing sample data set was obtained. The number of samples in this data set is n = 102, and each sample contains 25 process variables and the corresponding MRR value (i.e., material removal rate);
in the rough polishing sample data set after removal of the abnormal values, five wafer polishing samples were randomly selected for statistical analysis of the process variables (as shown in FIG. 7), and 4 effective process variables were determined (variables with a data width of 0.12-11, namely backing film consumption, polishing pad consumption, partition film consumption and flexible board consumption); the distribution range of the backing film consumption is 10.83, that of the polishing pad consumption is 9.63, that of the partition film consumption is 3.25, and that of the flexible board consumption is 0.12; the other, ineffective variables are either distributed more discretely or are essentially constant, so they cannot be used as predictors for the model;
extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable in the rough polishing sample data set to obtain 16 feature vectors;
screening 16 characteristic vectors and corresponding MRR values in the rough polishing sample data set by adopting a regression correlation analysis method, and setting a correlation coefficient threshold (the value is 0.65), so that 8 characteristic vectors with strong correlation can be determined to be used as input characteristic vectors of the GMDH neural network model, see table 1, as follows:
TABLE 1
Serial number | Name of characteristic variable |
---|---|
X1 | Mean of backing film consumption |
X2 | Skewness of backing film consumption |
X3 | Mean of polishing pad consumption |
X4 | Standard deviation of polishing pad consumption |
X5 | Skewness of polishing pad consumption |
X6 | Mean of partition film consumption |
X7 | Skewness of partition film consumption |
X8 | Mean of flexible board consumption |
A schematic diagram of the trained GMDH network structure is shown in FIG. 4. It shows that the input vector used most widely by the network, starting from the input layer, is X3 (the mean polishing pad consumption in Table 1), which makes it the most effective MRR predictor. In the 1st hidden layer, some of the candidate neurons are eliminated: according to the model evaluation criterion, the new neurons formed in the next layer by combining them have a larger output RMSE, which indicates that their correlation with the output MRR is weak, so they do not take part in the network connections of the next layer. The remaining neurons are the effective neurons of the 1st hidden layer. When the 4th hidden layer is constructed, the mean output RMSE of all effective neurons no longer shows a clear downward trend as the number of hidden layers increases, so training stops, yielding a GMDH neural network topology with 4 hidden layers, i.e., the GMDH network model.
Inputting 8 characteristic values which are taken as input in the corresponding test characteristic set into a trained GMDH network model, and outputting a predicted MRR value;
comparing the predicted MRR value with the MRR values corresponding to the 8 input feature values in the test feature set to obtain the accuracy of the model prediction; a schematic diagram of the prediction result is shown in FIG. 5. In the rough polishing working mode, the accuracy of the prediction result is as follows: the mean square error (MSE) is 3.95 and the root mean square error (RMSE) is 1.99.
The above method for predicting the wafer CMP material removal rate with a GMDH neural network was also applied to a fine polishing sample data set with MRR values of 50-110 nm/min, acquired by sensors on the CMP equipment (the MRR value distribution is shown in FIG. 3(b)). The data come from the 2016 PHM data challenge. The Grubbs test was used to detect abnormal values, and the fine polishing sample data set with the abnormal values removed was obtained (the abnormal value detection result is shown in FIG. 3(a)); the number of samples is n = 105, and each sample contains 25 process variables and the corresponding MRR value (i.e., material removal rate);
analyzing the sample generated by a single wafer in the fine polishing sample data set after the abnormal value is removed, and determining 3 effective process variables (the data width is 0.12-11, namely the consumption of the backing film, the consumption of the polishing pad and the consumption of the partition film);
extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 12 feature vectors;
screening 12 feature vectors and corresponding MRR values by adopting a regression correlation analysis method, and setting a correlation coefficient threshold (the value is 0.7), namely determining 8 feature vectors as input feature vectors of the GMDH neural network model, as shown in Table 2, as follows:
TABLE 2
Serial number | Name of characteristic variable |
---|---|
X1 | Skewness of backing film consumption |
X2 | Mean of polishing pad consumption |
X3 | Standard deviation of polishing pad consumption |
X4 | Skewness of polishing pad consumption |
X5 | Kurtosis of polishing pad consumption |
X6 | Mean of partition film consumption |
X7 | Standard deviation of partition film consumption |
X8 | Skewness of partition film consumption |
Inputting 8 feature vector samples which are taken as input in the test feature set into a trained GMDH network model, and outputting a predicted MRR value;
comparing the predicted MRR value with the MRR values corresponding to the 8 input feature values in the test feature set to obtain the accuracy of the model prediction; a schematic diagram of the prediction result is shown in FIG. 6. In the fine polishing working mode, the accuracy of the prediction result is as follows: the mean square error (MSE) is 9.82 and the root mean square error (RMSE) is 3.13.
Table 1 shows the detailed predicted results of the training samples and the test samples under two different working modes
The model obtained from the training samples in Table 1 is also compared with the true values; the resulting error is called the training error.
The prediction results show that the MRR values predicted by the GMDH network agree well with the measured values. When the network topology is built, a balance point is found between the fitting accuracy on the training samples and the prediction accuracy on the test set, so that even with a small sample or high noise the algorithm can reflect the true internal relationship of the system (the nonlinear relationship between each consumption feature and the MRR value) as far as possible, further ensuring the optimality and generalization of the resulting model. The model can effectively monitor real-time changes in MRR during the wafer CMP process. Mean square error (MSE) and root mean square error (RMSE) are used as the model performance evaluation indicators; the smaller the RMSE, the higher the prediction accuracy of the model.
Claims (4)
1. A wafer CMP material removal rate prediction method of a GMDH neural network is characterized in that: the method comprises the following steps:
(1) acquiring a polishing sample data set after removing the abnormal value; wherein the number of samples is n, each sample contains a process variables and corresponding MRR values;
the polishing sample data set is acquired through a sensor on CMP equipment, and when the MRR value is 140-170 nm/min, the polishing sample data set refers to a rough polishing sample data set; when the MRR value is 50-110 nm/min, the polishing sample data set is a fine polishing sample data set;
(2) performing statistical analysis on main process variables generated by polishing a plurality of wafers in the polishing sample data set to determine b effective process variables; wherein b is less than a;
the effective process variable is a variable with a statistical width range of 0.12-11; when the polishing sample data set refers to a rough polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, zoning film consumption and flexible board consumption; or, when the polishing sample data set refers to a fine polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, and zoning film consumption;
(3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors;
(4) screening the 4 x b characteristic vectors and corresponding MRR values by adopting a regression correlation analysis method, and determining m characteristic vectors as input characteristic vectors of the GMDH neural network model after setting a correlation coefficient threshold;
when the polishing sample data set is a rough polishing sample data set, m is 8, and the input process variable features corresponding to the input feature vectors are respectively: the mean of backing film consumption, the skewness of backing film consumption, the mean of polishing pad consumption, the standard deviation of polishing pad consumption, the skewness of polishing pad consumption, the mean of zoning film consumption, the skewness of zoning film consumption, and the mean of flexible board consumption;
or, when the polishing sample data set is a fine polishing sample data set, m is 8, and the input process variable features corresponding to the input feature vectors are respectively: the skewness of backing film consumption, the mean of polishing pad consumption, the standard deviation of polishing pad consumption, the skewness of polishing pad consumption, the kurtosis of polishing pad consumption, the mean of zoning film consumption, the standard deviation of zoning film consumption, and the skewness of zoning film consumption;
(5) normalizing the data set A formed by the m feature vectors to obtain a training feature set A', wherein the sample size of the training feature set A' is $n_1$ and $n_1 < n$;
(6) adopting a bivariate quadratic Volterra polynomial regression model, taking the m feature vectors in the training feature set A' of sample size $n_1$ as the input layer and the corresponding MRR values in the training feature set as the output layer, and training to obtain a GMDH neural network model, namely:

$$(X'_1, X'_2, \ldots, X'_m) \longmapsto Y = (y_1, y_2, \ldots, y_{n_1})^{T}$$

wherein $X'_b = (x'_{1,b}, x'_{2,b}, \ldots, x'_{n_1,b})^{T}$ is the b-th feature vector in the training feature set A', $x'_{a,b}$ is the feature value of the b-th feature vector for the a-th sample in the training feature set A', $y_a$ is the actual MRR value corresponding to the a-th sample in the training feature set A', $n_1$ is the training sample size, a ∈ {1, 2, …, $n_1$}, b ∈ {1, 2, …, m};
(7) And inputting the m characteristic values serving as input in the sample to be tested into the trained GMDH network model, and outputting the predicted MRR value.
2. The method of claim 1, wherein in step (5),
the data set formed by the m feature vectors is A, as follows:

$$A = \begin{pmatrix} x_{1,1} & x_{1,2} & \cdots & x_{1,m} \\ x_{2,1} & x_{2,2} & \cdots & x_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n,1} & x_{n,2} & \cdots & x_{n,m} \end{pmatrix}$$

wherein $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^{T}$ is the b-th feature vector in data set A, $x_{a,b}$ is the feature value of the b-th feature vector for the a-th sample in data set A, m = 8, a ∈ {1, 2, …, n}, and b ∈ {1, 2, …, m};
the normalization processing means that the feature vectors $(x_{1,b}, x_{2,b}, \ldots, x_{a,b}, \ldots, x_{n,b})^{T}$ in data set A, a ∈ {1, 2, …, n}, b ∈ {1, 2, …, m}, are normalized one by one to obtain a normalized data set, and the normalization formula is as follows:

$$x_{normalized} = \frac{x_{actual} - x_{min}}{x_{max} - x_{min}}$$

wherein $x_{normalized}$ is the normalized feature value in the b-th feature vector, $x_{actual}$ is the feature value in the b-th feature vector, $x_{max}$ is the largest feature value in the b-th feature vector, $x_{min}$ is the smallest feature value in the b-th feature vector, and b ∈ {1, 2, …, m};
in the normalized data set, $n_1$ samples are randomly selected as the training feature set A', denoted:

$$A' = \begin{pmatrix} x'_{1,1} & x'_{1,2} & \cdots & x'_{1,m} \\ x'_{2,1} & x'_{2,2} & \cdots & x'_{2,m} \\ \vdots & \vdots & \ddots & \vdots \\ x'_{n_1,1} & x'_{n_1,2} & \cdots & x'_{n_1,m} \end{pmatrix}$$
3. The method of claim 2, wherein the step (6) of training the GMDH neural network model comprises the following steps:
(61) establishing a 1 st hidden layer:
(611) arbitrarily taking two feature vectors $X_i$ and $X_j$ from the 8 feature vectors in the training feature set A' and creating $G_1$ second-order polynomial equations as basic neurons, i.e. the total number of basic neurons in the 1st hidden layer is

$$G_1 = \binom{m}{2} = \frac{m(m-1)}{2}$$

wherein m = 8, and P is the threshold on the maximum total number of neurons in each hidden layer;
(612) calculating, according to Volterra quadratic polynomial regression, the model corresponding to each basic neuron in the 1st hidden layer:

$$\hat{y}^{1}_{a,r} = w_0 + w_1 x'_{a,i} + w_2 x'_{a,j} + w_3 x'_{a,i} x'_{a,j} + w_4 (x'_{a,i})^2 + w_5 (x'_{a,j})^2$$

wherein $\hat{Y}^{1}_{r} = (\hat{y}^{1}_{1,r}, \hat{y}^{1}_{2,r}, \ldots, \hat{y}^{1}_{n_1,r})^{T}$ is the predicted target output vector of the quadratic polynomial equation, $\hat{y}^{1}_{a,r}$ is the predicted target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer, $x'_{a,j}$ is the j-th input feature value of the a-th sample, selected from the inputs as an element connected to form the r-th basic neuron model of the 1st hidden layer, a ∈ {1, 2, …, $n_1$}, and i, j ∈ {1, 2, …, m};
$\{w_0, w_1, w_2, w_3, w_4, w_5\}$ are the coefficients of the quadratic polynomial equation, obtained by the least squares method with the objective of minimizing the difference between $\hat{Y}^{1}_{r}$ and $Y^{1}_{r}$; wherein $Y^{1}_{r}$ is the vector of actual MRR values of the $n_1$ training samples in the training feature set A';
(613) calculating the output root mean square error $RMSE^{1}_{r}$ of each neuron in the 1st hidden layer, namely:

$$RMSE^{1}_{r} = \sqrt{\frac{1}{n_1} \sum_{a=1}^{n_1} \left( \hat{y}^{1}_{a,r} - y^{1}_{a,r} \right)^2}$$

wherein $\hat{y}^{1}_{a,r}$ is the predicted target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer, and $y^{1}_{a,r}$ is the true target output of the a-th sample in the r-th basic neuron model in the 1st hidden layer;
(614) sorting all neurons in the 1st hidden layer by output root mean square error from small to large, and taking the first P neurons as effective neurons to form the 1st hidden layer;
(615) taking the outputs of the P neurons in the 1st hidden layer as the input feature vectors of the 2nd hidden layer; the 2nd hidden layer forms $G_2$ basic neurons, with

$$G_2 = \binom{P}{2} = \frac{P(P-1)}{2}$$

and steps (611) to (614) are repeated to obtain a 2nd hidden layer containing P neurons, wherein $U_1$ effective neurons of the 1st hidden layer participate in the combinations and connections that form the 2nd hidden layer;
(616) calculating the mean value $E_1$ of the output RMSE of the $U_1$ effective neurons of the 1st hidden layer that participate in the combinations and connections forming the 2nd hidden layer, namely:

$$E_1 = \frac{1}{U_1} \sum_{r=1}^{U_1} RMSE^{1}_{r}$$
(62) establishing the middle hidden layers: taking the outputs of the P neurons in the (k-1)-th hidden layer as the input feature vectors of the k-th hidden layer, and repeating steps (611) to (616) to obtain the middle hidden layers;
for the k-th hidden layer, the mean value $E_k$ of the output RMSE of its $U_k$ effective neurons participating in combination and connection is calculated in the same way, namely:

$$E_k = \frac{1}{U_k} \sum_{r=1}^{U_k} RMSE^{k}_{r}$$
(63) establishing the output layer: when $E_{k-1} - E_k \le 0.3$, training stops; the 2 neurons with the smallest output RMSE in the k-th hidden layer are taken as new input vectors and, together with the corresponding target output MRR vector, are used to construct a quadratic polynomial equation, whose output is the final output prediction value of the GMDH neural network model.
4. The method of claim 3 for predicting the wafer CMP material removal rate with a GMDH neural network, wherein P = 12.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011094499.9A CN112257337B (en) | 2020-10-14 | 2020-10-14 | Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (group method of data handling) neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112257337A CN112257337A (en) | 2021-01-22 |
CN112257337B true CN112257337B (en) | 2022-09-16 |
Family
ID=74243417
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011094499.9A Active CN112257337B (en) | 2020-10-14 | 2020-10-14 | Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (group method of data handling) neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112257337B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114627985B (en) * | 2022-01-26 | 2024-06-21 | 苏州中砥半导体材料有限公司 | Optimization method, system and medium for polishing process of indium phosphide material |
CN114358443B (en) * | 2022-03-09 | 2022-06-24 | 深圳市信润富联数字科技有限公司 | Average material removal rate prediction method, device, electronic device and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799793A (en) * | 2012-07-27 | 2012-11-28 | 中国科学院微电子研究所 | Method and equipment for calculating chemical mechanical polishing removal rate |
CN102945304A (en) * | 2012-11-14 | 2013-02-27 | 中国科学院微电子研究所 | Method for calculating grinding removal rate of wafer surface |
TW201539602A (en) * | 2014-03-06 | 2015-10-16 | Kla Tencor Corp | Statistical overlay error prediction for feed forward and feedback correction of overlay errors, root cause analysis and process control |
CN107214610A (en) * | 2017-05-05 | 2017-09-29 | 天津华海清科机电科技有限公司 | The online flatness control system of copper CMP |
CN110039440A (en) * | 2019-03-27 | 2019-07-23 | 中国科学院微电子研究所 | A kind of method and device calculating CMP grind clearance |
CN110555230A (en) * | 2019-07-12 | 2019-12-10 | 北京交通大学 | rotary machine residual life prediction method based on integrated GMDH framework |
Non-Patent Citations (2)
Title |
---|
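Adaptive virtual metrology for semiconductor chemical mechanical planarization process using GMDH-type polynomial neural networks; Xiaodong Jia, Yuan Di, Jianshe Feng, Qibo Yang, Honghao Dai, Jay; Journal of Process Control; 20181231; pp. 44-54 of the paper * |
Flow prediction for wafer cleaning machine filters using a particle-swarm-based RBF neural network; Wang Yanyan; China Excellent Doctoral and Master's Theses Full-text Database (Master's), Information Science and Technology series; 20191215; pp. 36-44 of the thesis * |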
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||