CN112257337B - Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network - Google Patents

Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network Download PDF

Info

Publication number
CN112257337B
CN112257337B CN202011094499.9A CN202011094499A CN112257337B CN 112257337 B CN112257337 B CN 112257337B CN 202011094499 A CN202011094499 A CN 202011094499A CN 112257337 B CN112257337 B CN 112257337B
Authority
CN
China
Prior art keywords
hidden layer
data set
consumption
training
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011094499.9A
Other languages
Chinese (zh)
Other versions
CN112257337A (en
Inventor
贾花
宋万清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai University of Engineering Science
Original Assignee
Shanghai University of Engineering Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai University of Engineering Science filed Critical Shanghai University of Engineering Science
Priority to CN202011094499.9A priority Critical patent/CN112257337B/en
Publication of CN112257337A publication Critical patent/CN112257337A/en
Application granted granted Critical
Publication of CN112257337B publication Critical patent/CN112257337B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2111/00Details relating to CAD techniques
    • G06F2111/10Numerical modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2113/00Details relating to the application field
    • G06F2113/18Chip packaging
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/06Power analysis or power optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/14Force analysis or force optimisation, e.g. static or dynamic forces

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • Mechanical Treatment Of Semiconductor (AREA)

Abstract

The invention relates to a method for predicting the removal rate of a wafer CMP material of a GMDH neural network, which comprises the following steps: (1) acquiring a polishing sample data set after removing the abnormal value; (2) analyzing the samples in the polishing sample data set to determine b effective process variables; (3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors; (4) screening the correlation between the 4 × b characteristic vectors and the corresponding MRR values, and determining m characteristic vectors as input characteristic vectors of the GMDH neural network model; (5) carrying out normalization processing on a data set formed by the m feature vectors to obtain a training feature set; (6) obtaining a trained GMDH network model by adopting a binary quadratic Volterra polynomial regression model and taking input characteristic values in the training characteristic set as input layers and corresponding output MRR values as output layers; (7) and inputting the m characteristic values serving as input in the sample to be predicted into the trained GMDH network model, and outputting the predicted MRR value.

Description

Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network
Technical Field
The invention belongs to the technical field of semiconductor material prediction methods, and relates to a wafer CMP material removal rate prediction method of a GMDH neural network.
Background
Chemical Mechanical Polishing (CMP) is a mainstream process downstream of the end of wafer fabrication; the purpose of this process is to overcome the problem of wafer multilayer metallization. CMP planarizes the wafer surface by passivating and etching the wafer material with a slurry chemistry, i.e., the wafer is pressed downward to slide its surface over the slurry particles. Wafer CMP processes are very complex and involve a variety of chemical and mechanical phenomena, such as surface dynamics, electrochemical interfaces, contact mechanics, stress mechanics, fluid dynamics, and tribochemistry.
In the CMP process of a wafer, MRR is an important index (MRR value, i.e., material removal rate) for measuring the performance in the process. However, the quality of all wafers is controlled based on measurements of process variables, requiring expensive metrology tools and production cycles; and stringent experimental work procedures are required in the laboratory. The influencing factors influencing the MRR are mainly as follows: the down pressure, the rotation speed and temperature of the polishing disk and the polishing head, the type and flow rate of the polishing solution and the like. Currently, to simulate the physical mechanism of CMP, several models have been proposed: a typical model is the Preston equation, which describes MRR as a function of pressure and relative velocity V between the pad and the wafer, and the prediction accuracy of the physical model is the mean square error MSE of 870.25. There are also improvements based on the aforementioned Preston equation, such as increasing the slurry flow rate, contact stress, and chemical reaction rate into the original Preston equation; only the Depth Belief Network (DBN) of the general process parameters is considered, and the prediction precision of the test set of the model is that the Mean Square Error (MSE) is 7.29. Research efforts have been focused on the development of both physics-based and model-based predictive modeling techniques for CMP.
The wafer MRR is affected by many factors, including the rotational speed, temperature, and type of slurry of the polishing platen and head, in addition to the factors involved in the above model. The above method is limited to only considering general process parameters. The above-mentioned general process optimum parameters have been basically determined by research and experimental studies based on the influence factors of the existing CMP process MRR: such as pressure, rotational speed, and slurry flow rate, are all tightly controlled. However, in actual polishing, wear (i.e., consumption) of worn components such as a polishing pad and a dresser may also have an irreversible effect on the MRR value over time. The existing method only considers few general process variables and neglects the critical influence defect of important consumption variables on MRR value.
In a CMP process system, a large number of process variables are collected for process control purposes. Thus, the input dimensions of a typical modeling approach are very high, sometimes involving hundreds of input variables, where the presence of redundant features can significantly impact the performance of the virtual metrology model.
Therefore, the research of a model which can introduce important consumption variables, simplify feature selection and model selection and has high measurement accuracy is of great significance.
Disclosure of Invention
In order to solve the problem that the material removal rate in the CMP technology cannot be accurately obtained in the prior art, the invention provides a wafer CMP material removal rate prediction method of a GMDH neural network, which is a complex system modeling method based on the combination of physical knowledge and statistics; the self-adaptive model method of the GMDH neural network is adopted, the MRR value is accurately predicted, the MRR value is guaranteed to be within a normal range, so that the removal rate precision is improved, for example, under the wafer rough polishing mode and the wafer fine polishing mode, the MRR value range is respectively controlled to be 140-170 nm/min and 50-110 nm/min, if the predicted value does not conform to the range, the process parameters are timely adjusted, and for example, worn materials such as a trimmer, a polishing pad and the like are timely replaced. Accurate prediction of the model provides a decision analysis basis for evaluating the performance health state of each component of the CMP.
In order to achieve the purpose, the invention adopts the following scheme:
a wafer CMP material removal rate prediction method of a GMDH neural network comprises the following steps:
(1) acquiring a polishing sample data set after removing the abnormal value; where the number of samples is n, each sample contains a process variables and corresponding MRR values (i.e., material removal rates);
(2) performing statistical analysis on main process variables generated by polishing a plurality of wafers in a polishing sample data set, comprehensively considering the distribution uniformity and distribution range of each variable, and determining b effective process variables which are uniformly distributed and have wide ranges; wherein b is less than a;
(3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors;
(4) screening the 4 x b characteristic vectors and corresponding MRR values by adopting a regression correlation analysis method, and determining m characteristic vectors as input characteristic vectors of the GMDH neural network model after setting a correlation coefficient threshold; wherein m < (4 × b);
determining the input dimension of the GMDH neural network model as m, and determining y as the corresponding output MRR value (the input feature dimension m is in direct proportion to the total number n of samples, and the specific data is determined to improve the training precision, the more features are, the higher precision is, but the precision tends to be stable along with the increase of the feature dimension, and even can be reduced);
(5) carrying out normalization processing on a data set A formed by m feature vectors to obtain a training feature set A' and a testing feature set, wherein the sample size of the training feature set is n 1 The sample size of the test feature set is n 2 ,n=n 1 +n 2 . The normalization process is intended to reduce the influence of the inconsistency of the unit dimensions of the input data on the prediction accuracy.
(6) Adopting a binary quadratic Volterra polynomial regression model, and taking the sample size as n 1 The m feature vectors in the training feature set A' are input layers, the corresponding MRR values in the training feature set are output layers, and a GMDH neural network model is trained; namely:
Figure GDA0003752223730000031
wherein,
Figure GDA0003752223730000032
for the b-th feature vector, x, in the training feature set A a,b Is the corresponding eigenvalue, y, in the b-th eigenvector in the a-th sample in the training characteristic set A a The actual MRR value, n, corresponding to the a-th sample in the training feature set A 1 For training the sample size, a ∈ {1,2, …, n 1 },b∈{1,2, …,m};
(7) Inputting m characteristic values serving as input in a test characteristic set (a sample set to be tested) into a trained GMDH network model, and outputting a predicted MRR value;
(8) and comparing the predicted MRR value with the MRR values corresponding to the input m characteristic values in the test characteristic set to obtain the accuracy of the model prediction.
Under the rough polishing working mode, the accuracy of a prediction result is as follows: mean square error MSE is 3.95, mean square error RMSE is 1.99;
under the fine polishing working mode, the accuracy of a prediction result is as follows: the mean square error MSE is 9.82 and the mean square error RMSE is 3.31.
As a preferred technical scheme:
according to the prediction method for the removal rate of the GMDH neural network wafer CMP material, in the step (1), a polishing sample data set is acquired through a sensor on CMP equipment, and when an MRR value is 140-170 nm/min, the polishing sample data set refers to a rough polishing sample data set; when the MRR value is 50-110 nm/min, the polishing sample data set is a fine polishing sample data set;
in the step (1), a is 25, and the process variables include chamber pressure, main and external pressure, center pressure, retaining ring pressure, ripple pressure, edge pressure, dresser rotation speed, wafer rotation speed, polishing table rotation speed, a-type slurry flow speed, B-type slurry flow speed, C-type slurry flow speed, consumption of a polishing table backing film, consumption of a polishing pad, consumption of a wafer carrier flexible plate, consumption of a partition film, consumption of a dresser table, dressing liquid state, a chamber for wafer processing, a process treatment stage, a wafer identifier, a time cut, a wafer ring position identifier and a polishing machine identifier. The first 18 main process variables were selected for statistical distribution analysis. Other identifier variables have less impact on the target output and therefore are not analyzed as predictors.
In the step (1), the method for removing the abnormal value comprises the following steps: and detecting an abnormal value by using Grubbs to improve the prediction precision, wherein the abnormal value is generated by the measurement failure of a sensor and the occurrence of random errors of process parameters and is a maximum value or a minimum value.
In the method for predicting the wafer CMP material removal rate of the GMDH neural network, in the step (2), the effective process variable is a variable with a statistical width range of 0.12 to 11 (the width is a difference between a maximum value and a minimum value of the variable). Narrow distribution of variables does not increase the accuracy of MRR values because the data of rotational speed, pressure and flow rate variables are distributed in a narrow range throughout the CMP process, and the pressure, rotational speed and slurry flow rate are all tightly controlled in actual wafer CMP. These process parameters are essentially fixed values to ensure wafer material removal accuracy and removal uniformity. From a physics perspective, pressure and rotational speed are key factors that affect MRR, but from a computational perspective, the inclusion of pressure and rotational speed in the GMDH neural network model does not increase MRR prediction accuracy.
When the polishing sample data set refers to a rough polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, zoning film consumption and flexible board consumption; other invalid variables are distributed more discretely and basically have a constant value; these variables therefore cannot be used as predictors for the model;
or when the polishing sample data set refers to a fine polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, and zoned film consumption.
In the method for predicting the material removal rate of the wafer CMP of the GMDH neural network, in step (4), when the polishing sample data set is the rough polishing sample data set, m is 8, and the input process variable features corresponding to the input feature vectors are respectively: a mean value of backing film consumption, a warp of backing film consumption, a mean value of polishing pad consumption, a standard deviation of polishing pad consumption, a warp of polishing pad consumption, a mean value of zoning film consumption, a warp of zoning film consumption and a mean value of flexible sheet consumption;
or, when the polishing sample data set is the fine polishing sample data set, where m is 8, and the input process variable features corresponding to the input feature vectors are respectively: the waviness of the consumption of the backing film, the mean of the consumption of the polishing pad, the standard deviation of the consumption of the polishing pad, the waviness of the consumption of the polishing pad, the kurtosis of the consumption of the polishing pad, the mean of the consumption of the partition film, the standard deviation of the consumption of the partition film, and the waviness of the consumption of the partition film.
The manner of obtaining the characteristic values is well known.
The method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, in step (5),
the data set formed by the m feature vectors is a, as follows:
Figure GDA0003752223730000041
wherein (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T For the b-th feature vector, x, in data set A a,b For the corresponding eigenvalue in the b-th eigenvector in the a-th sample in the dataset a, m ═ 8, a ∈ {1,2, …, n }, and b ∈ {1,2, …, m };
the normalization processing means that feature vectors in the data set A are normalized one by one to obtain a normalized data set, wherein the feature vectors refer to feature vectors (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T A is equal to {1,2, …, n }, b is equal to {1,2, …, m }, and the normalized calculation formula is as follows:
Figure GDA0003752223730000051
wherein x is normalized For normalized eigenvalue, x, in the b-th eigenvector actual Is the eigenvalue, x, in the b-th eigenvector max Is the largest eigenvalue, x, in the b-th eigenvector min B ∈ {1,2, …, m } which is the smallest eigenvalue in the b-th eigenvector;
in the normalized dataset, random selection
Figure GDA0003752223730000052
Taking each sample as a training feature set A', and recording as:
Figure GDA0003752223730000053
wherein,
Figure GDA0003752223730000054
is the b-th feature vector, x 'in the training feature set A' a,b Is the corresponding eigenvalue, n, in the b-th eigenvector in the a-th sample in the training characteristic set A 1 For training sample size, m is 8, a ∈ {1,2, …, n 1 },b∈{1,2,…,m}。
In the method for predicting the wafer CMP material removal rate of the GMDH neural network, in the step (6), the step of training the GMDH neural network model specifically includes:
(61) establishing a 1 st hidden layer:
(611) arbitrarily taking two feature vectors X from 8 feature vectors in the training feature set A i And X j Creating G 1 Second order polynomial equation as the basic neuron, i.e. the total number of the 1 st hidden layer basic neurons
Figure GDA0003752223730000055
Wherein m is 8, and P is the threshold value of the maximum neuron total number in each hidden layer;
(612) respectively calculating and obtaining models corresponding to all basic neurons in the 1 st hidden layer according to Volterra quadratic polynomial regression
Figure GDA0003752223730000061
Figure GDA0003752223730000062
Figure GDA0003752223730000063
Figure GDA0003752223730000064
Figure GDA0003752223730000065
Figure GDA0003752223730000066
Wherein,
Figure GDA0003752223730000067
is the target output vector predictor of the quadratic polynomial equation,
Figure GDA0003752223730000068
refers to a target output predicted value, x 'of the a sample in the r basic neuron model in the 1 st hidden layer' a,j Means that the jth input characteristic value of the ath sample is selected from the input and is used as an element connected to form the 1 st hidden layer and the r th basic neuron model, and a is e {1,2, …, n 1 },b∈{1,2,…,m};
{w 0 ,w 2 ,w 3 ,w 4 ,w 5 Is the coefficient of a quadratic polynomial equation
Figure GDA0003752223730000069
And Y r 1 The minimum difference is used as a target and is obtained by calculation by adopting a least square method; wherein, Y r 1 For training n in the feature set A 1 A vector of actual MRR values for each training sample;
(613) respectively calculating the output root mean square error of each neuron in the 1 st hidden layer
Figure GDA00037522237300000610
A value; namely, it is
Figure GDA00037522237300000611
Figure GDA00037522237300000612
Wherein,
Figure GDA00037522237300000613
outputting a predicted value for a target output of an a sample in an r basic neuron model in a 1 st hidden layer,
Figure GDA00037522237300000614
outputting a real value for a target of an a-th sample in an r-th basic neuron model in a 1 st hidden layer;
(614) sequencing all neurons in a hidden layer 1 from small to large in output root mean square error, and taking P neurons sequenced at the front as effective neurons to form a hidden layer 1;
(615) taking each output of P neurons in the 1 st hidden layer as an input feature vector of the 2 nd hidden layer; the 2 nd hidden layer forms G 2 A basic neuron, and
Figure GDA00037522237300000615
G 2 if the number of the neurons is more than P, repeating the steps (611) to (614) to obtain a hidden layer 2 containing P neurons, wherein the hidden layer 1 is participated in combination and connection to form a hidden layer 2 with U 1 A number of effective neurons;
(616) calculating U participating in combination and connection in the 1 st hidden layer to form the 2 nd hidden layer 1 Output of an effective neuron
Figure GDA00037522237300000616
Mean value of 1 I.e. by
Figure GDA0003752223730000071
(62) Establishing a middle hidden layer: repeating the steps (611) to (616) by taking each output of P neurons in the k-1 hidden layer (equivalent to the 1 st hidden layer) as an input feature vector of the k hidden layer (equivalent to the 2 nd hidden layer) to obtain an intermediate hidden layer;
wherein the total number of basic neurons in the kth hidden layer
Figure GDA0003752223730000072
k≥2;
U participating in combination and connection in the k-1 hidden layer to form the k hidden layer k An effective neuron output
Figure GDA0003752223730000073
Mean value of k I.e. by
Figure GDA0003752223730000074
(63) Establishing an output layer: when E is k-1 -E k Less than or equal to 0.3 (i.e. the k hidden layer E) k Not when there is a significant decrease with increasing number of hidden layers), training stops; and 2 output RMSE small neurons in the k hidden layer are used as new input vectors and target output MRR vectors corresponding to the new input vectors to construct a quadratic polynomial equation, and the output of the equation is used as the final output prediction value of the GMDH neural network model.
The method for predicting the wafer CMP material removal rate of the GMDH neural network as described above, where P is 12.
Advantageous effects
(1) According to the wafer CMP material removal rate virtual prediction method of the GMDH neural network, the advantages of single-cycle prediction capability and short operation time of the GMDH network are utilized, and the optimal feature set is self-organized and selected to establish a wafer MRR prediction model. The combination of comprehensive consideration of physics knowledge and statistics overcomes the defects that the traditional prediction method only considers few variables and neglects the influence of important consumption variables on MRR. The problem that the grinding removal rate in the wafer CMP process cannot be rapidly and accurately obtained is solved.
(2) According to the virtual prediction method for the removal rate of the CMP material of the wafer of the GMDH neural network, error sources such as drift in the polishing process and difference among wafer products in different batches are comprehensively considered, and in order to improve the accuracy of a prediction model, two modes of rough polishing and fine polishing of the wafer are respectively modeled.
(3) According to the virtual prediction method for the removal rate of the CMP material of the wafer of the GMDH neural network, the average value of consumption variables of a polishing pad, a backing film and a flexible plate is selected as a most effective prediction factor from a rough polishing training network structure. The most effective variable widely used in the input layer polynomial equation at this time is the polishing pad consumption average.
(4) According to the virtual prediction method for the removal rate of the wafer CMP material of the GMDH neural network, the mean value, the standard deviation and the skewness of the consumption of the polishing pad, the skewness of the backing film and the skewness of the consumption variable of the partition film are selected as the most effective prediction factors from the structure of fine polishing training, and the most effective variable widely used in the polynomial equation of the input layer is the standard deviation of the consumption of the polishing pad.
(5) According to the virtual prediction method for the removal rate of the CMP material of the wafer of the GMDH neural network, the accurate prediction of the MRR value is rapidly obtained through the network model, and the influence of the abrasion consumption of the polishing pad on the target output MRR is the largest no matter in a fine polishing mode or a rough polishing mode, so that if the predicted value is not in the range of the MRR value, the process parameters are timely adjusted, for example, the new polishing pad, a trimmer and other worn materials are timely replaced, and the polishing pad is particularly used as a key maintenance component of the CMP process of the wafer. According to 2016PHM data and the result of experimental prediction error evaluation index calculation, the MRR prediction method provided by the method is proved to be more excellent than the prediction effect of a physical model and a traditional neural network. The method is suitable for nonlinear complex CMP process modeling.
Drawings
FIGS. 1 and 2 are flow charts of wafer removal rate prediction according to the present invention;
FIG. 3(a) shows the detection results of 4 MRR abnormal values according to the present invention;
FIG. 3(b) is a graph of MRR value distribution for two modes of operation in a CMP process of the present invention;
FIG. 4 is a schematic diagram of a GMDH network structure trained during a wafer rough polishing stage according to the present invention;
FIGS. 5 and 6 are predicted results of the rough polishing stage and the fine polishing stage, respectively, of the present invention;
FIG. 7 is a graph of a statistical analysis of process variables in the rough polishing mode of the present invention.
Detailed Description
The invention will be further illustrated with reference to specific embodiments. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and such equivalents may fall within the scope of the present invention as defined in the appended claims.
A method for predicting a wafer CMP material removal rate of a GMDH neural network, the flow diagram of which is shown in FIGS. 1-2, comprises the following steps:
(1) acquiring a polishing sample data set after removing the abnormal value; where the number of samples is n, each sample contains 25 process variables and corresponding MRR values (i.e., material removal rates);
the process variables include chamber pressure, main and external pressure, center pressure, retaining ring pressure, ripple pressure, edge pressure, dresser rotational speed, wafer rotational speed, polishing table rotational speed, type a slurry flow rate, type B slurry flow rate, type C slurry flow rate, consumption of polishing table backing film, consumption of polishing pad, consumption of wafer carrier flex, consumption of zoning film, consumption of dresser table, dressing solution state, chamber for wafer processing, process treatment stage, wafer identifier, time cut, wafer ring position identifier, and polishing machine identifier. The first 18 main process variables were selected for statistical distribution analysis. Other identifier variables have less impact on the target output and therefore are not analyzed as predictors.
The method for removing the abnormal value comprises the following steps: and detecting an abnormal value by using Grubbs to improve the prediction precision, wherein the abnormal value is generated by the measurement failure of a sensor and the occurrence of random errors of process parameters and is a maximum value or a minimum value.
(2) Performing statistical analysis on main process variables generated by polishing a plurality of wafers in the polishing sample data set, and determining b effective process variables with statistical width ranges of 0.12-11 (the width is the difference between the maximum value and the minimum value in the variables); wherein b is less than a;
(3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors;
(4) screening the 4 x b characteristic vectors and corresponding MRR values by adopting a regression correlation analysis method, and determining m characteristic vectors as input characteristic vectors of the GMDH neural network model after setting a correlation coefficient threshold; wherein m is 8, m < (4 × b);
(5) carrying out normalization processing on a data set A formed by m feature vectors to obtain a training feature set A' and a testing feature set, wherein the sample size of the training feature set is n 1 The sample size of the test feature set is n 2 ,n=n 1 +n 2 (ii) a The specific process is as follows:
the data set formed by the m feature vectors is a, as follows:
Figure GDA0003752223730000091
wherein (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T For the b-th feature vector, x, in data set A a,b For the corresponding eigenvalue in the b-th eigenvector in the a-th sample in the data set a, m is 8, a ∈ {1,2, …, n }, b ∈ {1,2, …, m };
the normalization processing means that feature vectors in the data set A are normalized one by one to obtain a normalized data set, wherein the feature vectors refer to feature vectors (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T A is equal to {1,2, …, n }, b is equal to {1,2, …, m }, and the normalized calculation formula is as follows:
Figure GDA0003752223730000092
wherein x is normalized For normalized eigenvalue, x, in the b-th eigenvector actual Is the eigenvalue, x, in the b-th eigenvector max Is the largest eigenvalue, x, in the b-th eigenvector min B is the minimum eigenvalue in the b-th eigenvector, and b is epsilon {1,2, …, m };
in the normalized dataset, random selection
Figure GDA0003752223730000093
Taking each sample as a training feature set A', and recording as:
Figure GDA0003752223730000101
wherein,
Figure GDA0003752223730000102
is the b th feature vector, x 'in the training feature set A' a,b Is the corresponding characteristic value, n, in the b-th characteristic vector of the a-th sample in the training characteristic set A 1 For the training sample size, m is 8, a ∈ {1,2, …, n 1 },b∈{1,2,…,m};
(6) Adopting a binary quadratic Volterra polynomial regression model, and taking the sample size as n 1 The m feature vectors in the training feature set A' are input layers, the corresponding MRR values in the training feature set are output layers, and a GMDH neural network model is trained; namely:
Figure GDA0003752223730000103
wherein,
Figure GDA0003752223730000104
is the b-th feature vector, x 'in the training feature set A' a,b Is the corresponding eigenvalue, y, in the b-th eigenvector in the a-th sample in the training characteristic set A a Is the actual MRR value, n, corresponding to the a-th sample in the training feature set A 1 For training the sample size, a ∈ {1,2, …, n 1 },b∈{1,2,…,m};
The specific steps for training the GMDH neural network model are as follows:
(61) establishing a 1 st hidden layer:
(611) arbitrarily taking two feature vectors X from 8 feature vectors in the training feature set A i And X j Creating G 1 A second order polynomial equation as the number of basic neurons, i.e. the 1 st hidden layer basic neuron
Figure GDA0003752223730000105
Wherein m is 8, and P is the threshold value of the maximum neuron total number in each hidden layer; p ═ 12;
(612) respectively calculating and obtaining models corresponding to all basic neurons in the 1 st hidden layer according to Volterra quadratic polynomial regression
Figure GDA0003752223730000106
Figure GDA0003752223730000111
Figure GDA0003752223730000112
Figure GDA0003752223730000113
Figure GDA0003752223730000114
Figure GDA0003752223730000115
Wherein,
Figure GDA0003752223730000116
is the target output vector predictor of the quadratic polynomial equation,
Figure GDA0003752223730000117
the target output predicted value of the a sample in the r basic neuron model in the 1 st hidden layer is referred to, r is the serial number of the basic neuron in the 1 st hidden layer, x' a,j Means that the jth input characteristic value of the ath sample is selected from the input and is used as an element connected to form the 1 st hidden layer and the r th basic neuron model, and a is e {1,2, …, n 1 },b∈{1,2,…,m};P=12。
{w 0 ,w 2 ,w 3 ,w 4 ,w 5 Is the coefficient of a quadratic polynomial equation
Figure GDA0003752223730000118
And Y r 1 The minimum difference is used as a target and is obtained by calculation by adopting a least square method; wherein, Y r 1 For training n in the feature set A 1 A vector of actual MRR values for each training sample;
(613) respectively calculating the output root mean square error of each neuron in the 1 st hidden layer
Figure GDA0003752223730000119
A value; namely, it is
Figure GDA00037522237300001110
Figure GDA00037522237300001111
Wherein,
Figure GDA00037522237300001112
for the target output predicted value of the a sample in the r basic neuron model in the 1 st hidden layer,
Figure GDA00037522237300001113
outputting a real value for a target of an a-th sample in an r-th basic neuron model in a 1 st hidden layer, wherein r is a serial number of a basic neuron in the 1 st hidden layer;
(614) sequencing all neurons in a hidden layer 1 from small to large in output root mean square error, and taking P neurons sequenced at the front as effective neurons to form a hidden layer 1;
(615) taking the output of each of the P neurons in the hidden layer 1 as an input feature vector of the hidden layer 2, wherein P is 12; the 2 nd hidden layer forms G 2 A basic neuron, and
Figure GDA00037522237300001114
G 2 repeating the steps (611) to (614) to obtain a hidden layer 2 containing P neurons, wherein P is 12, U is contained in the hidden layer 1, which is combined and connected to form the hidden layer 2 1 A number of effective neurons;
(616) calculating U participating in combination and connection in the 1 st hidden layer to form the 2 nd hidden layer 1 Output of an effective neuron
Figure GDA0003752223730000121
Mean value of 1 I.e. by
Figure GDA0003752223730000122
Wherein r is U in the 1 st hidden layer 1 The number of each valid neuron;
(62) establishing a middle hidden layer: outputting each item of P neurons in the (k-1) th hidden layer as an input feature vector of the (k) th hidden layer, and repeating the steps (611) to (616) to obtain a middle hidden layer;
wherein the total number of basic neurons in the kth hidden layer
Figure GDA0003752223730000123
k≥2;
U participating in combination and connection in the k-1 hidden layer to form the k hidden layer k An effective neuron output
Figure GDA0003752223730000124
Mean value of k I.e. by
Figure GDA0003752223730000125
(63) Establishing an output layer: when E is k-1 -E k Less than or equal to 0.3 (i.e. the k hidden layer E) k Not when there is a significant decrease with increasing number of hidden layers), training stops; and 2 output RMSE small neurons in the k-th hidden layer are used as new input vectors and target output MRR vectors corresponding to the new input vectors to construct a quadratic polynomial equation, and the output of the equation is used as a final output prediction value of the GMDH neural network model.
(7) Inputting m characteristic values serving as input in a test characteristic set (a sample set to be tested) into a trained GMDH network model, and outputting a predicted MRR value;
(8) and comparing the predicted MRR value with the MRR values corresponding to the input m characteristic values in the test characteristic set to obtain the accuracy of the model prediction.
Predicting a rough polishing sample data set (the MRR value distribution graph is shown in figure 3 (b)) with the MRR value of 140-170 nm/min acquired by a sensor on CMP equipment by adopting the method for predicting the removal rate of the CMP material of the wafer of the GMDH neural network, wherein the data is from 2016PHM challenge race, Grubbs is adopted to detect abnormal values, 4 MRR values are found to be far greater than the abnormal values of 170nm/min (the abnormal value detection result is shown in figure 3 (a)), and the rough polishing sample data set with the abnormal values removed is acquired; the number of samples of the rough polishing sample data set is n, which is 102, and each sample contains 25 process variables and corresponding MRR values (i.e., material removal rates);
in the rough polishing sample data set after removing the abnormal values, randomly selecting five wafer polishing samples to perform process variable statistical analysis (as shown in fig. 7), and determining 4 effective process variables (variables with data width of 0.12-11, namely backing film consumption, polishing pad consumption, partition film consumption and flexible board consumption); wherein, the distribution range of the consumption of the backing film is 10.83, the distribution range of the consumption of the polishing pad is 9.63, the distribution range of the consumption of the partition film is 3.25, and the distribution range of the consumption of the flexible board is 0.12; other invalid variables are distributed more discretely and basically have a constant value; these variables therefore cannot be used as predictors for the model;
extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable in the rough polishing sample data set to obtain 16 feature vectors;
screening 16 characteristic vectors and corresponding MRR values in the rough polishing sample data set by adopting a regression correlation analysis method, and setting a correlation coefficient threshold (the value is 0.65), so that 8 characteristic vectors with strong correlation can be determined to be used as input characteristic vectors of the GMDH neural network model, see table 1, as follows:
TABLE 1
Serial number Name of characteristic variable
X 1 Consumption average of backing film
X 2 Consumption distortion of backing film
X 3 Average polishing pad consumption
X 4 Standard deviation of polishing pad consumption
X 5 Polishing pad consumption skewness
X 6 Consumption of the partition filmValue of
X 7 Skewness of consumption of partition film
X 8 Average consumption of flexible board
The schematic diagram of the trained GMDH network structure is shown in FIG. 4, and the input vector which shows that the network is most widely used from the input layer is X3 (i.e., the average pad consumption in table 1), is the most effective MRR predictor. In the 1 st hidden layer, the first hidden layer,
Figure GDA0003752223730000131
the selected neurons are eliminated, and the output RMSE of the next layer of new neurons formed by combining the neurons according to the model evaluation criterion is larger, which indicates that the correlation between the neurons and the output MRR is weak and the neurons do not participate in the network connection of the next layer. Then
Figure GDA0003752223730000132
Are the first hidden layer of active neurons. When the 4 th hidden layer is constructed, the mean value of the RMSE output by all the effective neurons does not have obvious descending trend along with the increase of the number of the hidden layers, the training is stopped, and the GMDH neural network topological structure containing 4 hidden layers is obtained, namely the GMDH network model is obtained.
Inputting 8 characteristic values which are taken as input in the corresponding test characteristic set into a trained GMDH network model, and outputting a predicted MRR value;
comparing the predicted MRR value with the MRR values corresponding to the 8 input characteristic values in the test characteristic set to obtain the accuracy of the model prediction, wherein a schematic diagram of the prediction result is shown in FIG. 5, and under a rough polishing working mode, the accuracy of the prediction result is as follows: the mean square error MSE is 3.95 and the mean square error RMSE is 1.99.
By adopting the prediction method for the removal rate of the CMP material of the wafer with the GMDH neural network, a fine polishing sample data set (MRR value distribution diagram is shown in figure 3 (b)) with an MRR value of 50-110 nm/min acquired by a sensor on CMP equipment is predicted, the data is obtained from 2016PHM challenge match data, Grubbs is adopted to detect abnormal values, and the fine polishing sample data set (the abnormal value detection result is shown in figure 3 (a)) with the abnormal values removed is acquired; where the number of samples, n 105, each sample contained 25 process variables and corresponding MRR values (i.e., material removal rates);
analyzing the sample generated by a single wafer in the fine polishing sample data set after the abnormal value is removed, and determining 3 effective process variables (the data width is 0.12-11, namely the consumption of the backing film, the consumption of the polishing pad and the consumption of the partition film);
extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 12 feature vectors;
screening 12 feature vectors and corresponding MRR values by adopting a regression correlation analysis method, and setting a correlation coefficient threshold (the value is 0.7), namely determining 8 feature vectors as input feature vectors of the GMDH neural network model, as shown in Table 2, as follows:
TABLE 2
Serial number Name of characteristic variable
X 1 Consumption distortion of backing film
X 2 Average polishing pad consumption
X 3 Polishing ofStandard deviation of pad consumption
X 4 Polishing pad consumption skewness
X 5 Kurtosis of polishing pad consumption
X 6 Mean value of consumption of partitioned film
X 7 Standard deviation of consumption of zoned membrane
X 8 Skewness of consumption of partition film
Inputting 8 feature vector samples which are taken as input in the test feature set into a trained GMDH network model, and outputting a predicted MRR value;
comparing the predicted MRR value with the MRR values corresponding to the 8 input characteristic values in the test characteristic set to obtain the accuracy of the model prediction, wherein a prediction result schematic diagram is shown in FIG. 6, and under a fine polishing working mode, the accuracy of the prediction result is as follows: the mean square error MSE is 9.82 and the mean square error RMSE is 3.13.
Table 1 shows the detailed predicted results of the training samples and the test samples under two different working modes
Figure GDA0003752223730000141
Figure GDA0003752223730000151
The training model obtained from the training samples in table 1 will also be analyzed for errors from the true values, which is called training error.
The prediction result shows that the MRR predicted value obtained by the GMDH network is in good accordance with the real measured value, when a network topological structure is established, a balance point is found between the fitting precision of the training sample and the prediction precision of the test set, so that the real internal relation (the nonlinear relation between each consumption characteristic and the MRR value) of the system can be reflected to the maximum extent by the algorithm even if the network model is small in sample or has high noise, and the optimality and the generalization of the established model are further ensured. The MRR real-time change of the wafer CMP process can be effectively monitored by the model. Mean Square Error (MSE) and Root Mean Square Error (RMSE) are used as model performance evaluation indicators. The smaller the RMSE, the higher the model prediction accuracy.

Claims (4)

1. A wafer CMP material removal rate prediction method of a GMDH neural network is characterized in that: the method comprises the following steps:
(1) acquiring a polishing sample data set after removing the abnormal value; wherein the number of samples is n, each sample contains a process variables and corresponding MRR values;
the polishing sample data set is acquired through a sensor on CMP equipment, and when the MRR value is 140-170 nm/min, the polishing sample data set refers to a rough polishing sample data set; when the MRR value is 50-110 nm/min, the polishing sample data set is a fine polishing sample data set;
(2) performing statistical analysis on main process variables generated by polishing a plurality of wafers in the polishing sample data set to determine b effective process variables; wherein b is less than a;
the effective process variable is a variable with a statistical width range of 0.12-11; when the polishing sample data set refers to a rough polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, zoning film consumption and flexible board consumption; or, when the polishing sample data set refers to a fine polishing sample data set, the effective process variables are as follows: backing film consumption, polishing pad consumption, and zoning film consumption;
(3) extracting the mean value, standard deviation, skewness and kurtosis of each effective process variable to obtain 4 × b characteristic vectors;
(4) screening the 4 x b characteristic vectors and corresponding MRR values by adopting a regression correlation analysis method, and determining m characteristic vectors as input characteristic vectors of the GMDH neural network model after setting a correlation coefficient threshold;
when the polishing sample data set is a rough polishing sample data set, m is 8, and the input process variable characteristics corresponding to the input characteristic vectors are respectively as follows: a mean value of backing film consumption, a warp of backing film consumption, a mean value of polishing pad consumption, a standard deviation of polishing pad consumption, a warp of polishing pad consumption, a mean value of divisional film consumption, a warp of divisional film consumption, and a mean value of flexible sheet consumption;
or, when the polishing sample data set is the fine polishing sample data set, where m is 8, and the input process variable features corresponding to the input feature vectors are respectively: a warp of backing film consumption, a mean of polishing pad consumption, a standard deviation of polishing pad consumption, a warp of polishing pad consumption, a kurtosis of polishing pad consumption, a mean of zoning film consumption, a standard deviation of zoning film consumption and a warp of zoning film consumption;
(5) carrying out normalization processing on a data set A formed by m feature vectors to obtain a training feature set A ', wherein the sample size of the training feature set A' is n 1 And n is 1 <n;
(6) Adopting a binary quadratic Volterra polynomial regression model, and taking the sample size as n 1 The m feature vectors in the training feature set A' are input layers, the corresponding MRR values in the training feature set are output layers, and a GMDH neural network model is trained and obtained, namely:
Figure FDA0003752223720000021
wherein,
Figure FDA0003752223720000022
for training featuresThe b-th feature vector, x 'in set A' a,b Is the corresponding eigenvalue, y, in the b-th eigenvector in the a-th sample in the training characteristic set A a Is the actual MRR value, n, corresponding to the a-th sample in the training feature set A 1 For training the sample size, a ∈ {1,2, …, n 1 },b∈{1,2,…,m};
(7) And inputting the m characteristic values serving as input in the sample to be tested into the trained GMDH network model, and outputting the predicted MRR value.
2. The method of claim 1, wherein in step (5),
the data set formed by the m feature vectors is a, as follows:
Figure FDA0003752223720000023
wherein (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T For the b-th feature vector, x, in data set A a,b For the corresponding eigenvalue in the b-th eigenvector in the a-th sample in the dataset a, m ═ 8, a ∈ {1,2, …, n }, and b ∈ {1,2, …, m };
the normalization processing means that feature vectors in the data set A are normalized one by one to obtain a normalized data set, wherein the feature vectors refer to feature vectors (x) 1,b ,x 2,b ,…,x a,b ,…,x n,b ) T A is equal to {1,2, …, n }, b is equal to {1,2, …, m }, and the normalized calculation formula is as follows:
Figure FDA0003752223720000024
wherein x is normalized For normalized eigenvalue, x, in the b-th eigenvector actual Is the eigenvalue, x, in the b-th eigenvector max Is the largest eigenvalue, x, in the b-th eigenvector min B ∈ {1,2, …, m } which is the smallest eigenvalue in the b-th eigenvector;
in the normalized dataset, random selection
Figure FDA0003752223720000031
Taking the samples as a training feature set A', and recording as:
Figure FDA0003752223720000032
wherein,
Figure FDA0003752223720000033
is the b-th feature vector, x 'in the training feature set A' a,b Is the corresponding eigenvalue, n, in the b-th eigenvector in the a-th sample in the training characteristic set A 1 For training sample size, m is 8, a ∈ {1,2, …, n 1 },b∈{1,2,…,m}。
3. The method of claim 2, wherein the step (6) of training the GMDH neural network model comprises the following steps:
(61) establishing a 1 st hidden layer:
(611) arbitrarily taking two feature vectors X from 8 feature vectors in the training feature set A i And X j Creating G 1 Second order polynomial equation as the basic neuron, i.e. the total number of the 1 st hidden layer basic neurons
Figure FDA0003752223720000034
Wherein m is 8, and P is the threshold value of the maximum neuron total number in each hidden layer;
(612) respectively calculating and obtaining models corresponding to all basic neurons in the 1 st hidden layer according to Volterra quadratic polynomial regression
Figure FDA0003752223720000035
Figure FDA0003752223720000036
Figure FDA0003752223720000037
Figure FDA0003752223720000038
Figure FDA0003752223720000039
Figure FDA00037522237200000310
Wherein,
Figure FDA0003752223720000041
is the target output vector predictor of the quadratic polynomial equation,
Figure FDA0003752223720000042
refers to a target output predicted value, x 'of the a sample in the r basic neuron model in the 1 st hidden layer' a,j Means that the jth input characteristic value of the ath sample is selected from the input and is used as an element connected to form the 1 st hidden layer and the r th basic neuron model, and a is e {1,2, …, n 1 },b∈{1,2,…,m};
{w 0 ,w 2 ,w 3 ,w 4 ,w 5 Is the coefficient of a quadratic polynomial equation
Figure FDA0003752223720000043
And
Figure FDA0003752223720000044
the minimum difference is used as a target and is obtained by calculation by adopting a least square method; wherein,
Figure FDA0003752223720000045
for training n in the feature set A 1 A vector of actual MRR values for each training sample;
(613) respectively calculating the output root mean square error of each neuron in the 1 st hidden layer
Figure FDA0003752223720000046
A value; namely, it is
Figure FDA0003752223720000047
Figure FDA0003752223720000048
Wherein,
Figure FDA0003752223720000049
for the target output predicted value of the a sample in the r basic neuron model in the 1 st hidden layer,
Figure FDA00037522237200000410
outputting a real value for a target of an a-th sample in an r-th basic neuron model in a 1 st hidden layer;
(614) sequencing all neurons in a hidden layer 1 from small to large in output root mean square error, and taking P neurons sequenced at the front as effective neurons to form a hidden layer 1;
(615) taking each output of P neurons in the 1 st hidden layer as an input feature vector of the 2 nd hidden layer; the 2 nd hidden layer forms G 2 A basic neuron, and
Figure FDA00037522237200000411
repeating the steps (611) to (614) to obtain a 2 nd hidden layer containing P neurons, wherein the 1 st hidden layer is participated in combination and connection to form the 2 nd hidden layer with U 1 A number of effective neurons;
(616) calculating U participating in combination and connection in the 1 st hidden layer to form the 2 nd hidden layer 1 Output of an effective neuron
Figure FDA00037522237200000412
Mean value of 1 I.e. by
Figure FDA00037522237200000413
(62) Establishing a middle hidden layer: outputting each item of P neurons in the (k-1) th hidden layer as an input feature vector of the (k) th hidden layer, and repeating the steps (611) to (616) to obtain a middle hidden layer;
wherein the total number of primitive neurons in the kth hidden layer
Figure FDA0003752223720000051
U participating in combination and connection in the k-1 hidden layer to form a k hidden layer k Effective neuron output
Figure FDA0003752223720000052
Mean value of k I.e. by
Figure FDA0003752223720000053
(63) Establishing an output layer: when E is k-1 -E k When the training time is less than or equal to 0.3, the training is stopped; and 2 output RMSE small neurons in the k hidden layer are used as new input vectors and target output MRR vectors corresponding to the new input vectors to construct a quadratic polynomial equation, and the output of the equation is used as the final output prediction value of the GMDH neural network model.
4. The method of claim 3, wherein P-12 is used for predicting the removal rate of CMP material from wafer by GMDH neural network.
CN202011094499.9A 2020-10-14 2020-10-14 Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network Active CN112257337B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011094499.9A CN112257337B (en) 2020-10-14 2020-10-14 Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011094499.9A CN112257337B (en) 2020-10-14 2020-10-14 Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network

Publications (2)

Publication Number Publication Date
CN112257337A CN112257337A (en) 2021-01-22
CN112257337B true CN112257337B (en) 2022-09-16

Family

ID=74243417

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011094499.9A Active CN112257337B (en) 2020-10-14 2020-10-14 Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network

Country Status (1)

Country Link
CN (1) CN112257337B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114627985B (en) * 2022-01-26 2024-06-21 苏州中砥半导体材料有限公司 Optimization method, system and medium for polishing process of indium phosphide material
CN114358443B (en) * 2022-03-09 2022-06-24 深圳市信润富联数字科技有限公司 Average material removal rate prediction method, device, electronic device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799793A (en) * 2012-07-27 2012-11-28 中国科学院微电子研究所 Method and equipment for calculating chemical mechanical polishing removal rate
CN102945304A (en) * 2012-11-14 2013-02-27 中国科学院微电子研究所 Method for calculating grinding removal rate of wafer surface
TW201539602A (en) * 2014-03-06 2015-10-16 Kla Tencor Corp Statistical overlay error prediction for feed forward and feedback correction of overlay errors, root cause analysis and process control
CN107214610A (en) * 2017-05-05 2017-09-29 天津华海清科机电科技有限公司 The online flatness control system of copper CMP
CN110039440A (en) * 2019-03-27 2019-07-23 中国科学院微电子研究所 A kind of method and device calculating CMP grind clearance
CN110555230A (en) * 2019-07-12 2019-12-10 北京交通大学 rotary machine residual life prediction method based on integrated GMDH framework

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799793A (en) * 2012-07-27 2012-11-28 中国科学院微电子研究所 Method and equipment for calculating chemical mechanical polishing removal rate
CN102945304A (en) * 2012-11-14 2013-02-27 中国科学院微电子研究所 Method for calculating grinding removal rate of wafer surface
TW201539602A (en) * 2014-03-06 2015-10-16 Kla Tencor Corp Statistical overlay error prediction for feed forward and feedback correction of overlay errors, root cause analysis and process control
CN107214610A (en) * 2017-05-05 2017-09-29 天津华海清科机电科技有限公司 The online flatness control system of copper CMP
CN110039440A (en) * 2019-03-27 2019-07-23 中国科学院微电子研究所 A kind of method and device calculating CMP grind clearance
CN110555230A (en) * 2019-07-12 2019-12-10 北京交通大学 rotary machine residual life prediction method based on integrated GMDH framework

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Adaptive virtual metrology for semiconductor chemical mechanical planarization process using GMDH-type polynomial neural networks;Xiaodong Jia, Yuan Di, Jianshe Feng, Qibo Yang, Honghao Dai, Jay;《Journal of Process Control》;20181231;论文第44-54页 *
基于粒子群的RBF神经网络晶圆清洗机过滤器流量预测;王燕燕;《中国优秀博硕士学位论文全文数据库(硕士) 信息科技辑》;20191215;论文第36-44页 *

Also Published As

Publication number Publication date
CN112257337A (en) 2021-01-22

Similar Documents

Publication Publication Date Title
JP4601492B2 (en) Quality prediction system and method for production process
CN112257337B (en) Prediction method for removal rate of wafer CMP (chemical mechanical polishing) material of GMDH (Gaussian mixture distribution) neural network
CN101976045B (en) Panel quality virtual measurement method and system for TFT-LCD etching process
US6594024B1 (en) Monitor CMP process using scatterometry
US8501501B1 (en) Measurement of a sample using multiple models
JP2016224947A (en) Measurement sample extraction method with sampling rate decision mechanism, and computer program product therefor
CN113094893A (en) Wafer quality virtual measurement method and device, computer equipment and storage medium
CN107234495B (en) Establish the method for average material removal rate prediction model and the method for predicted velocity
TW201415003A (en) Optical metrology for in-situ measurements
KR102248777B1 (en) Quantification and reduction of total measurement uncertainty
Di et al. Enhanced virtual metrology on chemical mechanical planarization process using an integrated model and data-driven approach
TW201205440A (en) Virtual measuring system and method thereof for predicting the quality of thin film transistor liquid crystal display processes
TW200401179A (en) Method of predicting processing device condition or processed result
CN108875169A (en) The degeneration modeling of surface vessel equipment digital multimeter and life-span prediction method
TWI647770B (en) Yield rate determination method for wafer and method for multiple variable detection of wafer acceptance test
TWI427487B (en) Method for sampling workpiece for inspection and computer program product performing the same
TW202339897A (en) Shared data induced quality control for a chemical mechanical planarization process
CN109686412B (en) Data coordination processing method and device for metal balance
Eiki et al. Improving Efficiency and Robustness of Gaussian Process Based Outlier Detection via Ensemble Learning
Wang et al. Predicting the Material Removal Rate in Chemical Mechanical Planarization Based on Improved Neural Network
US6922603B1 (en) System and method for quantifying uniformity patterns for tool development and monitoring
CN115867925B (en) System and method for controlling measurement of sample parameters
CN118092322A (en) Model method for predicting optimal process condition and method for controlling semiconductor manufacturing process
Rymarczyk et al. Analysis of data from measuring sensors for prediction in production process control systems
CN117900927B (en) Efficiency monitoring method and system for full-automatic rubber roll polishing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant