CN114998330B - Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium - Google Patents

Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium Download PDF

Info

Publication number
CN114998330B
CN114998330B CN202210894978.1A CN202210894978A CN114998330B CN 114998330 B CN114998330 B CN 114998330B CN 202210894978 A CN202210894978 A CN 202210894978A CN 114998330 B CN114998330 B CN 114998330B
Authority
CN
China
Prior art keywords
wafer
unsupervised
detection data
self
label
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210894978.1A
Other languages
Chinese (zh)
Other versions
CN114998330A (en
Inventor
杨万里
毕海
段江伟
汪伟
柯链宝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202210894978.1A priority Critical patent/CN114998330B/en
Publication of CN114998330A publication Critical patent/CN114998330A/en
Application granted granted Critical
Publication of CN114998330B publication Critical patent/CN114998330B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0004Industrial image inspection
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/75Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated
    • G01N21/76Chemiluminescence; Bioluminescence
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/84Systems specially adapted for particular applications
    • G01N21/88Investigating the presence of flaws or contamination
    • G01N21/8851Scan or image signal processing specially adapted therefor, e.g. for scan signal adjustment, for detecting different kinds of defects, for compensating for structures, markings, edges
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/84Systems specially adapted for particular applications
    • G01N21/88Investigating the presence of flaws or contamination
    • G01N21/95Investigating the presence of flaws or contamination characterised by the material or shape of the object to be examined
    • G01N21/9501Semiconductor wafers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10064Fluorescence image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30108Industrial image inspection
    • G06T2207/30148Semiconductor; IC; Wafer
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Abstract

The invention belongs to the technical field of wafer detection, and discloses an unsupervised wafer defect detection method, device, equipment and storage medium. The method comprises the following steps: acquiring an initial wafer detection data set; utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. By the mode, the pseudo labels are made for the label-free wafer detection data, the label data and the pseudo label data are used for training the target classifier, so that the target classifier learns different defect characteristics, the unknown defect types can be identified, and the detection precision of wafer defects is improved.

Description

Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium
Technical Field
The invention relates to the technical field of wafer detection, in particular to an unsupervised wafer defect detection method, device, equipment and storage medium.
Background
In the Micro LED display technology, multiple fault types are generated in the wafer (wafer) production process due to multiple influences such as differences in preparation equipment, production process, material design, and the like, and there is no coupling between the fault types. Wafer detection data is large in amount and free of labels, and wafer detection data is large in amount and cannot be obtained easily. In the existing wafer detection method, a relatively stable detection model can be trained only by using a large amount of label data, but the acquired label-free data is difficult to use, and only a few defect types can be learned, so that the detection accuracy is not high in unknown defect types.
The above is only for the purpose of assisting understanding of the technical aspects of the present invention, and does not represent an admission that the above is prior art.
Disclosure of Invention
The invention mainly aims to provide an unsupervised wafer defect detection method, device, equipment and storage medium, and aims to solve the technical problems that a relatively stable detection model can be trained only by using a large amount of label data in the conventional wafer detection method, the acquired label-free data is difficult to use, and only a few defect types can be learned, so that the detection precision is not high in the unknown defect types.
In order to achieve the above object, the present invention provides an unsupervised wafer defect detecting method, which comprises the following steps:
acquiring an initial wafer detection data set;
utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set;
training a preset classifier according to the target wafer detection data set to obtain a trained target classifier;
and when receiving the data of the wafer to be detected, detecting the data of the wafer to be detected by using the target classifier to obtain a wafer defect detection result.
Optionally, the preset unsupervised self-encoder and the preset arbiter form a countermeasure self-encoding network;
the method for generating the target wafer detection data set by using the pseudo label made by the preset unsupervised self-encoder for the label-free wafer detection data in the initial wafer detection data set comprises the following steps:
inputting unlabeled wafer inspection data in the initial wafer inspection data set to the countering self-coding network;
predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data;
and generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set.
Optionally, before the preset unsupervised auto-encoder is used to make a pseudo tag for the unlabeled wafer inspection data in the initial wafer inspection dataset and generate the target wafer inspection dataset, the method further includes:
and performing preliminary training on an initial discriminator according to the labeled wafer detection data in the initial wafer detection data set to obtain a preset discriminator.
Optionally, the preset unsupervised auto-encoder is formed by a stack of multiple auto-encoders, wherein the multiple auto-encoders are provided with different activation functions.
Optionally, the generating a target wafer inspection dataset by using a preset unsupervised auto-encoder to make a pseudo tag for the unlabeled wafer inspection data in the initial wafer inspection dataset includes:
predicting labels of the label-free wafer detection data in the initial wafer detection data set by using a plurality of self-encoders to obtain a plurality of prediction results;
and determining a pseudo label corresponding to the label-free wafer detection data according to the plurality of prediction results to generate a target wafer detection data set.
Optionally, after the labels of the unlabeled wafer inspection data in the initial wafer inspection dataset are respectively predicted by using a plurality of self-encoders to obtain a plurality of prediction results, the method further includes:
determining prediction accuracies respectively corresponding to the plurality of self-encoders;
deleting the self-encoders with the prediction accuracy lower than a preset threshold value from the self-encoders to obtain a plurality of residual self-encoders;
and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders to generate a target wafer detection data set.
Optionally, the determining, according to the prediction results corresponding to the multiple remaining self-encoders, the pseudo label corresponding to the unlabeled wafer inspection data to generate a target wafer inspection data set includes:
determining a weight value corresponding to each residual self-encoder according to the prediction accuracy corresponding to the residual self-encoders;
and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders and the weight values, and generating a target wafer detection data set.
In addition, in order to achieve the above object, the present invention further provides an unsupervised wafer defect detecting apparatus, including:
the acquisition module is used for acquiring an initial wafer detection data set;
the label generation module is used for making a pseudo label for the label-free wafer detection data in the initial wafer detection data set by using a preset unsupervised self-encoder to generate a target wafer detection data set;
the training module is used for training a preset classifier according to the target wafer detection data set to obtain a trained target classifier;
and the detection module is used for detecting the wafer data to be detected by using the target classifier when receiving the wafer data to be detected to obtain a wafer defect detection result.
In addition, in order to achieve the above object, the present invention further provides an unsupervised wafer defect detecting apparatus, including: a memory, a processor, and an unsupervised wafer defect detection program stored on the memory and executable on the processor, the unsupervised wafer defect detection program configured to implement the unsupervised wafer defect detection method as described above.
In addition, to achieve the above object, the present invention further provides a storage medium having an unsupervised wafer defect detecting program stored thereon, wherein the unsupervised wafer defect detecting program, when executed by a processor, implements the unsupervised wafer defect detecting method as described above.
The method comprises the steps of obtaining an initial wafer detection data set; utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. By the mode, the pseudo labels are made for the label-free wafer detection data, the label data and the pseudo label data are used for training the target classifier, so that the target classifier learns different defect characteristics, the unknown defect types can be identified, and the detection precision of wafer defects is improved.
Drawings
FIG. 1 is a schematic structural diagram of an unsupervised wafer defect inspection apparatus for a hardware operating environment according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart illustrating a method for unsupervised wafer defect inspection according to a first embodiment of the present invention;
FIG. 3 is a flowchart illustrating a second embodiment of an unsupervised wafer defect inspection method according to the present invention;
FIG. 4 is a schematic diagram of a model layout of an unsupervised wafer defect inspection method according to an embodiment of the present invention;
FIG. 5 is a flowchart illustrating a method for unsupervised wafer defect inspection according to a third embodiment of the present invention;
FIG. 6 is a schematic diagram of an integration strategy of an unsupervised wafer defect detection method according to an embodiment of the present invention;
FIG. 7 is a block diagram of an unsupervised wafer defect inspection apparatus according to a first embodiment of the present invention.
The implementation, functional features and advantages of the present invention will be further described with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Referring to fig. 1, fig. 1 is a schematic structural diagram of an unsupervised wafer defect detecting apparatus in a hardware operating environment according to an embodiment of the present invention.
As shown in fig. 1, the unsupervised wafer defect inspection apparatus may include: a processor 1001, such as a Central Processing Unit (CPU), a communication bus 1002, a user interface 1003, a network interface 1004, and a memory 1005. Wherein a communication bus 1002 is used to enable connective communication between these components. The user interface 1003 may include a Display screen (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface, a wireless interface. The network interface 1004 may optionally include a standard wired interface, a Wireless interface (e.g., a Wireless-Fidelity (Wi-Fi) interface). The Memory 1005 may be a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as a disk Memory. The memory 1005 may alternatively be a storage device separate from the processor 1001.
Those skilled in the art will appreciate that the configuration shown in fig. 1 does not constitute a limitation of an unsupervised wafer defect inspection apparatus and may include more or fewer components than shown, or some components in combination, or a different arrangement of components.
As shown in fig. 1, a memory 1005 as a storage medium may include an operating system, a network communication module, a user interface module, and an unsupervised wafer defect inspection program.
In the unsupervised wafer defect inspection apparatus shown in fig. 1, the network interface 1004 is mainly used for data communication with a network server; the user interface 1003 is mainly used for data interaction with a user; the processor 1001 and the memory 1005 of the unsupervised wafer defect inspection apparatus of the present invention may be disposed in the unsupervised wafer defect inspection apparatus, and the unsupervised wafer defect inspection apparatus calls the unsupervised wafer defect inspection program stored in the memory 1005 through the processor 1001 and executes the unsupervised wafer defect inspection method provided by the embodiment of the present invention.
An embodiment of the present invention provides an unsupervised wafer defect detecting method, and referring to fig. 2, fig. 2 is a schematic flow chart of a first embodiment of the unsupervised wafer defect detecting method according to the present invention.
In this embodiment, the unsupervised wafer defect detection method includes the following steps:
step S10: an initial wafer inspection dataset is obtained.
It should be understood that the main implementation body of this embodiment is unsupervised wafer defect detection equipment, and the unsupervised wafer defect detection equipment may be equipment such as a computer and a server, and may also be other equipment with inference capability, which is not limited in this embodiment.
It should be noted that the initial wafer inspection data set includes a small amount of labeled wafer inspection data and a large amount of unlabeled wafer inspection data, and in a specific implementation, the wafer inspection data are photoluminescence images corresponding to a plurality of defective wafers acquired by using a Photoluminescence (PL) technology, and a detection frame is framed in the corresponding photoluminescence images in advance according to a known defect type and corresponding defect type information is labeled to obtain the labeled wafer inspection data.
Step S20: and utilizing a preset unsupervised self-encoder to make pseudo labels for the unlabeled wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set.
It should be understood that the preset unsupervised self-encoder infers the hidden structural features inside the data by using the unlabeled wafer detection data, and distinguishes the wafer detection data of different defect types, so as to make pseudo labels for a large amount of unlabeled wafer detection data, and generate a target wafer detection data set according to a small amount of labeled wafer detection data and a large amount of pseudo label wafer detection data. In a specific implementation, the predetermined unsupervised self-encoder distinguishes data differences between different defect types according to a spatial scale distance function, such as a mean square error function (MSE) or a root mean square error function (RMSE), where the mean square error function is expressed as:
Figure 596716DEST_PATH_IMAGE001
the root mean square error function is expressed as:
Figure 863749DEST_PATH_IMAGE002
preferably, the preset unsupervised self-encoder and the preset discriminator are arranged to form an anti-self-encoding network, and in the process of training and learning a large amount of unlabeled wafer detection data by the preset unsupervised self-encoder, the preset unsupervised self-encoder and the preset discriminator transmit weights through back propagation and are mutually restricted. Specifically, the preset discriminator is used for discriminating the authenticity of the pseudo label, so that whether the training of the preset unsupervised self-encoder is correct or not is determined, the pseudo label generated by the preset unsupervised self-encoder is as close to the result of the preset discriminator as possible, and the quality of the pseudo label generated by the self-encoder is improved.
Preferably, the preset discriminator is initialized in advance according to a small amount of labeled wafer detection data, so that the preset discriminator can master partial characteristics of real data, rapid fitting during model training is facilitated, and back propagation gradient dissipation is prevented.
On the other hand, in the embodiment, a plurality of depth self-encoders are arranged to be stacked to form a preset unsupervised self-encoder, thereby effectively avoiding the randomness of the network, and each depth self-encoder corresponds to an activation function and provides the nonlinear modeling capability. And aiming at the output results of the multiple depth self-encoders, combining the output results according to a preset integration strategy so as to determine the prediction result of the preset unsupervised self-encoders, and manufacturing pseudo labels for the label-free wafer detection data according to the prediction result.
Step S30: and training a preset classifier according to the target wafer detection data set to obtain the trained target classifier.
It should be understood that the target wafer inspection dataset includes a small amount of labeled wafer inspection data and a large amount of pseudo-labeled wafer inspection data, and the preset classifier is trained through the target wafer inspection dataset, so that the preset classifier learns the features for distinguishing the defect types. The preset classifier of this embodiment may be a convolutional network classifier, the convolutional network classifier is used to detect the wafer image, output a prediction result, determine a corresponding loss function value according to the prediction result and the tag information, perform iterative training until the loss function value is lower than a certain value or the current iteration number is greater than the maximum iteration number, perform defect stratification on the target wafer detection data set through the convolutional network classifier to achieve defect classification, and obtain a trained convolutional network classifier.
Step S40: and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result.
It should be noted that the wafer data to be detected is a photoluminescence image acquired by a Photoluminescence (PL) technology, and is detected by using a trained target classifier, so as to determine whether the currently acquired data has defects, and determine a corresponding defect type.
In the embodiment, an initial wafer detection data set is obtained; utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. By the method, the pseudo labels are made for the label-free wafer detection data, the target classifier is trained by using the labeled data and the pseudo label data, so that the target classifier learns different defect characteristics, the unknown defect types can be identified, and the detection precision of the wafer defects is improved.
Referring to fig. 3, fig. 3 is a flowchart illustrating an unsupervised wafer defect inspection method according to a second embodiment of the present invention.
Based on the first embodiment, the predetermined unsupervised self-encoder and the predetermined discriminator in the unsupervised wafer defect detecting method of the embodiment form an anti-self-encoding network;
the step S20 includes:
step S201: inputting unlabeled wafer inspection data in the initial wafer inspection dataset into the countering self-coding network.
Step S202: and predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data.
It should be understood that, in the embodiment, the discriminator is used to form a countervailing self-encoding network in combination with the self-encoder, and the countervailing self-encoding network is used to perform classification prediction on a large amount of unlabeled wafer detection data, so as to distinguish the unlabeled wafer detection data into different defect types, and at the same time, the discriminator is used to improve the quality of generating the pseudo labels from the self-encoder. The weight is transmitted by the aid of back propagation in the training process of the preset unsupervised self-encoder and the preset discriminator, and mutual restriction is achieved. The discriminator determines whether the self-encoder training is correct or not by discriminating the authenticity of the pseudo label, and the pseudo label generated by the self-encoder is as close as possible to the result of the discriminator, so that the corresponding pseudo label is made for each non-label wafer detection data.
Step S203: and generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set.
In a specific implementation, a corresponding pseudo label is marked for the non-label wafer detection data in the initial wafer detection data set, and a target wafer detection data set including label wafer detection data and pseudo label wafer detection data is generated.
Further, before the step S20, the method further includes: and performing preliminary training on an initial discriminator according to the labeled wafer detection data in the initial wafer detection data set to obtain a preset discriminator.
It should be noted that, the discriminator cannot initially discriminate whether the data is true, so in this embodiment, before training the anti-self-coding network, the discriminator is initially learned, and the discriminator is initially trained by using a small amount of labeled wafer detection data, so that the discriminator can grasp partial characteristics of true data, which is beneficial to fast fitting during model training and prevents back propagation gradient dissipation.
Referring to fig. 4, fig. 4 is a schematic model layout diagram of an unsupervised wafer defect detection method according to an embodiment of the present invention, in this embodiment, a small number of labeled wafer data sets are used to initialize an unsupervised hierarchical learner, a pseudo label data set is created for a large number of wafer data sets by stacking an unsupervised self-encoder, a confrontation discriminator is trained to improve the quality of pseudo labels generated by the self-encoder, and finally, a convolutional network classifier is trained by using a large number of pseudo label data and labeled data, so as to improve detection accuracy.
In the embodiment, an initial wafer detection data set is obtained; inputting the unlabeled wafer detection data in the initial wafer detection data set into a counterself-coding network; predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data; generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. Through the mode, the discriminator is arranged to restrain the prediction result of the unsupervised self-encoder, the generation quality of the pseudo label is improved, the labeled data and the pseudo label data are used for training the target classifier, the target classifier learns different defect characteristics, the unknown defect types can be recognized, and the training precision and the recognition precision of the target classifier are further improved.
Referring to fig. 5, fig. 5 is a flowchart illustrating an unsupervised wafer defect inspection method according to a third embodiment of the present invention.
Based on the first embodiment, the preset unsupervised self-encoder in the unsupervised wafer defect detecting method of the present embodiment is formed by stacking a plurality of self-encoders, wherein the plurality of self-encoders are set with different activation functions.
The step S20 includes:
step S204: and respectively predicting the labels of the label-free wafer detection data in the initial wafer detection data set by using a plurality of self-encoders to obtain a plurality of prediction results.
It can be understood that, in order to improve generalization performance and enable the model to learn deep features of the input data, in the embodiment, multiple depth Automatic Encoders (AEs) are stacked to form a preset unsupervised auto-encoder, which can effectively avoid randomness of the network. The self-encoders are respectively provided with different activation functions, and the activation functions are used for providing nonlinear modeling capability.
In a particular implementation, the activation function is represented as
Figure 531491DEST_PATH_IMAGE003
Training sample
Figure 200370DEST_PATH_IMAGE004
Determining hidden variables by activating functions for input data x
Figure 662575DEST_PATH_IMAGE005
Specifically, it is expressed by the following formula:
Figure 416904DEST_PATH_IMAGE006
Figure 153916DEST_PATH_IMAGE007
wherein the training of the self-encoder aims at optimizing the parameter set
Figure 146143DEST_PATH_IMAGE008
The reconstruction error is minimized. For the loss function of the data reconstruction error, the present embodiment is set as a traditional mean square error loss function (MSE) and a cross entropy function, where the cross entropy cost function exhibits a faster convergence speed and a stronger global optimization capability. In the case of a training sample for unlabeled m-dimensional data, the cross-entropy cost function is defined as:
Figure 575987DEST_PATH_IMAGE009
preferably, considering the advantage of the activation function as a sparse processing mode in learning useful features of the model, in this embodiment, the regularized cross entropy in the automatic encoders of different activation functions rewrites the loss function of an unlabeled m-dimensional training sample into:
Figure 817613DEST_PATH_IMAGE010
it should be noted that, the stacked multiple autoencoders are trained according to the above loss function, so that the multiple autoencoders perform feature layering on a large amount of unlabeled wafer inspection data.
Step S205: and determining a pseudo label corresponding to the label-free wafer detection data according to the plurality of prediction results to generate a target wafer detection data set.
It should be understood that an integration strategy is set to integrate the plurality of prediction results, a pseudo label corresponding to each non-label wafer detection data is determined, and a target wafer detection data set is generated according to the labeled wafer detection data and the pseudo label wafer detection data. Optionally, the integrated voting strategy is a majority voting strategy.
Further, after the step S205, the method further includes: determining prediction accuracies respectively corresponding to the plurality of self-encoders; deleting the self-encoders with the prediction accuracy lower than a preset threshold value from the self-encoders to obtain a plurality of residual self-encoders; and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders to generate a target wafer detection data set.
In a specific implementation, since the majority voting policy has a defect that all the individual models have the same weight, for multiple types of defect data, the data is easily interfered by some network layer to be useless, preferably, the present embodiment sets a preset threshold for screening the stacked multiple autoencoders, where the preset threshold may be an accuracy threshold obtained by using the majority voting policy. Referring to fig. 6, fig. 6 is an integrated strategy diagram of an embodiment of the unsupervised wafer defect detection method of the present invention, in this embodiment, the accuracy judgment is performed according to a preset threshold, only a single network with an accuracy exceeding the threshold is considered in the training, other networks are discarded, and a final output result is determined according to a prediction result of an auto-encoder with an accuracy exceeding the preset threshold. Optionally, the prediction results of the remaining self-encoders are averaged to obtain a final prediction result, so as to determine a pseudo label corresponding to each non-label wafer detection data, and generate a target wafer detection data set.
Further, the determining, according to the prediction results corresponding to the multiple residual encoders, the pseudo label corresponding to the unlabeled wafer inspection data to generate a target wafer inspection data set includes: determining a weight value corresponding to each residual self-encoder according to the prediction accuracy corresponding to the residual self-encoders; and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders and the weight values, and generating a target wafer detection data set.
It should be noted that, in this embodiment, a corresponding weight value is assigned to a network output meeting a requirement according to a corresponding prediction accuracy, and a comprehensive diagnosis result of each sample is determined according to the weight value. In a particular implementation, multiple self-encoders in a stack are trained multiple times in order to maintain a stable combined diagnostic result.
In the embodiment, an initial wafer detection data set is obtained; respectively predicting the labels of the label-free wafer detection data in the initial wafer detection data set by using a plurality of self-encoders to obtain a plurality of prediction results; determining a pseudo label corresponding to the label-free wafer detection data according to the plurality of prediction results to generate a target wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. Through the mode, the unsupervised self-encoder formed by stacking the self-encoders is used for generating the pseudo labels, the influence of network randomness on the generation of the pseudo labels is avoided, the generation quality of the pseudo labels is improved, the labeled data and the pseudo label data are used for training the target classifier, the target classifier learns different defect characteristics, unknown defect types can be identified, and the training precision and the identification precision of the target classifier are further improved.
In addition, an embodiment of the present invention further provides a storage medium, where an unsupervised wafer defect inspection program is stored on the storage medium, and when executed by a processor, the unsupervised wafer defect inspection program implements the unsupervised wafer defect inspection method described above.
Since the storage medium adopts all technical solutions of all the above embodiments, at least all the beneficial effects brought by the technical solutions of the above embodiments are achieved, and details are not repeated herein.
Referring to fig. 7, fig. 7 is a block diagram illustrating a structure of an unsupervised wafer defect inspection apparatus according to a first embodiment of the present invention.
As shown in fig. 7, an unsupervised wafer defect detecting apparatus according to an embodiment of the present invention includes:
an obtaining module 10 is configured to obtain an initial wafer inspection dataset.
And a label generating module 20, configured to use a preset unsupervised auto-encoder to generate a pseudo label for the unlabeled wafer inspection data in the initial wafer inspection dataset, so as to generate a target wafer inspection dataset.
And the training module 30 is configured to train a preset classifier according to the target wafer detection data set to obtain a trained target classifier.
And the detection module 40 is configured to detect the wafer data to be detected by using the target classifier when the wafer data to be detected is received, so as to obtain a wafer defect detection result.
It should be understood that the above is only an example, and the technical solution of the present invention is not limited in any way, and in a specific application, a person skilled in the art may set the technical solution as needed, and the present invention is not limited thereto.
In the embodiment, an initial wafer detection data set is obtained; utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set; training a preset classifier according to the target wafer detection data set to obtain a trained target classifier; and when the wafer data to be detected is received, detecting the wafer data to be detected by using the target classifier to obtain a wafer defect detection result. By the mode, the pseudo labels are made for the label-free wafer detection data, the label data and the pseudo label data are used for training the target classifier, so that the target classifier learns different defect characteristics, the unknown defect types can be identified, and the detection precision of wafer defects is improved.
It should be noted that the above-described work flows are only exemplary, and do not limit the scope of the present invention, and in practical applications, a person skilled in the art may select some or all of them to achieve the purpose of the solution of the embodiment according to actual needs, and the present invention is not limited herein.
In addition, the technical details that are not described in detail in this embodiment may refer to the unsupervised wafer defect detection method provided in any embodiment of the present invention, and are not described herein again.
In one embodiment, the predetermined unsupervised self-encoder and the predetermined arbiter form a robust self-encoding network;
the tag generation module 20 is further configured to input the non-tag wafer inspection data in the initial wafer inspection data set to the countermeasure self-encoding network; predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data; and generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set.
In an embodiment, the training module 30 is further configured to perform a preliminary training on an initial discriminator according to the labeled wafer detection data in the initial wafer detection data set, so as to obtain a preset discriminator.
In an embodiment, the pre-set unsupervised auto-encoder is formed by a stack of multiple auto-encoders, wherein the multiple auto-encoders are provided with different activation functions.
In an embodiment, the label generating module 20 is further configured to predict labels of the unlabeled wafer inspection data in the initial wafer inspection data set by using a plurality of self-encoders, respectively, to obtain a plurality of prediction results; and determining a pseudo label corresponding to the label-free wafer detection data according to the plurality of prediction results to generate a target wafer detection data set.
In an embodiment, the tag generating module 20 is further configured to determine prediction accuracies corresponding to the plurality of self-encoders, respectively; deleting the self-encoders with the prediction accuracy lower than a preset threshold value from the self-encoders to obtain a plurality of residual self-encoders; and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders to generate a target wafer detection data set.
In an embodiment, the tag generating module 20 is further configured to determine a weight value corresponding to each remaining self-encoder according to the prediction accuracy corresponding to the plurality of remaining self-encoders; and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders and the weight values, and generating a target wafer detection data set.
Furthermore, it should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or system that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or system. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or system that comprises the element.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solution of the present invention or portions thereof that contribute to the prior art may be embodied in the form of a software product, where the computer software product is stored in a storage medium (e.g. Read Only Memory (ROM)/RAM, magnetic disk, optical disk), and includes several instructions for enabling a terminal device (e.g. a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (9)

1. An unsupervised wafer defect detection method, comprising:
acquiring an initial wafer detection data set;
utilizing a preset unsupervised self-encoder to make a pseudo label for the label-free wafer detection data in the initial wafer detection data set, and generating a target wafer detection data set;
training a preset classifier according to the target wafer detection data set to obtain a trained target classifier;
when wafer data to be detected are received, the target classifier is used for detecting the wafer data to be detected, and a wafer defect detection result is obtained;
the preset unsupervised self-encoder and the preset discriminator form an anti-self-encoding network;
the generating of the target wafer detection data set by using the preset unsupervised auto-encoder to make the pseudo-label for the unlabeled wafer detection data in the initial wafer detection data set comprises:
inputting unlabeled wafer inspection data in the initial wafer inspection data set to the countering self-coding network;
predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data;
and generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set.
2. The unsupervised wafer defect detection method of claim 1, wherein before generating the target wafer inspection data set, the method further comprises:
and performing preliminary training on an initial discriminator according to the labeled wafer detection data in the initial wafer detection data set to obtain a preset discriminator.
3. The unsupervised wafer defect detection method of claim 1, wherein the preset unsupervised self-encoder is formed by a stack of a plurality of self-encoders, wherein the plurality of self-encoders are provided with different activation functions.
4. The unsupervised wafer defect detection method of claim 3, wherein generating the target wafer inspection dataset by creating pseudo-labels for the unlabeled wafer inspection data in the initial wafer inspection dataset using a predetermined unsupervised self-encoder comprises:
predicting labels of the label-free wafer detection data in the initial wafer detection data set by using a plurality of self-encoders to obtain a plurality of prediction results;
and determining a pseudo label corresponding to the label-free wafer detection data according to the plurality of prediction results to generate a target wafer detection data set.
5. The unsupervised wafer defect inspection method of claim 4, wherein after the labels of the unlabeled wafer inspection data in the initial wafer inspection data set are respectively predicted by a plurality of self-encoders to obtain a plurality of prediction results, the method further comprises:
determining prediction accuracies respectively corresponding to the plurality of self-encoders;
deleting the self-encoders with the prediction accuracy lower than a preset threshold value from the self-encoders to obtain a plurality of residual self-encoders;
and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders to generate a target wafer detection data set.
6. The unsupervised wafer defect inspection method of claim 5, wherein said determining the pseudo label corresponding to the unlabeled wafer inspection data according to the prediction results corresponding to the plurality of residual autoencoders to generate a target wafer inspection data set, comprises:
determining a weight value corresponding to each residual self-encoder according to the prediction accuracy corresponding to the residual self-encoders;
and determining a pseudo label corresponding to the label-free wafer detection data according to the prediction results corresponding to the residual self-encoders and the weight values, and generating a target wafer detection data set.
7. An unsupervised wafer defect inspection device, comprising:
the acquisition module is used for acquiring an initial wafer detection data set;
the label generation module is used for making a pseudo label for the label-free wafer detection data in the initial wafer detection data set by using a preset unsupervised self-encoder to generate a target wafer detection data set;
the training module is used for training a preset classifier according to the target wafer detection data set to obtain a trained target classifier;
the detection module is used for detecting the data of the wafer to be detected by using the target classifier when the data of the wafer to be detected is received, so as to obtain a wafer defect detection result;
the preset unsupervised self-encoder and the preset discriminator form an anti-self-encoding network;
the label generation module is further configured to input the label-free wafer detection data in the initial wafer detection data set to the countermeasure self-encoding network; predicting the label of the label-free wafer detection data by using a preset unsupervised self-encoder in the antagonistic self-encoding network, and constraining the prediction result of the preset unsupervised self-encoder by using a preset discriminator in the antagonistic self-encoding network to obtain a pseudo label corresponding to the label-free wafer detection data; and generating a target wafer detection data set according to the pseudo label and the initial wafer detection data set.
8. An unsupervised wafer defect inspection apparatus, the apparatus comprising: a memory, a processor, and an unsupervised wafer defect detection program stored on the memory and executable on the processor, the unsupervised wafer defect detection program configured to implement the unsupervised wafer defect detection method of any of claims 1-6.
9. A storage medium having an unsupervised wafer defect inspection program stored thereon, the unsupervised wafer defect inspection program when executed by a processor implementing the unsupervised wafer defect inspection method of any of claims 1-6.
CN202210894978.1A 2022-07-28 2022-07-28 Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium Active CN114998330B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210894978.1A CN114998330B (en) 2022-07-28 2022-07-28 Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210894978.1A CN114998330B (en) 2022-07-28 2022-07-28 Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium

Publications (2)

Publication Number Publication Date
CN114998330A CN114998330A (en) 2022-09-02
CN114998330B true CN114998330B (en) 2022-11-01

Family

ID=83020972

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210894978.1A Active CN114998330B (en) 2022-07-28 2022-07-28 Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium

Country Status (1)

Country Link
CN (1) CN114998330B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115631178B (en) * 2022-11-03 2023-11-10 昆山润石智能科技有限公司 Automatic wafer defect detection method, system, equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107316049A (en) * 2017-05-05 2017-11-03 华南理工大学 A kind of transfer learning sorting technique based on semi-supervised self-training
CN111832570A (en) * 2020-07-02 2020-10-27 北京工业大学 Image semantic segmentation model training method and system
CN114240882A (en) * 2021-12-16 2022-03-25 深圳市商汤科技有限公司 Defect detection method and device, electronic equipment and storage medium
CN114299034A (en) * 2021-12-30 2022-04-08 杭州海康威视数字技术股份有限公司 Defect detection model training method, defect detection method and device

Also Published As

Publication number Publication date
CN114998330A (en) 2022-09-02

Similar Documents

Publication Publication Date Title
CN111368896B (en) Hyperspectral remote sensing image classification method based on dense residual three-dimensional convolutional neural network
CN108052911B (en) Deep learning-based multi-mode remote sensing image high-level feature fusion classification method
CN106290378B (en) Defect classification method and defect inspecting system
TW201734955A (en) Generating simulated output for a specimen
CN110046706B (en) Model generation method and device and server
CN112016635B (en) Device type identification method and device, computer device and storage medium
CN110728295B (en) Semi-supervised landform classification model training and landform graph construction method
JP6107531B2 (en) Feature extraction program and information processing apparatus
CN115170449B (en) Multi-mode fusion scene graph generation method, system, equipment and medium
CN113868497A (en) Data classification method and device and storage medium
CN112633310A (en) Method and system for classifying sensor data with improved training robustness
Wang et al. Hand-drawn electronic component recognition using deep learning algorithm
CN114998330B (en) Unsupervised wafer defect detection method, unsupervised wafer defect detection device, unsupervised wafer defect detection equipment and storage medium
CN114048468A (en) Intrusion detection method, intrusion detection model training method, device and medium
Pham et al. Unsupervised training of Bayesian networks for data clustering
CN110263808B (en) Image emotion classification method based on LSTM network and attention mechanism
CN115062779A (en) Event prediction method and device based on dynamic knowledge graph
CN114332075A (en) Rapid structural defect identification and classification method based on lightweight deep learning model
CN109101984B (en) Image identification method and device based on convolutional neural network
CN116402777B (en) Power equipment detection method and system based on machine vision
CN112529025A (en) Data processing method and device
CN116108363A (en) Incomplete multi-view multi-label classification method and system based on label guidance
CN115858388A (en) Test case priority ordering method and device based on variation model mapping chart
CN113033716A (en) Image mark estimation method based on confrontation fusion crowdsourcing label
CN114510592A (en) Image classification method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant