CN118378024A

CN118378024A - Abnormality detection method and device based on wafer manufacturing and electronic device

Info

Publication number: CN118378024A
Application number: CN202410805475.1A
Authority: CN
Inventors: 邱海祥; 姜辉; 王啸飞; 沈海超; 樊星
Original assignee: Hangzhou Guangli Microelectronics Co ltd
Current assignee: Hangzhou Guangli Microelectronics Co ltd
Priority date: 2024-06-21
Filing date: 2024-06-21
Publication date: 2024-07-23
Anticipated expiration: 2044-06-21
Also published as: CN118378024B

Abstract

The present application relates to an abnormality detection method, device and electronic device based on wafer manufacturing, and the abnormality detection method includes: preprocessing the training data, and then training the target deep learning model based on the preprocessed training data and bidirectional restricted adversarial learning. Acquire and preprocess the time series data to be detected, and determine the reconstruction error corresponding to the time series data to be detected according to the pre-trained target deep learning model; determine the abnormal sequence according to the reconstruction error, and evaluate the abnormal score in the abnormal sequence according to the sliding window of the preset sequence length, and determine the abnormal data of the target wafer equipment, which is conducive to capturing and identifying various types of abnormal situations, thereby improving the recognition efficiency of abnormal data in the time series data to be detected.

Description

Anomaly detection method, device and electronic device based on wafer manufacturing

技术领域Technical Field

本申请涉及半导体芯片领域，特别是涉及一种基于晶圆制造的异常检测方法、装置和电子装置。The present application relates to the field of semiconductor chips, and in particular to an abnormality detection method, device and electronic device based on wafer manufacturing.

背景技术Background technique

在现有的半导体芯片制造过程中，晶圆制造是一个复杂、昂贵和漫长的过程，涉及数百个工艺步骤，需要同时监控相应的工艺参数变化。随着包括半导体设备精密度和复杂度的提高，对晶圆制造过程中故障的检测和分类质量提出了更高的要求，即需要及时发现和识别潜在的异常和故障。In the existing semiconductor chip manufacturing process, wafer manufacturing is a complex, expensive and lengthy process involving hundreds of process steps, which requires simultaneous monitoring of corresponding process parameter changes. With the increase in the precision and complexity of semiconductor equipment, higher requirements are placed on the quality of fault detection and classification in the wafer manufacturing process, that is, the need to promptly discover and identify potential anomalies and faults.

目前，相关技术对晶圆制造过程中常用的故障检测和分类，是通过创建模型提取传感器原始数据中的特征值信息，并对特征值信息进行统计。由于半导体芯片的复杂性和精密度，传统的基于规则或统计的检测方法往往无法满足精确性和效率的要求。在晶圆制造过程中，不同传感器的原始数据呈现多样化，芯片工程师很难对不同传感器的单个时间点是否异常进行标记与判断，导致在晶圆制造过程中异常检测的精度较低。At present, the relevant technologies for fault detection and classification commonly used in the wafer manufacturing process are to extract the characteristic value information in the raw data of the sensor by creating a model and perform statistics on the characteristic value information. Due to the complexity and precision of semiconductor chips, traditional rule-based or statistical detection methods often cannot meet the requirements of accuracy and efficiency. In the wafer manufacturing process, the raw data of different sensors are diverse, and it is difficult for chip engineers to mark and judge whether a single time point of different sensors is abnormal, resulting in low accuracy of abnormality detection in the wafer manufacturing process.

针对相关技术中存在晶圆制造过程中的异常检测精度有待提升的问题，目前还没有提出有效的解决方案。With regard to the problem in related technologies that the accuracy of abnormality detection in the wafer manufacturing process needs to be improved, no effective solution has been proposed so far.

发明内容Summary of the invention

在本实施例中提供了一种基于晶圆制造的异常检测方法、装置和电子装置，以解决相关技术中在晶圆制造过程中的异常检测精度有待提升的问题。In this embodiment, a method, device and electronic device for abnormality detection based on wafer manufacturing are provided to solve the problem in the related art that the accuracy of abnormality detection in the wafer manufacturing process needs to be improved.

第一个方面，在本实施例中提供了一种基于晶圆制造的异常检测方法，所述方法包括：In a first aspect, a method for detecting anomalies based on wafer manufacturing is provided in this embodiment, and the method includes:

获取待检测时序数据；所述待检测时序数据包括晶圆设备的传感器数据；Acquire time series data to be detected; the time series data to be detected includes sensor data of wafer equipment;

根据预先训练好的目标深度学习模型，确定所述待检测时序数据对应的重构误差；所述目标深度学习模型根据预设的对抗学习方法训练得到；Determine the reconstruction error corresponding to the time series data to be detected according to a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method;

根据所述重构误差确定异常序列，并根据预设序列长度的滑动窗口对所述异常序列中的异常分值进行评估，确定目标晶圆设备异常数据。An abnormal sequence is determined according to the reconstruction error, and an abnormal score in the abnormal sequence is evaluated according to a sliding window of a preset sequence length to determine abnormal data of a target wafer device.

在其中的一些实施例中，所述获取待检测时序数据之前，包括：In some of the embodiments, before acquiring the time series data to be detected, the process includes:

获取晶圆制造过程中的多个训练时序数据，对多个所述训练时序数据进行处理，得到多个统一训练时序数据；Acquire multiple training time series data in a wafer manufacturing process, and process the multiple training time series data to obtain multiple unified training time series data;

确定多个所述统一训练时序数据中的密集序列数据；所述密集序列数据包括符合正态分布的序列数据；Determine dense sequence data in the plurality of unified training time series data; the dense sequence data includes sequence data conforming to a normal distribution;

根据所述密集序列数据，以及所述预设的对抗学习方法，对所述预设的深度学习模型进行训练，确定目标深度学习模型；所述预设的深度学习模型基于晶圆传感器的波形构成。According to the dense sequence data and the preset adversarial learning method, the preset deep learning model is trained to determine a target deep learning model; the preset deep learning model is based on the waveform of the wafer sensor.

在其中的一些实施例中，所述根据预先训练好的目标深度学习模型，确定所述待检测时序数据对应的重构误差之前，包括：In some of the embodiments, before determining the reconstruction error corresponding to the time series data to be detected according to the pre-trained target deep learning model, the method includes:

根据预设的采样频率在晶圆的传感器上获取初始时序数据，Acquire initial timing data from the sensor on the wafer according to the preset sampling frequency.

根据所述密集序列数据，对所述初始时序数据进行处理，得到待检测时序数据。The initial time series data is processed according to the dense sequence data to obtain the time series data to be detected.

在其中的一些实施例中，所述根据预先训练好的目标深度学习模型，确定所述待检测时序数据对应的重构误差，包括：In some embodiments, determining the reconstruction error corresponding to the time series data to be detected according to a pre-trained target deep learning model includes:

将所述待检测时序数据映射至预设的潜在空间，得到编码向量；Mapping the time series data to be detected to a preset latent space to obtain a coding vector;

当所述编码向量在预设的阈值范围内时，对所述编码向量进行处理，并生成重构序列；When the encoding vector is within a preset threshold range, processing the encoding vector and generating a reconstructed sequence;

将所述重构序列的数据与所述待检测时序数据之间的差值确定为重构误差。The difference between the data of the reconstructed sequence and the time series data to be detected is determined as a reconstruction error.

在其中的一些实施例中，所述根据所述重构误差确定异常序列，包括：In some embodiments, determining the abnormal sequence according to the reconstruction error includes:

对所述待检测时序数据中多个时间点对应的重构误差进行归一化处理，得到多个时间点的误差分值；Normalizing the reconstruction errors corresponding to multiple time points in the time series data to be detected to obtain error scores for the multiple time points;

根据预设的异常值，确定异常分值数组；所述预设的异常分值根据预设的统计方法对所述误差分值处理得到；Determine an array of abnormal scores according to preset abnormal values; the preset abnormal scores are obtained by processing the error scores according to a preset statistical method;

确定所述异常分值数组对应的序列为异常序列。Determine that the sequence corresponding to the abnormal score array is an abnormal sequence.

在其中的一些实施例中，所述根据预设序列长度的滑动窗口对所述异常序列中的异常分值进行评估，确定目标晶圆设备异常数据，包括：In some embodiments, evaluating the abnormal scores in the abnormal sequence according to the sliding window of the preset sequence length to determine the abnormal data of the target wafer equipment includes:

根据预设序列长度的滑动窗口，对所述异常序列进行划分，得到多个子序列；Dividing the abnormal sequence according to a sliding window of a preset sequence length to obtain multiple subsequences;

确定所述子序列中超过预设的统计值的异常分值为晶圆设备异常点，确定所述子序列中除所述晶圆设备异常点外的异常分值为晶圆设备正常点；Determine the abnormal scores exceeding the preset statistical value in the subsequence as abnormal points of wafer equipment, and determine the abnormal scores other than the abnormal points of wafer equipment in the subsequence as normal points of wafer equipment;

为所述子序列中的所述晶圆设备异常点设置为第一标签，为所述晶圆设备正常点设置第二标签；Setting a first label for the abnormal point of the wafer equipment in the subsequence, and setting a second label for the normal point of the wafer equipment;

确定所述第一标签对应的晶圆设备异常点的数据为晶圆设备异常数据。The data of the abnormal point of the wafer device corresponding to the first tag is determined as the abnormal data of the wafer device.

在其中的一些实施例中，所述根据预设序列长度的滑动窗口对所述异常序列中的异常分值进行评估，确定目标晶圆设备异常数据，还包括：In some embodiments, evaluating the abnormal scores in the abnormal sequence according to the sliding window of the preset sequence length to determine the abnormal data of the target wafer device further includes:

根据所述滑动窗口，对所述异常序列进行划分，得到第一晶圆序列和第二晶圆序列；所述第一晶圆序列和第二晶圆序列分别为相邻的子序列；According to the sliding window, the abnormal sequence is divided to obtain a first wafer sequence and a second wafer sequence; the first wafer sequence and the second wafer sequence are adjacent subsequences respectively;

对所述第一晶圆序列中的第一异常分值和所述第二晶圆序列中的第二异常分值进行差除操作，得到目标序列异常分值；所述第一异常分值和所述第二异常分值包括异常序列中分值排序在前的异常分值；Performing a difference operation on a first abnormal score in the first wafer sequence and a second abnormal score in the second wafer sequence to obtain a target sequence abnormal score; the first abnormal score and the second abnormal score include an abnormal score ranked first in the abnormal sequence;

当所述目标序列异常分值未超过预设的分值阈值时，为所述第一晶圆序列中的异常分值设置第二标签；When the abnormal score of the target sequence does not exceed a preset score threshold, setting a second label for the abnormal score in the first wafer sequence;

确定除设置为第二标签外的异常分值为目标晶圆设备异常数据。Determine the abnormal scores other than those set as the second label as target wafer device abnormal data.

第二个方面，在本实施例中提供了一种基于晶圆制造的异常检测装置，所述装置包括：数据获取模块、误差确定模块、数据评估模块；In a second aspect, in this embodiment, a wafer manufacturing-based abnormality detection device is provided, the device comprising: a data acquisition module, an error determination module, and a data evaluation module;

所述数据获取模块，用于获取待检测时序数据；所述待检测时序数据包括晶圆设备的传感器数据；The data acquisition module is used to acquire the time series data to be detected; the time series data to be detected includes sensor data of the wafer equipment;

所述误差确定模块，用于根据预先训练好的目标深度学习模型，确定所述待检测时序数据对应的重构误差；所述目标深度学习模型根据预设的对抗学习方法训练得到；The error determination module is used to determine the reconstruction error corresponding to the time series data to be detected according to a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method;

所述数据评估模块，用于根据所述重构误差确定异常序列，并根据预设序列长度的滑动窗口对所述异常序列中的异常分值进行评估，确定目标晶圆设备异常数据。The data evaluation module is used to determine an abnormal sequence according to the reconstruction error, and evaluate the abnormal score in the abnormal sequence according to a sliding window of a preset sequence length to determine the abnormal data of the target wafer equipment.

第三个方面，在本实施例中提供了一种电子装置，包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的计算机程序，所述处理器执行所述计算机程序时实现上述第一个方面所述的基于晶圆制造的异常检测方法。In a third aspect, an electronic device is provided in this embodiment, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the computer program, the wafer manufacturing-based anomaly detection method described in the first aspect is implemented.

第四个方面，在本实施例中提供了一种存储介质，其上存储有计算机程序，该程序被处理器执行时实现上述第一个方面所述的基于晶圆制造的异常检测方法。In a fourth aspect, in this embodiment, a storage medium is provided, on which a computer program is stored, and when the program is executed by a processor, the abnormality detection method based on wafer manufacturing described in the first aspect above is implemented.

与相关技术相比，在本实施例中提供的一种基于晶圆制造的异常检测方法、装置和电子装置，通过对训练数据进行预处理，进而根据预处理后的训练数据，并基于双向限制对抗学习，训练得到目标深度学习模型，对待检测时序数据进行处理，有利于捕捉和识别多种类型的异常情况，进而提高对待检测时序数据的异常检测的精确度和效率。其后，得到待检测时序数据对应的重构误差，并通过滑动窗口对待检测时序数据进行实时监控，进而提高了对待检测时序数据中的异常数据的识别效率。Compared with the related art, the wafer manufacturing-based abnormality detection method, device and electronic device provided in this embodiment preprocesses the training data, and then trains the target deep learning model based on the preprocessed training data and bidirectional restricted adversarial learning to process the time series data to be detected, which is conducive to capturing and identifying various types of abnormal situations, thereby improving the accuracy and efficiency of abnormality detection of the time series data to be detected. Afterwards, the reconstruction error corresponding to the time series data to be detected is obtained, and the time series data to be detected is monitored in real time through a sliding window, thereby improving the recognition efficiency of abnormal data in the time series data to be detected.

本申请的一个或多个实施例的细节在以下附图和描述中提出，以使本申请的其他特征、目的和优点更加简明易懂。The details of one or more embodiments of the present application are set forth in the following drawings and description to make other features, objects, and advantages of the present application more readily apparent.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

此处所说明的附图用来提供对本申请的进一步理解，构成本申请的一部分，本申请的示意性实施例及其说明用于解释本申请，并不构成对本申请的不当限定。在附图中：The drawings described herein are used to provide a further understanding of the present application and constitute a part of the present application. The illustrative embodiments of the present application and their descriptions are used to explain the present application and do not constitute an improper limitation on the present application. In the drawings:

图1是本实施例提供的基于晶圆制造的异常检测方法的终端的硬件结构框图；FIG1 is a hardware structure block diagram of a terminal of an abnormality detection method based on wafer manufacturing provided in this embodiment;

图2是本申请实施例提供的基于晶圆制造的异常检测方法的流程图；FIG2 is a flow chart of an abnormality detection method based on wafer manufacturing provided in an embodiment of the present application;

图3是本具体实施例提供的基于晶圆制造的异常检测方法的流程图；FIG3 is a flow chart of an abnormality detection method based on wafer manufacturing provided in this specific embodiment;

图4是本具体实施例提供的深度学习模型的结构图；FIG4 is a structural diagram of a deep learning model provided in this specific embodiment;

图5是本实施例的基于晶圆制造的异常检测装置的结构框图。FIG5 is a structural block diagram of the abnormality detection device based on wafer manufacturing according to this embodiment.

具体实施方式Detailed ways

为更清楚地理解本申请的目的、技术方案和优点，下面结合附图和实施例，对本申请进行了描述和说明。In order to more clearly understand the purpose, technical solutions and advantages of the present application, the present application is described and illustrated below in conjunction with the accompanying drawings and embodiments.

除另作定义外，本申请所涉及的技术术语或者科学术语应具有本申请所属技术领域具备一般技能的人所理解的一般含义。在本申请中的“一”、“一个”、“一种”、“该”、“这些”等类似的词并不表示数量上的限制，它们可以是单数或者复数。在本申请中所涉及的术语“包括”、“包含”、“具有”及其任何变体，其目的是涵盖不排他的包含；例如，包含一系列步骤或模块（单元）的过程、方法和系统、产品或设备并未限定于列出的步骤或模块（单元），而可包括未列出的步骤或模块（单元），或者可包括这些过程、方法、产品或设备固有的其他步骤或模块（单元）。在本申请中所涉及的“连接”、“相连”、“耦接”等类似的词语并不限定于物理的或机械连接，而可以包括电气连接，无论是直接连接还是间接连接。在本申请中所涉及的“多个”是指两个或两个以上。“和/或”描述关联对象的关联关系，表示可以存在三种关系，例如，“A和/或B”可以表示：单独存在A，同时存在A和B，单独存在B这三种情况。通常情况下，字符“/”表示前后关联的对象是一种“或”的关系。在本申请中所涉及的术语“第一”、“第二”、“第三”等，只是对相似对象进行区分，并不代表针对对象的特定排序。Unless otherwise defined, the technical terms or scientific terms involved in this application shall have the general meaning understood by people with general skills in the technical field to which this application belongs. The words "one", "a", "the", "these" and the like in this application do not indicate a quantitative limitation, and they may be singular or plural. The terms "include", "comprise", "have" and any variants thereof involved in this application are intended to cover non-exclusive inclusions; for example, a process, method and system, product or device comprising a series of steps or modules (units) is not limited to the listed steps or modules (units), but may include unlisted steps or modules (units), or may include other steps or modules (units) inherent to these processes, methods, products or devices. The words "connect", "connected", "coupled" and the like involved in this application are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. The "multiple" involved in this application refers to two or more. "And/or" describes the association relationship of associated objects, indicating that there may be three relationships, for example, "A and/or B" may mean: A exists alone, A and B exist at the same time, and B exists alone. Generally, the character "/" indicates that the objects associated with each other are in an "or" relationship. The terms "first", "second", "third", etc. involved in this application are only used to distinguish similar objects and do not represent a specific ordering of the objects.

晶圆制造是半导体制造过程中的重要环节，其主要目的是将硅片加工成可用于集成电路的晶圆，晶圆制造过程涉及多个复杂的工序和技术，包括硅锭生长、切割芯片、热处理以及后续的晶圆处理工序等。在晶圆处理工序中，会在晶圆上制作电路及电子元件，如晶体管、电容、逻辑开关等。这一过程通常包括清洗晶圆、表面氧化、化学气相沉积、涂膜、曝光、显影、蚀刻、离子植入、金属溅镀等步骤，以完成数层电路及元件的加工与制作。Wafer manufacturing is an important part of the semiconductor manufacturing process. Its main purpose is to process silicon wafers into wafers that can be used for integrated circuits. The wafer manufacturing process involves multiple complex processes and technologies, including silicon ingot growth, chip cutting, heat treatment, and subsequent wafer processing. During the wafer processing process, circuits and electronic components such as transistors, capacitors, and logic switches are made on the wafer. This process usually includes steps such as cleaning the wafer, surface oxidation, chemical vapor deposition, coating, exposure, development, etching, ion implantation, and metal sputtering to complete the processing and production of several layers of circuits and components.

晶圆制造的复杂性和动态性，导致现有的特征值信息提取方法存在以下问题：1、有些异常模式在统计上并不显著，对这种模式的故障检测不够准确。2、不同的工艺配方，需要有经验的工程师去创建不同的模型。3、工艺配方的变化需要花费大量的时间去更新模型。4、部分工艺步骤时长过久，特征值提取不够及时。The complexity and dynamics of wafer manufacturing lead to the following problems in existing eigenvalue information extraction methods: 1. Some abnormal patterns are not statistically significant, and fault detection for such patterns is not accurate enough. 2. Different process recipes require experienced engineers to create different models. 3. Changes in process recipes require a lot of time to update the model. 4. Some process steps take too long, and eigenvalue extraction is not timely enough.

由于半导体芯片的复杂性和精密度，传统的基于规则或统计的检测方法往往无法满足精确性和效率的要求。在晶圆制造过程中，不同传感器的原始数据呈现多样化，芯片工程师很难对不同传感器的单个时间点是否异常进行标记与判断。随着机器学习、深度学习等人工智能算法的发展，不需要标签的无监督学习算法成为目前故障检测算法的主流。现有技术中对故障进行检测分类的方法包括：基于均值和标准差的方法、基于概率模型的方法、基于距离的方法以及基于聚类的方法等；其中，基于均值和标准差的方法，简单易实现且计算效率较高；但是对于非高斯分布的数据的处理效果较差，且对于离群点的鲁棒性较差。基于概率模型的方法，例如高斯和模型，这种方法能够建模复杂的数据分布，并且考虑了特征之间的相关性；但是对数据分布的假设要求较高；计算复杂度较高。基于距离的方法，例如k值最近邻算法，不依赖于数据分布的先验知识；可以适应不同类型的数据；但是计算复杂度高；对于高维数据集不适用。基于聚类的方法：能够发现数据集中的密度区域，并将稀疏区域视为异常；但是对于高维数据和复杂数据分布的计算开销较大。Due to the complexity and precision of semiconductor chips, traditional rule-based or statistical detection methods often cannot meet the requirements of accuracy and efficiency. In the wafer manufacturing process, the raw data of different sensors are diverse, and it is difficult for chip engineers to mark and judge whether a single time point of different sensors is abnormal. With the development of artificial intelligence algorithms such as machine learning and deep learning, unsupervised learning algorithms that do not require labels have become the mainstream of fault detection algorithms. The methods for detecting and classifying faults in the prior art include: methods based on mean and standard deviation, methods based on probability models, methods based on distance, and methods based on clustering, etc. Among them, the methods based on mean and standard deviation are simple and easy to implement and have high computational efficiency; however, the processing effect for non-Gaussian distributed data is poor, and the robustness to outliers is poor. Methods based on probability models, such as Gaussian sum models, can model complex data distributions and take into account the correlation between features; however, they have high requirements for assumptions about data distribution; and have high computational complexity. Distance-based methods, such as the k-value nearest neighbor algorithm, do not rely on prior knowledge of data distribution; can adapt to different types of data; but have high computational complexity; and are not applicable to high-dimensional data sets. Clustering-based methods: can find dense areas in the data set and treat sparse areas as anomalies; however, the computational overhead is high for high-dimensional data and complex data distribution.

基于现有技术中存在的问题，本申请对晶圆制造过程中的异常检测采用深度学习算法。其中，深度学习算法具有强大的自动学习和表征能力，可以从大量的数据中学习并识别出复杂的异常模式。一旦构建和训练好的模型得到部署，通过以特定频率自动化采集设备的传感器参数，就可以实时监测晶圆制造过程中的数据流变化，进而识别出与正常工作模式不符的模式，并及时向操作人员或自动化系统发出警报，以便采取纠正措施。这种快速响应和处理异常的能力可以有效地防范故障对生产造成的影响，大大提高芯片生产的良率。Based on the problems existing in the prior art, this application adopts a deep learning algorithm for anomaly detection in the wafer manufacturing process. Among them, the deep learning algorithm has powerful automatic learning and characterization capabilities, and can learn and identify complex abnormal patterns from a large amount of data. Once the constructed and trained model is deployed, by automatically collecting the sensor parameters of the equipment at a specific frequency, the data flow changes in the wafer manufacturing process can be monitored in real time, and then the patterns that do not match the normal working mode can be identified, and the operator or the automation system can be promptly alerted so that corrective measures can be taken. This ability to respond quickly and handle anomalies can effectively prevent the impact of failures on production and greatly improve the yield of chip production.

在本实施例中提供的方法实施例可以在终端、计算机或者类似的运算装置中执行。比如在终端上运行，图1是本实施例提供的基于晶圆制造的异常检测方法的终端的硬件结构框图。如图1所示，终端可以包括一个或多个（图1中仅示出一个）处理器102和用于存储数据的存储器104，其中，处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置。上述终端还可以包括用于通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解，图1所示的结构仅为示意，其并不对上述终端的结构造成限制。例如，终端还可包括比图1中所示更多或者更少的组件，或者具有与图1所示出的不同配置。The method embodiment provided in this embodiment can be executed in a terminal, a computer or a similar computing device. For example, when running on a terminal, FIG1 is a hardware structure block diagram of a terminal of an abnormality detection method based on wafer manufacturing provided in this embodiment. As shown in FIG1 , the terminal may include one or more (only one is shown in FIG1 ) processors 102 and a memory 104 for storing data, wherein the processor 102 may include but is not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA. The above-mentioned terminal may also include a transmission device 106 and an input and output device 108 for communication functions. It can be understood by those skilled in the art that the structure shown in FIG1 is only for illustration and does not limit the structure of the above-mentioned terminal. For example, the terminal may also include more or fewer components than those shown in FIG1 , or have a different configuration than that shown in FIG1 .

存储器104可用于存储计算机程序，例如，应用软件的软件程序以及模块，如在本实施例中的基于晶圆制造的异常检测方法对应的计算机程序，处理器102通过运行存储在存储器104内的计算机程序，从而执行各种功能应用以及数据处理，即实现上述的方法。存储器104可包括高速随机存储器，还可包括非易失性存储器，如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中，存储器104可进一步包括相对于处理器102远程设置的存储器，这些远程存储器可以通过网络连接至终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the abnormality detection method based on wafer manufacturing in this embodiment. The processor 102 executes various functional applications and data processing by running the computer program stored in the memory 104, that is, to implement the above method. The memory 104 may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include a memory remotely arranged relative to the processor 102, and these remote memories may be connected to the terminal via a network. Examples of the above-mentioned network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and a combination thereof.

传输设备106用于经由一个网络接收或者发送数据。上述的网络包括终端的通信供应商提供的无线网络。在一个实例中，传输设备106包括一个网络适配器（NetworkInterface Controller，简称为NIC），其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中，传输设备106可以为射频（RadioFrequency，简称为RF）模块，其用于通过无线方式与互联网进行通讯。The transmission device 106 is used to receive or send data via a network. The above network includes a wireless network provided by the communication provider of the terminal. In one example, the transmission device 106 includes a network adapter (Network Interface Controller, referred to as NIC), which can be connected to other network devices through a base station so as to communicate with the Internet. In one example, the transmission device 106 can be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet wirelessly.

在本实施例中提供了一种基于晶圆制造的异常检测方法，以特定频率自动化采集晶圆制造设备的传感器参数，通过使用本发明提出的深度学习算法实时监测数据的变化，将准确及时的异常检测结果及时报警并反馈给产线，可以提醒芯片工程师或自动化系统对生产过程进行干预，有效地防范故障对生产造成的影响，提高芯片生产的良率。图2是本申请实施例提供的基于晶圆制造的异常检测方法的流程图，如图2所示，该流程包括如下步骤：In this embodiment, a wafer manufacturing-based anomaly detection method is provided, which automatically collects sensor parameters of wafer manufacturing equipment at a specific frequency, monitors data changes in real time by using the deep learning algorithm proposed in the present invention, and promptly alarms and feeds back accurate and timely anomaly detection results to the production line, which can remind chip engineers or automation systems to intervene in the production process, effectively prevent the impact of faults on production, and improve the yield of chip production. Figure 2 is a flow chart of the wafer manufacturing-based anomaly detection method provided in an embodiment of the present application. As shown in Figure 2, the process includes the following steps:

步骤S210，获取待检测时序数据；待检测时序数据包括晶圆设备的传感器数据。Step S210, obtaining timing data to be detected; the timing data to be detected includes sensor data of the wafer equipment.

其中，在晶圆制造的工艺节点和设备上设置传感器，处理器根据预设频率采集传感器数据，即为待检测时序数据。示例性地，采集得到的传感器数据，即待检测时序数据涵盖了温度、压力、流量、浓度、速度等多种参数，反映了制造过程中的实时状态。通过采集晶圆的传感器数据，能够实时监控晶圆制造过程的状态，及时发现潜在问题。具体地，采集待检测时序数据的方法可以包括：有线连接采集、无线连接采集以及边缘计算采集等。Among them, sensors are set on the process nodes and equipment of wafer manufacturing, and the processor collects sensor data according to a preset frequency, which is the time series data to be detected. Exemplarily, the collected sensor data, that is, the time series data to be detected, covers multiple parameters such as temperature, pressure, flow, concentration, speed, etc., reflecting the real-time status of the manufacturing process. By collecting sensor data of the wafer, the status of the wafer manufacturing process can be monitored in real time and potential problems can be discovered in time. Specifically, the method of collecting time series data to be detected may include: wired connection collection, wireless connection collection, and edge computing collection.

步骤S220，根据预先训练好的目标深度学习模型，确定待检测时序数据对应的重构误差；目标深度学习模型根据预设的对抗学习方法训练得到。Step S220, determining the reconstruction error corresponding to the time series data to be detected based on a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method.

其中，处理器通过预设的对抗学习方法训练得到目标深度学习模型。具体地，根据预设的对抗学习方法对基于晶圆制造传感器波形的深度学习模型进行训练，得到最优模型参数，即为目标深度学习模型。具体地，获取晶圆制造过程中传感器的训练时序数据，并通过预设的对抗学习方法对训练时序数据进行重构以及异常识别。其中预设的对抗学习方法中包括生成器和判别器，通过生成器和判别器对训练数据进行交替优化，得到目标深度学习模型。将待检测时序数据输入至预先训练完成的目标深度学习模型中，通过目标深度学习模型中基于对抗学习方法的生成模块，对待检测时序数据进行处理，得到重构序列；其后，通过目标深度学习模型中基于对抗学习方法的判别模块，确定重构序列以及输入序列，并计算出在每个时间点，重构序列和输入序列之间的重构误差，进而通过重构误差进行异常识别。示例性地，通过采用L2正则化的方法计算重构序列和输入序列之间的每个点的误差；其中L2正则化的方法包括：在损失函数中增加一个正则化项，即损失函数等于原始损失加上λ乘以模型参数的平方和。其中，λ是正则化参数，用于控制正则化的强度。Among them, the processor obtains the target deep learning model through the preset adversarial learning method training. Specifically, the deep learning model based on the wafer manufacturing sensor waveform is trained according to the preset adversarial learning method to obtain the optimal model parameters, which is the target deep learning model. Specifically, the training time series data of the sensor in the wafer manufacturing process is obtained, and the training time series data is reconstructed and anomalies are identified through the preset adversarial learning method. The preset adversarial learning method includes a generator and a discriminator, and the training data is alternately optimized by the generator and the discriminator to obtain the target deep learning model. The time series data to be detected is input into the pre-trained target deep learning model, and the time series data to be detected is processed by the generation module based on the adversarial learning method in the target deep learning model to obtain a reconstructed sequence; thereafter, the reconstructed sequence and the input sequence are determined by the discriminant module based on the adversarial learning method in the target deep learning model, and the reconstruction error between the reconstructed sequence and the input sequence is calculated at each time point, and then the anomaly is identified through the reconstruction error. Exemplarily, the error of each point between the reconstructed sequence and the input sequence is calculated by adopting the L2 regularization method; wherein the L2 regularization method includes: adding a regularization term to the loss function, that is, the loss function is equal to the original loss plus λ multiplied by the square sum of the model parameters. Wherein λ is a regularization parameter used to control the strength of regularization.

步骤S230，根据重构误差确定异常序列，并根据预设序列长度的滑动窗口对异常序列中的异常分值进行评估，确定目标晶圆设备异常数据。Step S230, determining an abnormal sequence according to the reconstruction error, and evaluating the abnormal scores in the abnormal sequence according to a sliding window of a preset sequence length to determine abnormal data of the target wafer device.

其中，处理器在根据目标深度学习模型获取重构误差之后，根据重构误差确定待检测时序数据中的异常序列。具体地，通过对重构误差进行归一化处理，得到归一化得分，并根据归一化得分确定异常序列。示例性地，每个时间点的归一化得分在0-1之间的序列即为异常序列。在确定异常序列后，引入预设序列长度的滑动窗口将异常序列中异常分值数组划分为多个信号段，得到多条窗口数据，并根据四分位统计方法对异常序列中多条窗口数据的异常分值进行评估，进而得到目标晶圆设备异常数据。Among them, after the processor obtains the reconstruction error according to the target deep learning model, the processor determines the abnormal sequence in the time series data to be detected according to the reconstruction error. Specifically, by normalizing the reconstruction error, a normalized score is obtained, and the abnormal sequence is determined according to the normalized score. Exemplarily, a sequence with a normalized score between 0 and 1 at each time point is an abnormal sequence. After determining the abnormal sequence, a sliding window of a preset sequence length is introduced to divide the abnormal score array in the abnormal sequence into multiple signal segments, and multiple window data are obtained. The abnormal scores of multiple window data in the abnormal sequence are evaluated according to the quartile statistical method, and then the target wafer equipment abnormal data is obtained.

通过上述步骤，通过基于双向限制对抗学习的目标深度学习模型，对待检测时序数据进行处理，有利于捕捉和识别多种类型的异常情况，进而提高对待检测时序数据的异常检测的精确度和效率。其后，得到待检测时序数据对应的重构误差，并通过滑动窗口对待检测时序数据进行实时监控，进而提高了对待检测时序数据中的异常数据的识别效率。Through the above steps, the target deep learning model based on bidirectional restricted adversarial learning is used to process the time series data to be detected, which is conducive to capturing and identifying various types of abnormal situations, thereby improving the accuracy and efficiency of abnormal detection of the time series data to be detected. Afterwards, the reconstruction error corresponding to the time series data to be detected is obtained, and the time series data to be detected is monitored in real time through a sliding window, thereby improving the recognition efficiency of abnormal data in the time series data to be detected.

在其中的一些实施例中，步骤S210获取待检测时序数据之前包括：步骤S201至步骤S203。In some of the embodiments, before step S210 of acquiring the time series data to be detected, the steps include: step S201 to step S203.

步骤S201，获取晶圆制造过程中的多个训练时序数据，对多个训练时序数据进行处理，得到多个统一训练时序数据。Step S201, acquiring a plurality of training time series data in a wafer manufacturing process, and processing the plurality of training time series data to obtain a plurality of unified training time series data.

其中，在训练目标深度学习模型时，处理器获取晶圆制造过程中的多个训练时序数据。示例性地，多个训练时序数据为实时获取最新的时序数据，确保了训练数据的实时性，进一步提高了后续得到目标深度学习模型的准确性。其后，对多个训练时序数据进行处理，具体地，对训练时序数据进行处理的方法包括：检测多个训练时序数据的时间戳以及数据值是否有缺失，若有，则对缺失时间戳和数据值进行补全，进一步地，根据前一训练时序数据对缺失的时间戳和数据值进行补全，或是计算多个训练时序数据中数据值的众数，进而根据众数对缺失的数据值进行补全。处理器对多个训练时序数据进行补全处理后，对补全后的多个训练时序数据按照预设的时间戳进行统一时间长度的截取操作，进而获取多个统一训练时序数据。Among them, when training the target deep learning model, the processor obtains multiple training time series data in the wafer manufacturing process. Exemplarily, multiple training time series data are the latest time series data obtained in real time, which ensures the real-time nature of the training data and further improves the accuracy of the subsequent target deep learning model. Thereafter, multiple training time series data are processed. Specifically, the method for processing the training time series data includes: detecting whether the timestamps and data values of multiple training time series data are missing, and if so, the missing timestamps and data values are supplemented, and further, the missing timestamps and data values are supplemented according to the previous training time series data, or the mode of the data values in the multiple training time series data is calculated, and then the missing data values are supplemented according to the mode. After the processor performs the supplementation processing on the multiple training time series data, the multiple supplemented training time series data are intercepted with a unified time length according to the preset timestamps, thereby obtaining multiple unified training time series data.

步骤S202，确定多个统一训练时序数据中的密集序列数据；密集序列数据包括符合正态分布的序列数据。Step S202, determining dense sequence data in a plurality of unified training time series data; the dense sequence data includes sequence data that conforms to a normal distribution.

其中，处理器在获取多个统一训练时序数据后，对多条时序数据进行归一化处理的操作，获取符合正态分布的序列数据。示例性地，通过3Sigma统计方法，进而有利于将统一训练时序数据的范围限制在正态分布的三倍标准差之内，进而排除异常序列数据，得到密集序列数据，通过密集序列数据对预设的深度学习模型进行训练，进而有利于提高目标深度学习模型的准确性。Among them, after obtaining multiple unified training time series data, the processor performs normalization processing on the multiple time series data to obtain sequence data that conforms to the normal distribution. Exemplarily, through the 3Sigma statistical method, it is beneficial to limit the range of the unified training time series data to within three times the standard deviation of the normal distribution, thereby excluding abnormal sequence data and obtaining dense sequence data. The preset deep learning model is trained through the dense sequence data, which is beneficial to improve the accuracy of the target deep learning model.

步骤S203，根据密集序列数据，以及预设的对抗学习方法，对预设的深度学习模型进行训练，确定目标深度学习模型；预设的深度学习模型基于晶圆传感器的波形构成。Step S203, training a preset deep learning model according to the dense sequence data and a preset adversarial learning method to determine a target deep learning model; the preset deep learning model is based on the waveform of the wafer sensor.

其中，处理器获取多个密集序列数据后，基于预设的对抗学习方法，将密集序列数据输入预设的深度学习模型中进行训练。进一步地，预设的深度学习模型中预设的对抗学习方法包括生成模块和判别模块，每个模块由两到三层的全连接层进行连接得到。输入密集序列数据至生成模块和判别模块，以对预设的深度学习模型进行训练。当模型训练完成后，生成四个模型，即目标深度学习模型包括：编码器、解码器、真假判别器、测量判别器。具体地，编码器和解码器是两个生成器；编码器将时间序列映射到潜在空间，而解码器将潜在空间转换到重建的时间序列；真假判别器用来区分输入的序列是真实的待检测时间序列还是解码器生成的虚假时间序列，测量判别器用来测量编码器将输入序列映射到潜在空间的有效性。进一步地，处理器每隔一段时间采集多个训练时序数据，并输入至深度学习模型进行训练；其后，通过生成模块和判别模块进行迭代优化，进而得到目标深度学习模型。Among them, after the processor obtains multiple dense sequence data, based on the preset adversarial learning method, the dense sequence data is input into the preset deep learning model for training. Further, the preset adversarial learning method in the preset deep learning model includes a generation module and a discrimination module, each module is connected by two to three layers of fully connected layers. Input dense sequence data to the generation module and the discrimination module to train the preset deep learning model. When the model training is completed, four models are generated, that is, the target deep learning model includes: encoder, decoder, true and false discriminator, and measurement discriminator. Specifically, the encoder and the decoder are two generators; the encoder maps the time series to the latent space, and the decoder converts the latent space to the reconstructed time series; the true and false discriminator is used to distinguish whether the input sequence is a real time series to be detected or a false time series generated by the decoder, and the measurement discriminator is used to measure the effectiveness of the encoder mapping the input sequence to the latent space. Further, the processor collects multiple training time series data at intervals and inputs them into the deep learning model for training; thereafter, it is iteratively optimized through the generation module and the discrimination module to obtain the target deep learning model.

通过上述步骤，在训练目标深度学习模型时，采集训练时序数据，并对训练时序数据进行处理，并根据统计方法得到符合正态分布的密集序列数据，进而通过密集序列数据对深度学习模型进行训练，进而得到目标深度学习模型。通过对训练时序数据进行补全、截断以及归一化等操作，得到准确度更高的训练数据，进而有利于提高训练得到的目标深度学习模型的精确性。Through the above steps, when training the target deep learning model, the training time series data is collected and processed, and dense sequence data that conforms to the normal distribution is obtained according to the statistical method, and then the deep learning model is trained by the dense sequence data to obtain the target deep learning model. By completing, truncating and normalizing the training time series data, more accurate training data is obtained, which is conducive to improving the accuracy of the target deep learning model obtained by training.

在其中的一些实施例中，步骤S220中根据预先训练好的目标深度学习模型，确定待检测时序数据对应的重构误差之前，包括：根据预设的采样频率在晶圆的传感器上获取初始时序数据，根据密集序列数据，对初始时序数据进行处理，得到待检测时序数据。In some of the embodiments, before determining the reconstruction error corresponding to the time series data to be detected based on the pre-trained target deep learning model in step S220, it includes: acquiring initial time series data on the sensor of the wafer according to a preset sampling frequency, processing the initial time series data according to the dense sequence data, and obtaining the time series data to be detected.

其中，处理器根据预设的采样频率自动化采集晶圆制造过程中设备上的传感器数据，即为初始时序数据；判断初始时序数据的数据长度是否小于密集序列数据的长度，若是，则根据密集序列数据的长度对初始时序数据的长度进行补齐。其后，对初始时序数据的缺失时间戳和数据值进行检测和补全，并对数据补全后的初始时序数据进行归一化处理，得到待检测时序数据。Among them, the processor automatically collects sensor data on the equipment in the wafer manufacturing process according to the preset sampling frequency, which is the initial time series data; it determines whether the data length of the initial time series data is less than the length of the dense sequence data. If so, the length of the initial time series data is padded according to the length of the dense sequence data. Afterwards, the missing timestamps and data values of the initial time series data are detected and padded, and the initial time series data after data padded is normalized to obtain the time series data to be detected.

通过上述步骤，对待检测的初始时序数据进行数据预处理操作，得到待检测时序数据，进而实现了待检测时序数据与训练时序数据的一致性，有利于提高后续通过目标深度学习模型对待检测时序数据处理的效率和精确度。Through the above steps, data preprocessing operations are performed on the initial time series data to be detected to obtain the time series data to be detected, thereby achieving consistency between the time series data to be detected and the training time series data, which is beneficial to improving the efficiency and accuracy of subsequent processing of the time series data to be detected through the target deep learning model.

在其中的一些实施例中，步骤S220包括：步骤S221至步骤S223。In some embodiments, step S220 includes: step S221 to step S223.

步骤S221，将待检测时序数据映射至预设的潜在空间，得到编码向量。Step S221, mapping the time series data to be detected to a preset latent space to obtain a coding vector.

具体地，处理器将待检测时序数据加载至目标深度学习模型中后，通过目标深度学习模型中的生成模块，具体为生成模块中的编码器，将待检测时序数据映射至潜在空间，得到待检测时序数据对应的编码向量。进一步地，通过目标深度学习模型中的判别模块，具体为判别模块中的测量判别器，测量编码器将待检测时序数据映射至潜在空间的有效性。Specifically, after the processor loads the time series data to be detected into the target deep learning model, the time series data to be detected is mapped to the latent space through the generation module in the target deep learning model, specifically the encoder in the generation module, to obtain the encoding vector corresponding to the time series data to be detected. Further, the effectiveness of the encoder in mapping the time series data to be detected to the latent space is measured through the discriminator module in the target deep learning model, specifically the measurement discriminator in the discriminator module.

步骤S222，当编码向量在预设的阈值范围内时，对编码向量进行处理，并生成重构序列。Step S222: When the coding vector is within a preset threshold range, the coding vector is processed and a reconstructed sequence is generated.

具体地，处理器判断编码向量是否在预设的阈值范围内，若是，则通过目标深度学习模型中的生成模块，具体为生成模块中的解码器，将编码向量转换至重建的时间序列，即生成重构序列。Specifically, the processor determines whether the coding vector is within a preset threshold range. If so, the coding vector is converted into a reconstructed time series through the generation module in the target deep learning model, specifically the decoder in the generation module, that is, a reconstructed sequence is generated.

步骤S223，将重构序列的数据与待检测时序数据之间的差值确定为重构误差。Step S223: determine the difference between the data of the reconstructed sequence and the time series data to be detected as the reconstruction error.

其中，处理器通过目标深度学习模型中的判别模块，具体为判别模块中的真假判别器区分重构序列的数据与待检测时序数据；其后计算在每个时间点重构序列的数据与待检测时序数据之间的差值，即为重构误差。具体地，通过L2正则化的方法计算在每个时间点重构序列的数据与待检测时序数据之间的差值，有助于避免目标深度学习模型对训练数据中的噪声或异常分值过于敏感的问题，从而提高目标深度学习模型的泛化能力。Among them, the processor distinguishes the data of the reconstructed sequence from the time series data to be detected through the discrimination module in the target deep learning model, specifically the true and false discriminator in the discrimination module; then calculates the difference between the data of the reconstructed sequence and the time series data to be detected at each time point, which is the reconstruction error. Specifically, calculating the difference between the data of the reconstructed sequence and the time series data to be detected at each time point by the L2 regularization method helps to avoid the problem that the target deep learning model is too sensitive to noise or abnormal scores in the training data, thereby improving the generalization ability of the target deep learning model.

通过上述步骤，通过目标深度学习模型中的生成模块和判别模块，对待检测时序数据进行映射、重构以及判断，进而提高了确定重构误差的准确性和效率。Through the above steps, the generation module and the discrimination module in the target deep learning model are used to map, reconstruct and judge the time series data to be detected, thereby improving the accuracy and efficiency of determining the reconstruction error.

在其中的一些实施例中，步骤S230中根据重构误差确定异常序列，包括：In some embodiments, determining the abnormal sequence according to the reconstruction error in step S230 includes:

步骤S231，对待检测时序数据中多个时间点对应的重构误差进行归一化处理，得到多个时间点的误差分值。Step S231 , normalizing the reconstruction errors corresponding to multiple time points in the time series data to be detected to obtain error scores for the multiple time points.

其中，当确定多个时间点对应的重构误差后，对多个重构误差进行归一化处理，得到多个重构误差对应的误差分值。After the reconstruction errors corresponding to the multiple time points are determined, the multiple reconstruction errors are normalized to obtain error scores corresponding to the multiple reconstruction errors.

步骤S232，根据预设的异常值，确定异常分值数组；预设的异常值根据预设的统计方法对误差分值处理得到；确定异常分值数组对应的序列为异常序列。Step S232, determining an abnormal score array according to a preset abnormal value; obtaining the preset abnormal value by processing the error score according to a preset statistical method; determining that the sequence corresponding to the abnormal score array is an abnormal sequence.

其中，处理器预设异常值，进而根据异常值确定多个误差分值中的异常分值数据。示例性地，异常分值数组中的异常值在0到1这个范围内。The processor presets an abnormal value, and then determines abnormal score data in a plurality of error scores according to the abnormal value. Exemplarily, the abnormal value in the abnormal score array is in the range of 0 to 1.

通过上述步骤，对重构误差进行归一化处理，确定重构误差对应的误差分值，进而根据误差分值确定异常分值数组，并确定异常分值数组对应的序列为异常序列，进而提高了确定异常序列的准确性。Through the above steps, the reconstruction error is normalized, the error score corresponding to the reconstruction error is determined, and then the abnormal score array is determined according to the error score, and the sequence corresponding to the abnormal score array is determined to be an abnormal sequence, thereby improving the accuracy of determining the abnormal sequence.

在其中的一些实施例中，步骤S230中根据预设序列长度的滑动窗口对异常序列中的异常分值进行评估，确定目标晶圆设备异常数据，包括：In some embodiments, in step S230, the abnormal scores in the abnormal sequence are evaluated according to a sliding window of a preset sequence length to determine the abnormal data of the target wafer device, including:

步骤S233，根据预设序列长度的滑动窗口，对异常序列进行划分，得到多个子序列。Step S233: divide the abnormal sequence according to a sliding window of a preset sequence length to obtain multiple subsequences.

其中，处理器预设序列长度的滑动窗口，并根据滑动窗口对异常序列进行滑动划分，得到多个子序列。进一步地，多个子序列为固定时间的窗口数据。The processor presets a sliding window of sequence length, and performs sliding division on the abnormal sequence according to the sliding window to obtain multiple subsequences. Further, the multiple subsequences are window data of fixed time.

步骤S234，确定子序列中超过预设的统计值的异常分值为晶圆设备异常点，确定子序列中除晶圆设备异常点外的异常分值为晶圆正常点；为子序列中的晶圆设备异常点设置为第一标签，为晶圆正常点设置第二标签。Step S234, determining the abnormal scores exceeding the preset statistical value in the subsequence as wafer equipment abnormal points, determining the abnormal scores other than the wafer equipment abnormal points in the subsequence as wafer normal points; setting the first label for the wafer equipment abnormal points in the subsequence, and setting the second label for the wafer normal points.

其中，处理器将子序列中超过预设的统计点的异常分值点为晶圆设备异常点。进一步地，预设的统计点根据四分位统计方法确定，在子序列中计算四分位统计值，四分位统计值即为预设的统计点。判断子序列中超过四分位统计值的异常分值，该异常分值对应晶圆设备异常点。当确定晶圆设备异常点后，为晶圆设备异常点设置第一标签，即异常标签；同时，将子序列中除晶圆设备异常点的晶圆正常点设置第二标签，即正常标签。示例性地，晶圆设备异常点的第一标签为1，晶圆正常点的第二标签为0。Among them, the processor regards the abnormal score points in the subsequence that exceed the preset statistical points as wafer equipment abnormal points. Furthermore, the preset statistical points are determined according to the quartile statistical method, and the quartile statistical values are calculated in the subsequence, and the quartile statistical values are the preset statistical points. The abnormal score that exceeds the quartile statistical value in the subsequence is judged, and the abnormal score corresponds to the wafer equipment abnormal point. After the wafer equipment abnormal point is determined, a first label, that is, an abnormal label, is set for the wafer equipment abnormal point; at the same time, a second label, that is, a normal label, is set for the normal wafer points in the subsequence except the wafer equipment abnormal point. Exemplarily, the first label of the wafer equipment abnormal point is 1, and the second label of the wafer normal point is 0.

步骤S235，确定第一标签对应的晶圆设备异常点的数据为目标晶圆设备异常数据。Step S235 , determining that the data of the abnormal point of the wafer device corresponding to the first tag is the abnormal data of the target wafer device.

通过上述步骤，通过滑动窗口将异常序列进行滑动划分得到多个子序列，并通过四分位统计方法确定多个子序列中的目标晶圆设备异常数据。通过滑动窗口的方式来评估目标晶圆设备异常数据，有利于增加异常数据的召回率，进一步有利于提高异常检测的精确性。Through the above steps, the abnormal sequence is divided into multiple subsequences by sliding the sliding window, and the target wafer device abnormal data in the multiple subsequences is determined by the quartile statistical method. Evaluating the target wafer device abnormal data by sliding the window is conducive to increasing the recall rate of abnormal data, and further conducive to improving the accuracy of abnormality detection.

在其中的一些实施例中，步骤S230中根据预设序列长度的滑动窗口对异常序列中的异常分值进行评估，确定目标晶圆设备异常数据的方法，还包括：In some embodiments, the method of evaluating the abnormal scores in the abnormal sequence according to the sliding window of the preset sequence length in step S230 to determine the abnormal data of the target wafer device further includes:

步骤S236，根据滑动窗口，对异常序列进行划分，得到第一晶圆序列和第二晶圆序列；第一晶圆序列和第二晶圆序列分别为相邻的子序列。Step S236: divide the abnormal sequence according to the sliding window to obtain a first wafer sequence and a second wafer sequence; the first wafer sequence and the second wafer sequence are adjacent subsequences respectively.

其中，处理器根据滑动窗口，将异常序列划分为多个相邻的第一晶圆序列和第二晶圆序列。The processor divides the abnormal sequence into a plurality of adjacent first wafer sequences and second wafer sequences according to the sliding window.

步骤S237，对第一晶圆序列中的第一异常分值和第二晶圆序列中的第二异常分值进行差除操作，得到目标序列异常分值；第一异常分值和第二异常分值包括异常序列中分值排序在前的异常分值。Step S237, performing a difference operation on the first abnormal score in the first wafer sequence and the second abnormal score in the second wafer sequence to obtain a target sequence abnormal score; the first abnormal score and the second abnormal score include the abnormal score ranked first in the abnormal sequence.

其中，处理器分别确定第一晶圆序列和第二晶圆序列中的最大异常分值，分别为第一异常分值和第二异常分值。对第一异常分值和第二异常分值进行差除操作，即得到第一异常分值与第二异常分值的差值，除以第一异常分值的值，即为目标序列异常分值。The processor determines the maximum abnormal score in the first wafer sequence and the second wafer sequence, which are the first abnormal score and the second abnormal score, respectively. The first abnormal score and the second abnormal score are subjected to a difference operation, that is, the difference between the first abnormal score and the second abnormal score is obtained, and the difference is divided by the value of the first abnormal score, which is the target sequence abnormal score.

步骤S238，当目标序列异常分值未超过预设的分值阈值时，为第一晶圆序列中的异常分值设置第二标签。确定除设置为第二标签外的异常分值为目标晶圆设备异常数据。Step S238: When the target sequence abnormality score does not exceed the preset score threshold, a second label is set for the abnormality score in the first wafer sequence, and the abnormality scores other than those set as the second label are determined as target wafer device abnormality data.

其中，处理器判断目标序列异常分值是否超过预设的分值阈值，当目标序列异常分值未超过预设的分值阈值时，可以理解为第一晶圆序列中的异常分值为正常分值，即为第一晶圆序列中的异常分值设置第二标签。处理器重新对异常序列中的异常分值进行更新处理，将除了设置为第二标签外的异常分值确定为目标晶圆设备异常数据。The processor determines whether the target sequence abnormal score exceeds a preset score threshold. When the target sequence abnormal score does not exceed the preset score threshold, it can be understood that the abnormal score in the first wafer sequence is a normal score, that is, a second label is set for the abnormal score in the first wafer sequence. The processor re-updates the abnormal score in the abnormal sequence and determines the abnormal score other than the one set as the second label as abnormal data of the target wafer device.

通过上述步骤，滑动窗口会增加异常数据的召回率，同样也会增加异常数据的误检率。因此，采用上述批判召回的方法，即通过差除操作对多个子序列中的异常分值的标签进行更新，有利于降低异常检测误检率，进而有利于提高异常检测的精确度。Through the above steps, the sliding window will increase the recall rate of abnormal data, but also increase the false detection rate of abnormal data. Therefore, the above critical recall method, that is, updating the labels of abnormal scores in multiple subsequences through difference operations, is conducive to reducing the false detection rate of abnormal detection, and thus helps to improve the accuracy of abnormal detection.

下面通过具体实施例对本实施例进行描述和说明。The present embodiment is described and illustrated by means of specific examples below.

图3是本具体实施例提供的基于晶圆制造的异常检测方法的流程图，如图3所示，该基于晶圆制造的异常检测方法包括如下步骤：FIG3 is a flow chart of an abnormality detection method based on wafer manufacturing provided in this specific embodiment. As shown in FIG3 , the abnormality detection method based on wafer manufacturing includes the following steps:

步骤S310，预处理训练数据并训练深度学习模型。Step S310, preprocessing training data and training the deep learning model.

具体地，处理器采集并加载晶圆制造过程中传感器A最新的多条时序数据，此处的多条时序数据即为前述实施例中的多个训练时序数据。其后，对多条时序数据的缺失时间戳和数据值进行检测与补全操作，并对多条时序数据做统一长度截断操作，并保存历史数据序列长度，此处的历史数据序列长度即为前述实施例中的密集序列数据。将这多条时序数据归一化后计算并保存历史密集数据均值序列。将上述加载针对半导体领域传感器波形精心构造的深度学习模型，包括两个生成器和两个判别器。两个生成器分别可以看作编码器和解码器。将收集完的密集序列批量输入模型，使用生成器和判别器交替优化，并保存训练好的最优模型参数。Specifically, the processor collects and loads the latest multiple time series data of sensor A during the wafer manufacturing process. The multiple time series data here are the multiple training time series data in the aforementioned embodiment. Thereafter, the missing timestamps and data values of the multiple time series data are detected and completed, and the multiple time series data are truncated to a unified length, and the length of the historical data sequence is saved. The length of the historical data sequence here is the dense sequence data in the aforementioned embodiment. After normalizing these multiple time series data, the mean sequence of historical dense data is calculated and saved. The above-mentioned deep learning model carefully constructed for the sensor waveform in the semiconductor field is loaded, including two generators and two discriminators. The two generators can be regarded as encoders and decoders, respectively. The collected dense sequences are batch-input into the model, and the generator and discriminator are used to alternately optimize, and the trained optimal model parameters are saved.

优选地，对多条时序数据的缺失时间和数据进行检测与补全的操作可以包括：对一条时序数据从前往后计算时间戳的间隔，获取时间戳间隔的众数，将该众数当作数据采集周期。通过时序数据中前一个和后一个时间戳的间隔整除数据采集周期，如果得到的整除结果等于n，则在前一个时间戳和后一个时间戳对应的时序数据中间补全n-1个前向值，n为大于等于1的正整数。Preferably, the operation of detecting and completing missing time and data of multiple time series data may include: calculating the interval of timestamps from the front to the back of a time series data, obtaining the mode of the timestamp interval, and using the mode as the data collection period. Dividing the data collection period by the interval of the previous and next timestamps in the time series data, if the divisible result is equal to n, then completing n-1 forward values between the time series data corresponding to the previous timestamp and the next timestamp, where n is a positive integer greater than or equal to 1.

优选地，为了保持每条时序数据的长度一致，因此，需要获取多条时序数据中数据长度最短的一条时序数据；并根据最短的数据长度，对这多条时序数据进行截断，得到统一长度的时序数据，并保存为历史数据序列长度。Preferably, in order to keep the length of each time series data consistent, it is necessary to obtain the time series data with the shortest data length among multiple time series data; and according to the shortest data length, truncate these multiple time series data to obtain time series data of uniform length, and save it as the historical data sequence length.

优选地，使用3sigma统计方法对进行补全以及截断等操作之后的多条时序数据进行处理，并将处于3sigma之外的时序数据对应的时间点判断为离群点。将多条时序数据的同一时间点输入3sigma，只要有一个时间点处于3sigma之外，确定该时间点为离群点，将包括离群点的整条时间序列确定为离群序列。其后，对多条时序数据筛选掉离群序列，收集得到密集序列。对这多条密集序列进行横向归一化并计算每个时间戳的均值，得到历史密集数据均值序列。Preferably, the 3sigma statistical method is used to process multiple time series data after completion and truncation, and the time points corresponding to the time series data outside 3sigma are judged as outliers. The same time point of multiple time series data is input into 3sigma. As long as there is a time point outside 3sigma, the time point is determined to be an outlier, and the entire time series including the outlier is determined as an outlier sequence. Thereafter, the outlier sequences are filtered out of the multiple time series data, and dense sequences are collected. These multiple dense sequences are horizontally normalized and the mean of each timestamp is calculated to obtain the mean sequence of historical dense data.

步骤S320，预处理待检测时序数据。Step S320: pre-processing the time series data to be detected.

具体地，处理器以预设的特定频率自动化采集设备上传感器A的传感器参数，得到线上实时窗口数据，此处的窗口数据即为前述实施例中的待检测时序数据。当窗口数据长度不足历史数据序列长度时，则根据历史密集数据均值序列进行补齐，然后经过如步骤S320中对待训练的时序数据进行的截断等操作，对窗口数据进行预处理，得到待检测时序数据。Specifically, the processor automatically collects sensor parameters of sensor A on the device at a preset specific frequency to obtain online real-time window data, where the window data is the time series data to be detected in the aforementioned embodiment. When the length of the window data is less than the length of the historical data sequence, it is padded according to the mean sequence of historical dense data, and then the window data is preprocessed through operations such as truncation of the time series data to be trained in step S320 to obtain the time series data to be detected.

步骤S330，根据训练好的深度学习模型，对待检测时序数据进行异常检测和识别。Step S330: Perform anomaly detection and identification on the time series data to be detected based on the trained deep learning model.

具体地，当对线上实时窗口数据进行预处理，得到待检测时序数据后，将待检测时序数据加载入训练好的深度学习模型，即前述实施例中的目标深度学习模型。通过目标深度学习模型评估待检测时序数据中的异常时，首先通过深度学习模型中的两个生成器，即编码器和解码器得到待检测时序数据的重构序列，然后通过计算重构序列和输入序列之间的重构误差来进行异常识别。此处的生成器即为前述实施例中的生成模块。示例性地，通过L2正则化的方法计算重构序列和输入序列之间的每个时间点的误差。其后计算重构误差的归一化得分，将处于0到1之间的归一化得分对应的时间点的异常分值作为最终的异常分值数组。Specifically, after the online real-time window data is preprocessed to obtain the time series data to be detected, the time series data to be detected is loaded into the trained deep learning model, that is, the target deep learning model in the aforementioned embodiment. When evaluating the anomalies in the time series data to be detected by the target deep learning model, the reconstructed sequence of the time series data to be detected is first obtained by the two generators in the deep learning model, namely the encoder and the decoder, and then the anomaly is identified by calculating the reconstruction error between the reconstructed sequence and the input sequence. The generator here is the generation module in the aforementioned embodiment. Exemplarily, the error at each time point between the reconstructed sequence and the input sequence is calculated by the L2 regularization method. Thereafter, the normalized score of the reconstruction error is calculated, and the anomaly score at the time point corresponding to the normalized score between 0 and 1 is used as the final anomaly score array.

其后，引入滑动窗口对线上数据的异常分值数组进行数据切分操作，将数组划分为多个信号段，从而获得了固定秒数的多条窗口数据，多条窗口数据即为前述实施例中的子序列。在多条窗口数据内计算四分位统计值，并将超过四分位统计值的点设置为异常点。同时，将输出的异常分值数组转换为异常标签数组。其中异常点的标签是1，即为前述实施例中的第一标签；正常点的标签是0，即为前述实施例中的第二标签。采用滑动窗口的方式来评估异常分值，增加了对于异常的召回率。Afterwards, a sliding window is introduced to perform data segmentation operations on the anomaly score array of online data, and the array is divided into multiple signal segments, thereby obtaining multiple window data of fixed seconds, and the multiple window data are the subsequences in the aforementioned embodiment. The quartile statistics are calculated in the multiple window data, and the points exceeding the quartile statistics are set as anomalies. At the same time, the output anomaly score array is converted into an anomaly label array. The label of the anomaly point is 1, which is the first label in the aforementioned embodiment; the label of the normal point is 0, which is the second label in the aforementioned embodiment. The sliding window method is used to evaluate the anomaly score, which increases the recall rate of the anomaly.

进一步地，对于异常分值到异常标签的转换，利用滑动窗口将完整时间序列划分成多个等长的子序列，每一个子序列作为一个样本，检测每一个子序列是否含有异常点，以增加对于异常的召回率。具体地，把子序列内异常分值超过四分位的点设为异常。示例性地，将窗口序列长度设置为完整序列的三分之一，并将滑动步长设置为完整序列的三十分之一。先计算输入序列每个滑动窗口内的下四分位值Q1和上四分位值Q3，将窗口数据中大于上四分位加上倍(Q3-Q1)的时间点全部判为异常，生成异常数组1，如下所示：Furthermore, for the conversion of anomaly scores to anomaly labels, a sliding window is used to divide the complete time series into multiple subsequences of equal length. Each subsequence is used as a sample to detect whether each subsequence contains anomalies, so as to increase the recall rate of anomalies. Specifically, points in the subsequence whose anomaly scores exceed the quartile are set as anomalies. Exemplarily, the window sequence length is set to one-third of the complete sequence, and the sliding step size is set to one-thirtieth of the complete sequence. First, the lower quartile value Q1 and the upper quartile value Q3 in each sliding window of the input sequence are calculated, and the points in the window data that are greater than the upper quartile plus the lower quartile value Q1 and the upper quartile value Q3 are added. All time points (Q3-Q1) are judged as abnormal, and abnormal array 1 is generated, as shown below:

其中，表示异常数组1，Q1表示下四分位值，Q3表示上四分位值；t表示时间戳；根据时序数据的良率确定。in, represents anomaly array 1, Q1 represents the lower quartile value, Q3 represents the upper quartile value; t represents the timestamp; Determined based on the yield of timing data.

将线上窗口数据中小于下四分位减去倍(Q3-Q1)的时间点全部判为异常，生成异常数组2，如下所示：Subtract the online window data that is less than the lower quartile All time points (Q3-Q1) are judged as abnormal, and abnormal array 2 is generated, as shown below:

其中，表示异常数组2，Q1表示下四分位值，Q3表示上四分位值；t表示时间戳；根据时序数据的良率确定。将异常数组中的异常点和非异常点标出，其中异常点标记为1，非异常点标记为0。进一步地，两个异常数组有一个为异常的点就算是异常点。in, represents anomaly array 2, Q1 represents the lower quartile value, Q3 represents the upper quartile value; t represents the timestamp; Determined based on the yield of the timing data. Mark the abnormal points and non-abnormal points in the abnormal array, where the abnormal points are marked as 1 and the non-abnormal points are marked as 0. Furthermore, if there is an abnormal point in the two abnormal arrays, it is considered an abnormal point.

同时，滑动窗口会增加对于异常的召回率，同样也会增加误检率。因此，进一步地，采用批判召回的后处理方法降低对于异常检测的误检率。具体地，将前后滑动窗口内的最大异常分值做差除操作，此处的前后滑动窗口分别为前述实施例中的第一晶圆序列和第二晶圆序列。如果得到的差除操作结果小于设定的阈值，则将前置窗口内，即第一晶圆序列内的异常点标签全部重新赋值为正常的第二标签。将最终后处理完的二维异常数组输出并实时标红可视化页面的时序窗口内的异常点，供芯片工程师观察，实时监测窗口内数据的变化。At the same time, the sliding window will increase the recall rate for anomalies, and will also increase the false detection rate. Therefore, further, a post-processing method of critical recall is used to reduce the false detection rate for anomaly detection. Specifically, the maximum anomaly score in the front and rear sliding windows is subjected to a difference operation, where the front and rear sliding windows are the first wafer sequence and the second wafer sequence in the aforementioned embodiment, respectively. If the difference operation result obtained is less than the set threshold, all the labels of the abnormal points in the front window, that is, the first wafer sequence, are reassigned to the normal second label. The final post-processed two-dimensional anomaly array is output and the abnormal points in the timing window of the visualization page are marked in red in real time for chip engineers to observe and monitor the changes in data in the window in real time.

进一步地，对于批判召回方法，对异常点进行处理来降低误检率，进而提高异常识别精度。具体地，首先保存每个滑动窗口序列的最大异常分值，对于每个异常序列，我们使用最大值异常评分来表示它，即前后区间的最大异常分值做差除运算得到M，如下所示：Furthermore, for the critical recall method, the outliers are processed to reduce the false positive rate, thereby improving the accuracy of anomaly recognition. Specifically, the maximum anomaly score of each sliding window sequence is first saved. For each anomaly sequence, we use the maximum anomaly score to represent it, that is, the maximum anomaly score of the previous and next intervals is calculated by the difference operation to obtain M, as shown below:

其中，k表示第k个异常序列，表示前一滑动窗口序列的最大异常分值；表示后一滑动窗口序列的最大异常分值，表示差除结果。如果M不超过预设的异常阈值θ，则将前置区间的点标签全部赋值为0。通过对异常点进行再次识别，进而有利于减少误报。Among them, k represents the kth abnormal sequence, Indicates the maximum anomaly score of the previous sliding window sequence; represents the maximum anomaly score of the next sliding window sequence, Represents the difference result. If M does not exceed the preset abnormal threshold θ, all point labels in the preceding interval are assigned 0. By re-identifying the abnormal points, it is helpful to reduce false positives.

在其中一个实施例中，参考图4，图4是本具体实施例提供的深度学习模型的结构图。整个模型主要包括数据准备组件41、模型训练组件42和异常输出组件43。In one embodiment, referring to FIG4 , FIG4 is a structural diagram of a deep learning model provided in this specific embodiment. The entire model mainly includes a data preparation component 41 , a model training component 42 and an abnormal output component 43 .

对于数据准备组件41，主要包括样本前处理模块。将批量传感器时间序列先经过数据补全、数据截断和归一化进行前处理操作，随后将前处理完的时间序列批量送入模型进行训练。The data preparation component 41 mainly includes a sample pre-processing module, which performs pre-processing operations on batch sensor time series by data completion, data truncation and normalization, and then sends the pre-processed time series to the model for training in batches.

对于模型训练组件42，针对半导体领域传感器波形构造了基于双向限制对抗学习的实时异常检测模型，即前述实施例中的深度学习模型。该模型包括两个生成器和两个判别器模块，每个模块以两到三层全连接作为基本组件构成。将生成器和判别器分别嵌入到对抗学习的框架中，输入训练时序数据对该模型进行训练。模型训练完会生成四个模型：编码器、解码器、真假判别器以及测量判别器。其中编码器和解码器对应两个生成器。编码器将时间序列映射到潜在空间，而解码器将潜在空间转换到重建的时间序列。真假判别器用来区分是输入的真实时间序列还是解码器生成的虚假时间序列，测量判别器用来测量编码器将输入序列映射到潜在空间的有效性。For the model training component 42, a real-time anomaly detection model based on bidirectional restricted adversarial learning is constructed for the sensor waveform in the semiconductor field, that is, the deep learning model in the aforementioned embodiment. The model includes two generators and two discriminator modules, each of which is composed of two to three layers of full connection as basic components. The generator and the discriminator are respectively embedded in the framework of adversarial learning, and the training time series data is input to train the model. After the model training is completed, four models will be generated: an encoder, a decoder, a true and false discriminator, and a measurement discriminator. Among them, the encoder and the decoder correspond to two generators. The encoder maps the time series to the latent space, and the decoder converts the latent space to the reconstructed time series. The true and false discriminator is used to distinguish whether it is a real time series input or a false time series generated by the decoder, and the measurement discriminator is used to measure the effectiveness of the encoder mapping the input sequence to the latent space.

优选地，每隔一段时间拉取一批最新数据，使用以上提出的深度学习模型训练。将收集完的密集时间序列输入模型，使用生成器和判别器交替优化，并保存最优模型参数。对于每一个训练周期，首先确定训练两个判别器的迭代次数，示例性地，判别器的迭代次数可以为q次，q为大于1的正整数；然后对生成器进行一次迭代训练。其后，对两个判别器和生成器分别进行损失计算，反向传播。采用学习率衰减策略和早停策略进行训练。示例性地，对于学习率衰减策略，如果有连续10个时序数据epoch的训练损失不降低，就降低学习率。对于早停策略，如果在学习率衰减后仍然有连续10个epoch的训练损失不降低，就停止训练并保存最优模型参数。其中，对训练好的深度学习模型进行优化和调参，具体是基于异常数据优化：移除表征效果不好的噪声数据，确保数据质量。清洗掉异常数据后重新训练模型，以消除异常的影响。Preferably, a batch of the latest data is pulled at regular intervals and trained using the deep learning model proposed above. The collected dense time series is input into the model, and the generator and the discriminator are alternately optimized, and the optimal model parameters are saved. For each training cycle, the number of iterations for training the two discriminators is first determined. For example, the number of iterations of the discriminator can be q times, where q is a positive integer greater than 1; then the generator is iteratively trained once. Thereafter, the loss of the two discriminators and the generator is calculated and back-propagated respectively. The learning rate decay strategy and the early stopping strategy are used for training. For example, for the learning rate decay strategy, if the training loss of 10 consecutive time series data epochs does not decrease, the learning rate is reduced. For the early stopping strategy, if the training loss of 10 consecutive epochs does not decrease after the learning rate decays, the training is stopped and the optimal model parameters are saved. Among them, the trained deep learning model is optimized and adjusted, specifically based on abnormal data optimization: remove noise data with poor representation effect to ensure data quality. After cleaning the abnormal data, retrain the model to eliminate the impact of the abnormality.

对于异常输出组件43，主要包括异常分值检测、标签转换、批判召回三个阶段。在异常分值检测阶段，仅利用两个生成器，即编码器和解码器进行重构得到重构序列，进而通过计算重构序列和输入序列的偏差来检测异常。具体地，采用L2范数损失对重构误差进行评估。The abnormal output component 43 mainly includes three stages: abnormal score detection, label conversion, and critical recall. In the abnormal score detection stage, only two generators, namely the encoder and the decoder, are used to reconstruct the reconstructed sequence, and then the anomaly is detected by calculating the deviation between the reconstructed sequence and the input sequence. Specifically, the L2 norm loss is used to evaluate the reconstruction error.

优选地，该实时异常检测方法基于双向限制对抗学习，重构序列是基于训练序列构建的。当有新序列进来时，检测之前构建的重构序列和新序列之间的差异，计算每个时间点的异常分值，并根据预设的异常分值阈值筛选出异常。使用上述的深度学习算法实时监测传感器参数的变化，可以对时序异常点快速识别，准确率Accuracy达到0.99+，精确率/召回率/f1值（Precision/Recall/F1-Score）达到0.8+的识别精度。Preferably, the real-time anomaly detection method is based on bidirectional restricted adversarial learning, and the reconstructed sequence is constructed based on the training sequence. When a new sequence comes in, the difference between the previously constructed reconstructed sequence and the new sequence is detected, the anomaly score at each time point is calculated, and the anomaly is filtered out according to the preset anomaly score threshold. Using the above-mentioned deep learning algorithm to monitor the changes in sensor parameters in real time, the time series anomalies can be quickly identified, with an accuracy of 0.99+ and a precision/recall/f1-score of 0.8+.

本申请通过采集大量芯片生产过程中的传感器数据，进行针对性的数据清洗和数据预处理，包括补全、截断、归一化步骤，有利于提高输入数据的质量。同时，根据半导体领域的晶圆制造的特定需求，定制化设计了一种适应性强的异常检测算法。该深度学习算法基于双向限制对抗学习，能够捕捉和识别不同类型的异常情况，可以提高异常检测的精确度和效率。以及，对训练好的深度学习模型进行优化和调参，以提高其准确性和鲁棒性。示例性地，对模型进行优化和调参可以通过优化器、学习率衰减策略、训练早停策略等方法。将训练好的模型对实际芯片生产中的数据流进行实时监测，并将检测结果反馈给操作人员或自动化系统。这有助于及时发现异常情况并采取相应措施以提高良率。This application collects a large amount of sensor data in the chip production process, performs targeted data cleaning and data preprocessing, including completion, truncation, and normalization steps, which is conducive to improving the quality of input data. At the same time, according to the specific needs of wafer manufacturing in the semiconductor field, a highly adaptable anomaly detection algorithm is customized and designed. The deep learning algorithm is based on bidirectional restricted adversarial learning, which can capture and identify different types of anomalies and improve the accuracy and efficiency of anomaly detection. And, the trained deep learning model is optimized and adjusted to improve its accuracy and robustness. Exemplarily, the optimization and adjustment of the model can be carried out by optimizers, learning rate decay strategies, training early stopping strategies and other methods. The trained model is used to monitor the data flow in the actual chip production in real time, and the test results are fed back to the operator or the automation system. This helps to detect abnormal situations in time and take corresponding measures to improve the yield.

需要说明的是，在上述流程中或者附图的流程图中示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行。It should be noted that the steps shown in the above process or the flowchart in the accompanying drawings can be executed in a computer system such as a group of computer executable instructions.

在本实施例中还提供了一种基于晶圆制造的异常检测装置，该装置用于实现上述实施例及优选实施方式，已经进行过说明的不再赘述。以下所使用的术语“模块”、“单元”、“子单元”等可以实现预定功能的软件和/或硬件的组合。尽管在以下实施例中所描述的装置较佳地以软件来实现，但是硬件，或者软件和硬件的组合的实现也是可能并被构想的。In this embodiment, an abnormality detection device based on wafer manufacturing is also provided, which is used to implement the above-mentioned embodiments and preferred implementation modes, and the descriptions that have been made will not be repeated. The terms "module", "unit", "subunit", etc. used below can implement a combination of software and/or hardware of predetermined functions. Although the devices described in the following embodiments are preferably implemented in software, the implementation of hardware, or a combination of software and hardware, is also possible and conceivable.

图5是本实施例的基于晶圆制造的异常检测装置的结构框图，如图5所示，该装置包括：数据获取模块10、误差确定模块20、数据评估模块30。FIG5 is a structural block diagram of an abnormality detection device based on wafer manufacturing according to this embodiment. As shown in FIG5 , the device includes: a data acquisition module 10 , an error determination module 20 , and a data evaluation module 30 .

数据获取模块10，用于获取待检测时序数据；待检测时序数据包括晶圆设备的传感器数据。The data acquisition module 10 is used to acquire the time series data to be detected; the time series data to be detected includes sensor data of the wafer equipment.

误差确定模块20，用于根据预先训练好的目标深度学习模型，确定待检测时序数据对应的重构误差；目标深度学习模型根据预设的对抗学习方法训练得到。The error determination module 20 is used to determine the reconstruction error corresponding to the time series data to be detected based on a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method.

数据评估模块30，用于根据重构误差确定异常序列，并根据预设序列长度的滑动窗口对异常序列中的异常分值进行评估，确定目标晶圆设备异常数据。The data evaluation module 30 is used to determine the abnormal sequence according to the reconstruction error, and evaluate the abnormal score in the abnormal sequence according to the sliding window of the preset sequence length to determine the abnormal data of the target wafer equipment.

需要说明的是，上述各个模块可以是功能模块也可以是程序模块，既可以通过软件来实现，也可以通过硬件来实现。对于通过硬件来实现的模块而言，上述各个模块可以位于同一处理器中；或者上述各个模块还可以按照任意组合的形式分别位于不同的处理器中。It should be noted that the above modules can be functional modules or program modules, and can be implemented by software or hardware. For modules implemented by hardware, the above modules can be located in the same processor; or the above modules can be located in different processors in any combination.

在本实施例中还提供了一种电子装置，包括存储器和处理器，该存储器中存储有计算机程序，该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。This embodiment also provides an electronic device, including a memory and a processor, wherein the memory stores a computer program, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.

可选地，上述电子装置还可以包括传输设备以及输入输出设备，其中，该传输设备和上述处理器连接，该输入输出设备和上述处理器连接。Optionally, the electronic device may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor, and the input/output device is connected to the processor.

可选地，在本实施例中，上述处理器可以被设置为通过计算机程序执行以下步骤：Optionally, in this embodiment, the processor may be configured to perform the following steps through a computer program:

S1，获取待检测时序数据；待检测时序数据包括晶圆设备的传感器数据。S1, obtaining time series data to be detected; the time series data to be detected includes sensor data of wafer equipment.

S2，根据预先训练好的目标深度学习模型，确定待检测时序数据对应的重构误差；目标深度学习模型根据预设的对抗学习方法训练得到。S2, based on the pre-trained target deep learning model, determine the reconstruction error corresponding to the time series data to be detected; the target deep learning model is trained according to the preset adversarial learning method.

S3，根据重构误差确定异常序列，并根据预设序列长度的滑动窗口对异常序列中的异常分值进行评估，确定目标晶圆设备异常数据。S3, determining an abnormal sequence according to the reconstruction error, and evaluating the abnormal score in the abnormal sequence according to a sliding window of a preset sequence length to determine abnormal data of the target wafer device.

需要说明的是，在本实施例中的具体示例可以参考上述实施例及可选实施方式中所描述的示例，在本实施例中不再赘述。It should be noted that the specific examples in this embodiment can refer to the examples described in the above embodiments and optional implementation modes, and will not be repeated in this embodiment.

此外，结合上述实施例中提供的基于晶圆制造的异常检测方法，在本实施例中还可以提供一种存储介质来实现。该存储介质上存储有计算机程序；该计算机程序被处理器执行时实现上述实施例中的任意一种基于晶圆制造的异常检测方法。In addition, in combination with the wafer manufacturing-based anomaly detection method provided in the above embodiments, a storage medium may also be provided in this embodiment to implement the method. The storage medium stores a computer program; when the computer program is executed by a processor, any of the wafer manufacturing-based anomaly detection methods in the above embodiments is implemented.

应该明白的是，这里描述的具体实施例只是用来解释这个应用，而不是用来对它进行限定。根据本申请提供的实施例，本领域普通技术人员在不进行创造性劳动的情况下得到的所有其它实施例，均属本申请保护范围。It should be understood that the specific embodiments described herein are only used to explain the application, rather than to limit it. Based on the embodiments provided in this application, all other embodiments obtained by ordinary technicians in this field without creative work are within the protection scope of this application.

显然，附图只是本申请的一些例子或实施例，对本领域的普通技术人员来说，也可以根据这些附图将本申请适用于其他类似情况，但无需付出创造性劳动。另外，可以理解的是，尽管在此开发过程中所做的工作可能是复杂和漫长的，但是，对于本领域的普通技术人员来说，根据本申请披露的技术内容进行的某些设计、制造或生产等更改仅是常规的技术手段，不应被视为本申请公开的内容不足。Obviously, the drawings are only some examples or embodiments of the present application. For ordinary technicians in the field, the present application can also be applied to other similar situations based on these drawings without creative work. In addition, it is understandable that although the work done in this development process may be complicated and lengthy, for ordinary technicians in the field, certain changes in design, manufacturing or production based on the technical content disclosed in this application are only conventional technical means and should not be regarded as insufficient content disclosed in this application.

“实施例”一词在本申请中指的是结合实施例描述的具体特征、结构或特性可以包括在本申请的至少一个实施例中。该短语出现在说明书中的各个位置并不一定意味着相同的实施例，也不意味着与其它实施例相互排斥而具有独立性或可供选择。本领域的普通技术人员能够清楚或隐含地理解的是，本申请中描述的实施例在没有冲突的情况下，可以与其它实施例结合。The term "embodiment" in this application refers to a specific feature, structure or characteristic described in conjunction with the embodiment that can be included in at least one embodiment of the present application. The appearance of this phrase in various places in the specification does not necessarily mean the same embodiment, nor does it mean that it is mutually exclusive with other embodiments and is independent or optional. It is clearly or implicitly understood by those of ordinary skill in the art that the embodiments described in this application can be combined with other embodiments without conflict.

以上所述实施例仅表达了本申请的几种实施方式，其描述较为具体和详细，但并不能因此而理解为对专利保护范围的限制。应当指出的是，对于本领域的普通技术人员来说，在不脱离本申请构思的前提下，还可以做出若干变形和改进，这些都属于本申请的保护范围。因此，本申请的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation methods of the present application, and the descriptions thereof are relatively specific and detailed, but they cannot be understood as limiting the scope of patent protection. It should be pointed out that, for a person of ordinary skill in the art, several variations and improvements can be made without departing from the concept of the present application, and these all belong to the scope of protection of the present application. Therefore, the scope of protection of the present application shall be subject to the attached claims.

Claims

1. A wafer manufacturing-based anomaly detection method, characterized in that the method comprises:

Acquire time series data to be detected; the time series data to be detected includes sensor data of wafer equipment;

Determine the reconstruction error corresponding to the time series data to be detected according to a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method;

An abnormal sequence is determined according to the reconstruction error, and an abnormal score in the abnormal sequence is evaluated according to a sliding window of a preset sequence length to determine abnormal data of a target wafer device.

2. The wafer manufacturing-based abnormality detection method according to claim 1, characterized in that before acquiring the time series data to be detected, it comprises:

Acquire multiple training time series data in a wafer manufacturing process, and process the multiple training time series data to obtain multiple unified training time series data;

Determine dense sequence data in the plurality of unified training time series data; the dense sequence data includes sequence data conforming to a normal distribution;

According to the dense sequence data and the preset adversarial learning method, the preset deep learning model is trained to determine a target deep learning model; the preset deep learning model is based on the waveform of the wafer sensor.

3. The wafer manufacturing-based anomaly detection method according to claim 2, characterized in that before determining the reconstruction error corresponding to the time series data to be detected according to the pre-trained target deep learning model, it includes:

Acquire initial timing data from the sensor on the wafer according to the preset sampling frequency.

The initial time series data is processed according to the dense sequence data to obtain the time series data to be detected.

4. The wafer manufacturing-based abnormality detection method according to claim 1, characterized in that:

The step of determining the reconstruction error corresponding to the time series data to be detected according to the pre-trained target deep learning model includes:

Mapping the time series data to be detected to a preset latent space to obtain a coding vector;

When the encoding vector is within a preset threshold range, processing the encoding vector and generating a reconstructed sequence;

The difference between the data of the reconstructed sequence and the time series data to be detected is determined as a reconstruction error.

5. The wafer manufacturing-based anomaly detection method according to claim 1, wherein determining the anomaly sequence according to the reconstruction error comprises:

Normalizing the reconstruction errors corresponding to multiple time points in the time series data to be detected to obtain error scores for the multiple time points;

Determine an array of abnormal scores according to preset abnormal values; the preset abnormal scores are obtained by processing the error scores according to a preset statistical method;

Determine that the sequence corresponding to the abnormal score array is an abnormal sequence.

6. The wafer manufacturing-based anomaly detection method according to claim 1, characterized in that the step of evaluating the anomaly scores in the anomaly sequence according to the sliding window of the preset sequence length to determine the target wafer equipment anomaly data comprises:

Dividing the abnormal sequence according to a sliding window of a preset sequence length to obtain multiple subsequences;

Determine the abnormal scores exceeding the preset statistical value in the subsequence as abnormal points of wafer equipment, and determine the abnormal scores other than the abnormal points of wafer equipment in the subsequence as normal points of wafer equipment;

Setting a first label for the abnormal point of the wafer equipment in the subsequence, and setting a second label for the normal point of the wafer equipment;

The data of the abnormal point of the wafer device corresponding to the first tag is determined as the abnormal data of the wafer device.

7. The wafer manufacturing-based anomaly detection method according to claim 6, characterized in that the sliding window of the preset sequence length is used to evaluate the anomaly score in the anomaly sequence to determine the target wafer device anomaly data, and further comprises:

According to the sliding window, the abnormal sequence is divided to obtain a first wafer sequence and a second wafer sequence; the first wafer sequence and the second wafer sequence are adjacent subsequences respectively;

Performing a difference operation on a first abnormal score in the first wafer sequence and a second abnormal score in the second wafer sequence to obtain a target sequence abnormal score; the first abnormal score and the second abnormal score include an abnormal score ranked first in the abnormal sequence;

When the abnormal score of the target sequence does not exceed a preset score threshold, setting a second label for the abnormal score in the first wafer sequence;

Determine the abnormal scores other than those set as the second label as target wafer device abnormal data.

8. An abnormality detection device based on wafer manufacturing, characterized in that the device comprises: a data acquisition module, an error determination module, and a data evaluation module;

The data acquisition module is used to acquire the time series data to be detected; the time series data to be detected includes sensor data of the wafer equipment;

The error determination module is used to determine the reconstruction error corresponding to the time series data to be detected according to a pre-trained target deep learning model; the target deep learning model is trained according to a preset adversarial learning method;

The data evaluation module is used to determine an abnormal sequence according to the reconstruction error, and evaluate the abnormal score in the abnormal sequence according to a sliding window of a preset sequence length to determine the abnormal data of the target wafer equipment.

9. An electronic device comprising a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to execute the wafer manufacturing-based anomaly detection method according to any one of claims 1 to 7.

10. A computer-readable storage medium having a computer program stored thereon, wherein when the computer program is executed by a processor, the steps of the wafer manufacturing-based anomaly detection method according to any one of claims 1 to 7 are implemented.