TW202343613A

TW202343613A - Detection formula configuration and optimization method and apparatus, electronic device and storage medium

Info

Publication number: TW202343613A
Application number: TW112115870A
Authority: TW
Inventors: 王敬賢; 劉濤; 潘成安; 鄧帥飛; 易兵; 魯陽; 張記晨; 周許超
Original assignee: 大陸商上海微電子裝備（集團）股份有限公司
Priority date: 2022-04-29
Filing date: 2023-04-27
Publication date: 2023-11-01
Also published as: WO2023208091A1; CN117058064A

Abstract

Provided in the present invention is a detection formula configuration and optimization method and apparatus, an electronic device and a storage medium. The method comprises: labeling a first data sample to obtain a second data sample; wherein the first data sample comprises a plurality of pieces of detection result data, and the second data sample comprises the detection result data and a label corresponding to each piece of data; according to the second data sample, obtaining data feature distribution information of a detection object; using a preset outlier statistical analysis strategy, performing outlier statistical analysis on the data feature distribution information, so as to obtain defect distribution boundary information and determine a detection formula; finally, according to the defect distribution boundary information and the preset outlier statistical analysis strategy, determining or optimizing the values of detection parameters of the detection formula by means of reverse derivation. In the invention, the coupling relationship between the parameters is considered, such that repeated adjustment of parameters can be avoided, and meanwhile, a whole set of detection parameters are inferred, so that rapid modeling of the detection formula is achieved; and manpower and time costs can be saved.

Description

Testing recipe setup and optimization methods, devices, electronic equipment and storage media

本發明涉及半導體技術領域，特別涉及一種檢測配方設置與優化方法、裝置、電子設備和儲存介質。The invention relates to the field of semiconductor technology, and in particular to a detection recipe setting and optimization method, device, electronic equipment and storage medium.

在半導體晶圓的製造過程中，晶圓翹曲度(Bow)及晶圓表面的形貌是影響制程工藝穩定性及產品良率的關鍵參數，對晶圓的良率(Yield)有著關鍵的影響。比如晶圓在經過刻蝕或薄膜沉積等不同工藝後，晶圓會發生不同程度的翹曲或使晶圓表面凹凸不平；又比如，在半導體集成電路製造過程中機械手可能會刮傷晶圓。因此，晶圓的缺陷是所有芯片製造廠在良率檢測中最為關注的部分。晶圓一旦存在缺陷，很難通過後續工藝進行補救，因此如何快速準確地檢驗出晶圓表面的缺陷，避免因有缺陷的產品流入下道工序造成生產資源的浪費變得至關重要。In the manufacturing process of semiconductor wafers, wafer warpage (Bow) and wafer surface morphology are key parameters that affect process stability and product yield, and are critical to wafer yield (Yield). influence. For example, after the wafer undergoes different processes such as etching or thin film deposition, the wafer will warp to varying degrees or the wafer surface will be uneven; another example is that a robot may scratch the wafer during the manufacturing process of semiconductor integrated circuits. . Therefore, wafer defects are what all chip manufacturers pay most attention to during yield inspection. Once a wafer is defective, it is difficult to remedy it through subsequent processes. Therefore, it is crucial to quickly and accurately detect defects on the wafer surface to avoid wasting production resources due to defective products flowing into the next process.

現有技術中，晶圓缺陷的檢測流程通常採用正向流程調參，然而由於現場工藝的多樣性，需要每次生成大量資訊，加上缺少先驗知識，通常對檢測流程的檢測參數進行逐一調節，由於不能將參數之間的耦合關係考慮在內，因此，單個參數的反復調整可能導致調參結果的偏差，為了達到較好的檢測效果，檢測配方需要反復調節參數，帶來人力和時間成本的增加。而且，由於工藝的多樣性，已有的檢測配方很難適用於新工藝的缺陷檢測，而調整檢測配方的參數需要具有一定的算法背景，因此對用戶要求較高。In the existing technology, the wafer defect detection process usually uses forward process parameter adjustment. However, due to the diversity of on-site processes, a large amount of information needs to be generated each time, and coupled with the lack of prior knowledge, the detection parameters of the detection process are usually adjusted one by one. , since the coupling relationship between parameters cannot be taken into account, repeated adjustments of a single parameter may lead to deviations in the parameter adjustment results. In order to achieve better detection results, the detection formula needs to repeatedly adjust parameters, which brings manpower and time costs. increase. Moreover, due to the diversity of processes, existing detection formulas are difficult to apply to defect detection in new processes. Adjusting the parameters of the detection formula requires a certain algorithm background, so the requirements for users are high.

需要說明的是，公開于該發明背景技術部分的資訊僅僅旨在加深對本發明一般背景技術的理解，而不應當被視為承認或以任何形式暗示該資訊構成已為本領域技術人員所公知的現有技術。It should be noted that the information disclosed in the background technology section of this invention is only intended to deepen the understanding of the general background technology of the invention, and should not be regarded as an admission or any form of implication that the information constitutes a component that is already known to those skilled in the art. existing technology.

本發明的目的在於針對現有技術中存在的缺陷，提供一種檢測配方設置與優化方法、系統、電子設備和儲存介質，本發明提供的檢測配方設置與優化方法，基於檢測結果數據的先驗知識，並充分考慮參數之間的耦合關係一次性確定檢測配方的策略及參數設置值，不僅確定所述檢測流程的效率高，而且提高了檢測配方的檢測精度。The purpose of the present invention is to provide a detection formula setting and optimization method, system, electronic equipment and storage medium in view of the defects existing in the prior art. The detection formula setting and optimization method provided by the invention is based on the prior knowledge of the detection result data. And fully consider the coupling relationship between parameters to determine the strategy and parameter setting values of the detection formula at one time, which not only ensures high efficiency of the detection process, but also improves the detection accuracy of the detection formula.

為達到上述目的，本發明提供一種檢測配方設置與優化方法，一種檢測配方設置與優化方法，包括：In order to achieve the above objectives, the present invention provides a detection formula setting and optimization method, a detection formula setting and optimization method, including:

對第一數據樣本進行標注，得到第二數據樣本；其中，所述第一數據樣本包括若干條檢測結果數據；所述第二數據樣本包括所述檢測結果數據以及每條所述檢測結果數據對應的標籤；Annotate the first data sample to obtain a second data sample; wherein, the first data sample includes several pieces of detection result data; the second data sample includes the detection result data and the corresponding data of each piece of detection result data. label;

根據所述第二數據樣本，得到檢測對象的數據特徵分布資訊；According to the second data sample, obtain the data characteristic distribution information of the detection object;

採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，並根據所述預設離群統計分析策略，確定檢測配方；Using a preset outlier statistical analysis strategy, perform outlier statistical analysis on the data feature distribution information, obtain defect distribution boundary information, and determine the detection formula according to the preset outlier statistical analysis strategy;

根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，設置或優化所述檢測配方的檢測參數的取值。According to the defect distribution boundary information and the preset outlier statistical analysis strategy, the values of the detection parameters of the detection formula are set or optimized through reverse derivation.

可選地，所述檢測結果數據包括所述檢測對象的基本資訊和特徵數據資訊；其中，所述特徵數據資訊包括檢測結果在所述檢測對象上的位置資訊，以及所述檢測對象的工藝流程資訊、所述檢測結果的數據資訊的灰度資訊、形狀資訊和紋理資訊中的一種或多種；Optionally, the detection result data includes basic information and characteristic data information of the detection object; wherein the characteristic data information includes position information of the detection result on the detection object, and the process flow of the detection object. Information, one or more of the grayscale information, shape information and texture information of the data information of the detection results;

所述對第一數據樣本進行標注，得到第二數據樣本，包括：Annotating the first data sample to obtain the second data sample includes:

獲取所述第一數據樣本中每一條檢測結果數據對應的所述檢測對象的基本資訊；Obtain the basic information of the detection object corresponding to each piece of detection result data in the first data sample;

對於每一條檢測結果數據，根據所述檢測對象的基本資訊和所述檢測結果在所述檢測對象上的位置資訊，獲取該條檢測結果數據在所述檢測對象上對應的原始資訊；For each piece of detection result data, obtain the original information corresponding to the detection result data on the detection object based on the basic information of the detection object and the location information of the detection result on the detection object;

根據所述原始資訊，判斷所述檢測結果的數據資訊標出的缺陷是否為真缺陷，若是，則將該條檢測結果數據標記為真缺陷數據；若否，則將該條檢測結果數據標記為噪擾數據；According to the original information, it is judged whether the defect marked by the data information of the detection result is a real defect. If so, the detection result data is marked as true defect data; if not, the detection result data is marked as Noisy data;

根據所有的所述檢測結果數據及每條所述檢測結果數據對應的標籤，得到所述第二數據樣本。The second data sample is obtained based on all the detection result data and the label corresponding to each piece of detection result data.

可選地，所述檢測對象包括Wafer；所述Wafer的基本資訊包括所述Wafer的編號、包含的Die個數以及每一個Die的基本資訊；所述Die的基本資訊包括該Die的Die編號和圖像資訊；Optionally, the detection object includes a Wafer; the basic information of the Wafer includes the number of the Wafer, the number of Dies it contains, and the basic information of each Die; the basic information of the Die includes the Die number and the Die number of the Die. image information;

所述根據所述檢測對象的基本資訊和所述檢測結果在所述檢測對象上的位置資訊，獲取該條檢測結果數據在所述檢測對象上對應的原始資訊，包括：Obtaining the original information corresponding to the detection result data on the detection object based on the basic information of the detection object and the location information of the detection result on the detection object includes:

根據所述Wafer的基本資訊，獲取所述Wafer的每一個Die的Die編號及每一所述Die的基本資訊；According to the basic information of the Wafer, obtain the Die number of each Die of the Wafer and the basic information of each Die;

根據所述檢測結果在所述Die上的位置資訊以及所述Die的圖像資訊，獲取該條檢測結果數據在所述Die上對應的檢測結果的圖像資訊。According to the position information of the detection result on the Die and the image information of the Die, the image information of the detection result corresponding to the detection result data on the Die is obtained.

可選地，所述根據所述第二數據樣本，得到所述檢測對象的數據特徵分布資訊，包括：Optionally, obtaining the data feature distribution information of the detection object according to the second data sample includes:

確定特徵數據軸和分割數據軸，並根據所述特徵數據軸和分割數據軸建立特徵空間；其中，所述特徵數據軸代表所述檢測結果數據的特徵數據資訊，所述分割數據軸代表分割特徵資訊；其中，所述分割特徵資訊包括除用於所述特徵數據軸之外的其他特徵數據資訊；Determine the characteristic data axis and the segmentation data axis, and establish a feature space based on the characteristic data axis and the segmentation data axis; wherein the characteristic data axis represents the characteristic data information of the detection result data, and the segmentation data axis represents the segmentation feature Information; wherein the segmentation feature information includes other feature data information in addition to the feature data axis;

根據所述特徵空間對所述第二數據樣本進行排列，得到所述檢測對象的數據特徵分布資訊。Arrange the second data samples according to the feature space to obtain data feature distribution information of the detection object.

可選地，所述特徵空間包括一個或多個所述特徵數據軸以及一個或多個所述分割數據軸。Optionally, the feature space includes one or more feature data axes and one or more segmentation data axes.

可選地，所述根據所述特徵空間對所述第二數據樣本進行排列，得到所述檢測對象的數據特徵分布資訊，包括：Optionally, the second data samples are arranged according to the feature space to obtain the data feature distribution information of the detection object, including:

將所述特徵數據軸作為橫軸，將所述分割數據軸作為縱軸，建立直角坐標系；Use the feature data axis as the horizontal axis and the segmented data axis as the vertical axis to establish a rectangular coordinate system;

在所述直角坐標系內，在所述橫軸方向按照所述特徵數據軸代表的所述特徵數據資訊的特徵值大小、在所述縱軸方向按照所述分割數據軸代表的所述特徵數據資訊的特徵值大小對所述第二數據樣本進行排列，得到缺陷特徵分布圖。In the rectangular coordinate system, in the horizontal axis direction, the feature value size of the feature data information represented by the feature data axis, and in the vertical axis direction, according to the feature data represented by the segmented data axis. The second data samples are arranged according to the characteristic value size of the information to obtain a defect characteristic distribution map.

可選地，所述採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，包括：Optionally, using a preset outlier statistical analysis strategy to perform outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information includes:

判斷是否自動尋找缺陷分布邊界資訊，若是，則根據選擇的離群統計分析模型，對所述離群統計分析模型進行訓練，獲取缺陷分布邊界資訊；若否，則採用數據分割法對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊；Determine whether to automatically find the defect distribution boundary information. If so, train the outlier statistical analysis model according to the selected outlier statistical analysis model to obtain the defect distribution boundary information; if not, use the data segmentation method to analyze the data. Conduct outlier statistical analysis on feature distribution information to obtain defect distribution boundary information;

其中，所述對所述離群統計分析模型進行訓練，包括：根據所述檢測結果數據和所述數據特徵分布資訊，對選定的所述離群統計分析模型進行訓練，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第一預設條件；Wherein, training the outlier statistical analysis model includes: training the selected outlier statistical analysis model according to the detection result data and the data feature distribution information until the detection result is obtained. The defect distribution boundary information of the object satisfies the first preset condition;

所述採用數據分割法對所述數據特徵分布資訊進行離群統計分析，包括：根據所述檢測結果數據和所述數據特徵分布資訊，在所述特徵數據軸和/或所述分割數據軸上獲取至少一個第一分割閾值；並根據所述第一分割閾值獲取所述缺陷邊界資訊，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第二預設條件。The use of data segmentation method to perform outlier statistical analysis on the data feature distribution information includes: based on the detection result data and the data feature distribution information, on the feature data axis and/or the segmented data axis Obtain at least one first segmentation threshold; and acquire the defect boundary information according to the first segmentation threshold until the obtained defect distribution boundary information of the detection object satisfies the second preset condition.

可選地，所述分割數據軸代表工藝流程資訊；所述根據所述檢測結果數據和所述數據特徵分布資訊，對所述特徵數據軸和/或所述分割數據軸進行閾值分割，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第二預設條件，包括：Optionally, the segmented data axis represents process flow information; and based on the detection result data and the data feature distribution information, threshold segmentation is performed on the characteristic data axis and/or the segmented data axis until the The defect distribution boundary information of the detection object satisfies the second preset condition, including:

根據所述數據特徵分布資訊，以及標籤為真缺陷數據和標籤為噪擾數據的檢測結果數據分布的一致性，確定所述分割數據軸的第一分割閾值；Determine the first segmentation threshold of the segmented data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data;

根據所述數據特徵分布資訊，以及標籤為真缺陷數據和標籤為噪擾數據的檢測結果數據分布的一致性，確定所述特徵數據軸的第二分割閾值；Determine the second segmentation threshold of the feature data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data;

根據所述分割數據軸的第一分割閾值和所述特徵數據軸的第二分割閾值，得到所述檢測對象的缺陷分布邊界資訊。According to the first segmentation threshold of the segmentation data axis and the second segmentation threshold of the feature data axis, the defect distribution boundary information of the detection object is obtained.

可選地，所述採用預設離群統計分析策略還包括：數據分割和模型學習相結合的離群統計分析策略；Optionally, the use of preset outlier statistical analysis strategies also includes: an outlier statistical analysis strategy that combines data segmentation and model learning;

所述數據分割和模型學習相結合的離群統計分析策略包括：根據所述數據特徵分布資訊，獲取標籤為真缺陷的所述檢測結果數據在所述分割數據軸上的至少一個第一分割閾值；並根據所述第一分割閾值和所述數據特徵分布資訊，對選定的所述離群統計分析模型進行訓練，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第三預設條件。The outlier statistical analysis strategy that combines data segmentation and model learning includes: obtaining at least one first segmentation threshold on the segmentation data axis of the detection result data labeled as a true defect based on the data feature distribution information. ; And according to the first segmentation threshold and the data feature distribution information, train the selected outlier statistical analysis model until the obtained defect distribution boundary information of the detection object meets the third preset condition.

可選地，所述根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，設置或優化所述檢測配方的檢測參數的取值，包括：Optionally, setting or optimizing the values of detection parameters of the detection formula through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy includes:

根據所述預設離群統計分析策略，確定反向推導策略；Determine a reverse derivation strategy according to the preset outlier statistical analysis strategy;

根據所述反向推導策略，確定所述反向推導策略的輸入數據資訊；According to the reverse derivation strategy, determine the input data information of the reverse derivation strategy;

根據所述輸入數據資訊，確定所述檢測結果數據的數據分布模型；Determine the data distribution model of the detection result data according to the input data information;

根據所述數據分布模型和所述缺陷分布邊界資訊，確定所述檢測配方的檢測參數；Determine the detection parameters of the detection formula according to the data distribution model and the defect distribution boundary information;

根據所述檢測配方的策略和所述反向推導的輸入數據資訊，設置或優化所述檢測配方的檢測參數的取值。According to the strategy of the detection recipe and the input data information of the reverse derivation, the values of the detection parameters of the detection recipe are set or optimized.

可選地，所述預設離群統計分析策略為數據分割法；Optionally, the preset outlier statistical analysis strategy is a data segmentation method;

根據所述數據分割法，將統計所述檢測對象的檢測結果數據的數據分布密度作為所述反向推導策略；According to the data segmentation method, the data distribution density of the detection result data of the detection object is counted as the reverse derivation strategy;

根據所述統計數據分布密度的反向推導策略，將所述檢測對象的所有檢測結果數據作為所述輸入數據資訊；According to the reverse derivation strategy of the statistical data distribution density, all detection result data of the detection object are used as the input data information;

根據所有檢測結果數據，假設所有的所述檢測結果數據的特徵數據資訊的特徵值在特徵空間的數據分布密度分為正常區域、噪擾區域和真缺陷區域；所述正常區域為數據分布密度大於第一密度閾值的區域，噪擾區域為數據密度小於或等於所述第一密度閾值且大於第二密度閾值的區域，真缺陷區域為數據密度小於或等於所述第二密度閾值的區域；According to all detection result data, it is assumed that the data distribution density of the characteristic data information of all the detection result data in the feature space is divided into normal areas, noise areas and true defect areas; the normal area is a data distribution density greater than The area of the first density threshold, the noise area is the area where the data density is less than or equal to the first density threshold and greater than the second density threshold, and the true defect area is the area where the data density is less than or equal to the second density threshold;

根據所有檢測結果數據和所有檢測結果數據的標籤，計算所述第一密度閾值和所述第二密度閾值；其中，所述第一密度閾值大於所述第二密度閾值；Calculate the first density threshold and the second density threshold according to all detection result data and labels of all detection result data; wherein the first density threshold is greater than the second density threshold;

根據所述第一密度閾值、所述第二密度閾值和所述缺陷分布邊界資訊，計算所述檢測配方的位移參數。Calculate the displacement parameter of the detection formula according to the first density threshold, the second density threshold and the defect distribution boundary information.

可選地，所述預設離群統計分析策略為基於高斯模型的離群統計分析策略；Optionally, the preset outlier statistical analysis strategy is an outlier statistical analysis strategy based on Gaussian model;

根據所述基於高斯模型的離群統計分析策略，將獲取所述檢測對象的檢測結果數據的高斯分布作為所述反向推導策略，將高斯模型檢測作為檢測配方的策略；According to the outlier statistical analysis strategy based on the Gaussian model, the Gaussian distribution of the detection result data of the detection object is obtained as the reverse derivation strategy, and Gaussian model detection is used as the detection formula strategy;

根據統計高斯分布的反向推導策略，將所述檢測對象的所有檢測結果數據作為所述輸入數據資訊和所述缺陷分布邊界資訊作為所述輸入數據資訊；According to the reverse derivation strategy of statistical Gaussian distribution, all detection result data of the detection object are used as the input data information and the defect distribution boundary information is used as the input data information;

根據所有檢測結果數據，假設所有的所述檢測結果數據的特徵數據資訊的特徵值在特徵空間的數據分布密度服從高斯分布；According to all detection result data, it is assumed that the data distribution density of the feature values of all the feature data information of the detection result data in the feature space obeys Gaussian distribution;

根據所述輸入數據資訊和所述缺陷分布邊界資訊，確定所述高斯模型檢測的參數。According to the input data information and the defect distribution boundary information, the parameters of the Gaussian model detection are determined.

可選地，所述預設離群統計分析策略為機器學習的離群統計分析策略；Optionally, the preset outlier statistical analysis strategy is a machine learning outlier statistical analysis strategy;

根據所述機器學習的離群統計分析策略，將獲取所述檢測對象的檢測結果數據的密度閾值和距離閾值作為所述反向推導策略，將機器學習模型作為檢測配方的策略；According to the outlier statistical analysis strategy of machine learning, the density threshold and distance threshold for obtaining the detection result data of the detection object are used as the reverse derivation strategy, and the machine learning model is used as the strategy of detection formula;

根據所述獲取所述檢測對象的檢測結果數據的密度閾值和距離閾值的反向推導策略，將獲取的所述檢測對象的檢測結果數據的密度和距離作為所述輸入數據資訊；According to the reverse derivation strategy of obtaining the density threshold and distance threshold of the detection result data of the detection object, the obtained density and distance of the detection result data of the detection object are used as the input data information;

根據所有檢測結果數據和所述缺陷邊界分布資訊，反向推導所述機器學習模型的檢測策略的密度參數和距離參數。Based on all inspection result data and the defect boundary distribution information, the density parameters and distance parameters of the inspection strategy of the machine learning model are reversely derived.

可選地，所述檢測配方設置與優化方法，還包括：Optionally, the detection recipe setting and optimization method also includes:

根據所述檢測配方及所述檢測配方的檢測參數的取值，對待檢測對象進行缺陷分析，得到所述待檢測對象的缺陷數據資訊。According to the detection formula and the values of the detection parameters of the detection formula, defect analysis of the object to be detected is performed to obtain defect data information of the object to be detected.

為了實現上述目的，本發明還提供了一種檢測配方設置與優化裝置，所述檢測參數與調整裝置，包括：In order to achieve the above object, the present invention also provides a detection formula setting and optimization device. The detection parameter and adjustment device includes:

真缺陷及噪擾標記單元，被配置為對第一數據樣本進行標注，得到第二數據樣本；其中，所述第一數據樣本包括若干條檢測結果數據；所述第二數據樣本包括所述檢測結果數據以及每條所述檢測結果數據對應的標籤；The true defect and noise marking unit is configured to mark the first data sample to obtain a second data sample; wherein the first data sample includes several pieces of detection result data; the second data sample includes the detection result data Result data and labels corresponding to each test result data;

特徵分布資訊獲取單元，被配置為根據所述第二數據樣本，得到檢測對象的數據特徵分布資訊；The feature distribution information acquisition unit is configured to obtain the data feature distribution information of the detection object based on the second data sample;

缺陷分布邊界獲取單元，被配置為採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，並用於根據所述預設離群統計分析策略，確定檢測配方；The defect distribution boundary acquisition unit is configured to use a preset outlier statistical analysis strategy to perform outlier statistical analysis on the data feature distribution information, obtain defect distribution boundary information, and is used to perform outlier statistical analysis according to the preset outlier statistical analysis strategy. Determine the test formula;

檢測參數設置及優化單元，被配置為根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，確定或優化所述檢測配方的檢測參數的取值。The detection parameter setting and optimization unit is configured to determine or optimize the value of the detection parameter of the detection formula through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy.

可選地，所述檢測配方設置與優化裝置，還包括：Optionally, the detection recipe setting and optimization device also includes:

檢測配方應用單元，被配置為根據所述檢測配方及所述檢測配方的檢測參數的取值，對待檢測對象進行缺陷分析，得到所述待檢測對象的缺陷數據資訊。The detection recipe application unit is configured to perform defect analysis on the object to be detected according to the detection formula and the values of detection parameters of the detection formula, and obtain defect data information of the object to be detected.

為達到上述目的，本發明還提供一種電子設備，包括處理器和儲存器，所述儲存器上儲存有電腦程式，所述電腦程式被所述處理器執行時，實現上文所述的檢測配方設置與優化方法。In order to achieve the above object, the present invention also provides an electronic device, including a processor and a storage. A computer program is stored on the storage. When the computer program is executed by the processor, the detection formula described above is realized. Setup and optimization methods.

為達到上述目的，本發明還提供一種可讀儲存介質，所述可讀儲存介質內儲存有電腦程式，所述電腦程式被處理器執行時，實現上文所述的檢測配方設置與優化方法。In order to achieve the above object, the present invention also provides a readable storage medium. A computer program is stored in the readable storage medium. When the computer program is executed by the processor, the above-mentioned detection recipe setting and optimization method is realized.

與現有技術相比，本發明提供的檢測配方設置與優化方法、裝置、電子設備和儲存介質具有以下優點：Compared with the existing technology, the detection recipe setting and optimization method, device, electronic equipment and storage medium provided by the present invention have the following advantages:

本發明提供的檢測配方設置與優化方法，首先通過對第一數據樣本進行標注，得到第二數據樣本；其中，所述第一數據樣本包括若干條檢測結果數據；所述第二數據樣本包括所述檢測結果數據以及每條所述檢測結果數據對應的標籤；然後根據所述第二數據樣本，得到檢測對象的數據特徵分布資訊，並根據所述預設離群統計分析策略，確定檢測配方；接著採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊；最後根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，檢測配方確定或優化所述檢測配方的檢測參數。由此，本發明提供的檢測配方設置與優化方法，所述第一數據樣本包括若干條檢測結果數據，所述檢測結果數據包括輔助的調參資訊（比如所述檢測對象的基本資訊和特徵數據資訊，所述特徵數據資訊包括但不限於檢測結果指示的缺陷的灰度、形狀、紋理等資訊），通過數據標注可以區分真缺陷數據和噪擾數據，為後續有效利用歷史資訊進行數據分析和推理從而能夠獲取到準確的先驗知識提供了重要的依據，能夠提高檢測配方的檢測精度。進一步地，本發明提供的檢測配方設置與優化方法，檢測配方的策略及檢測參數的取值是根據缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導得到。由此，本發明通過反向推導能夠同時推理出一套檢測參數（即同時調整出所有參數），參數之間的耦合關係也考慮在內，實現了檢測配方的快速建模；避免了反復調整參數，能夠顯著節約人力和時間成本；而且，針對新工藝缺陷檢測，無需用戶具備算法基礎也能設置或優化檢測配方的策略及所述檢測配方的檢測參數的取值。The detection recipe setting and optimization method provided by the present invention first obtains a second data sample by annotating the first data sample; wherein the first data sample includes several pieces of detection result data; the second data sample includes all The detection result data and the label corresponding to each of the detection result data; then obtain the data feature distribution information of the detection object according to the second data sample, and determine the detection formula according to the preset outlier statistical analysis strategy; Then, a preset outlier statistical analysis strategy is used to perform outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information; finally, based on the defect distribution boundary information and the preset outlier statistical analysis strategy, through reverse Through direct derivation, the detection formula determines or optimizes the detection parameters of the detection formula. Therefore, in the detection recipe setting and optimization method provided by the present invention, the first data sample includes several pieces of detection result data, and the detection result data includes auxiliary parameter adjustment information (such as the basic information and characteristic data of the detection object). Information, the characteristic data information includes but is not limited to the grayscale, shape, texture and other information of defects indicated by the detection results). Through data annotation, real defect data and noise data can be distinguished, which can effectively use historical information for subsequent data analysis and Inference provides an important basis for obtaining accurate prior knowledge, which can improve the detection accuracy of detection formulas. Furthermore, in the detection recipe setting and optimization method provided by the present invention, the detection recipe strategy and detection parameter values are obtained through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy. Therefore, the present invention can deduce a set of detection parameters at the same time (that is, adjust all parameters at the same time) through reverse derivation. The coupling relationship between parameters is also taken into account, realizing rapid modeling of detection formulas; avoiding repeated adjustments. Parameters can significantly save labor and time costs; moreover, for new process defect detection, users can set or optimize the strategy of the detection formula and the values of the detection parameters of the detection formula without having any algorithm foundation.

由於本發明提供的檢測配方設置與優化裝置、電子設備和儲存介質與本發明提供的檢測參數與調整方法屬同一發明構思，因此，本發明提供的檢測配方設置與優化裝置、電子設備和儲存介質具有所述檢測配方設置與優化方法的所有優點，在此，不再一一贅述。Since the detection recipe setting and optimization device, electronic equipment and storage medium provided by the present invention and the detection parameters and adjustment method provided by the present invention belong to the same inventive concept, the detection recipe setting and optimization device, electronic equipment and storage medium provided by the present invention It has all the advantages of the detection recipe setting and optimization method, which will not be described in detail here.

以下結合附圖和具體實施方式對本發明提出的檢測配方設置與優化方法、裝置、電子設備和儲存介質作進一步詳細說明。根據下面的說明，本發明的優點和特徵將更清楚。需要說明的是，附圖採用非常簡化的形式且均使用非精准的比例，僅用以方便、明晰地輔助說明本發明實施方式的目的。為了使本發明的目的、特徵和優點能夠更加明顯易懂，請參閱附圖。須知，本說明書所附圖式所繪示的結構、比例、大小等，均僅用以配合說明書所揭示的內容，以供熟悉此技術的人士瞭解與閱讀，並非用以限定本發明實施的限定條件，任何結構的修飾、比例關係的改變或大小的調整，在與本發明所能產生的功效及所能達成的目的相同或近似的情況下，均應仍落在本發明所揭示的技術內容能涵蓋的範圍內。The detection recipe setting and optimization method, device, electronic equipment and storage medium proposed by the present invention will be further described in detail below with reference to the accompanying drawings and specific implementation modes. The advantages and features of the present invention will become clearer from the following description. It should be noted that the drawings are in a very simplified form and use imprecise proportions, and are only used to conveniently and clearly assist in explaining the embodiments of the present invention. In order to make the objects, features and advantages of the present invention more apparent, please refer to the accompanying drawings. It should be noted that the structures, proportions, sizes, etc. shown in the drawings attached to this specification are only used to coordinate with the content disclosed in the specification for the understanding and reading of those familiar with this technology, and are not used to limit the implementation of the present invention. Conditions, any structural modifications, changes in proportions or adjustments in size should still fall within the technical content disclosed in the present invention, provided that they are the same as or similar to the effects that the present invention can produce and the purposes that can be achieved. within the scope that can be covered.

需要說明的是，在本文中，諸如第一和第二等之類的關係術語僅僅用來將一個實體或者操作與另一個實體或操作區分開來，而不一定要求或者暗示這些實體或操作之間存在任何這種實際的關係或者順序。而且，術語“包括”、“包含”或者其任何其他變體意在涵蓋非排他性的包含，從而使得包括一系列要素的過程、方法、物品或者設備不僅包括那些要素，而且還包括沒有明確列出的其他要素，或者是還包括為這種過程、方法、物品或者設備所固有的要素。在沒有更多限制的情況下，由語句“包括一個……”限定的要素，並不排除在包括所述要素的過程、方法、物品或者設備中還存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations are mutually exclusive. any such actual relationship or sequence exists between them. Furthermore, the terms "comprises," "comprises," or any other variations thereof are intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus that includes a list of elements includes not only those elements, but also those not expressly listed other elements, or elements inherent to the process, method, article or equipment. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of additional identical elements in a process, method, article, or apparatus that includes the stated element.

本發明的其中一個實施例提供了一種檢測配方設置與優化方法，具體地，請參考圖1，其示意性地給出了本發明一實施方式提供的檢測配方設置與優化方法的流程示意圖。如圖1所示，所述檢測配方設置與優化方法包括如下步驟： S100：對第一數據樣本進行標注，得到第二數據樣本；其中，所述第一數據樣本包括若干條檢測結果數據；所述第二數據樣本包括所述檢測結果數據以及每條所述檢測結果數據對應的標籤； S200：根據所述第二數據樣本，得到檢測對象的數據特徵分布資訊； S300：採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，並根據所述預設離群統計分析策略，確定檢測配方； S400：根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，設置或優化所述檢測配方的檢測參數的取值。 One embodiment of the present invention provides a detection recipe setting and optimization method. Specifically, please refer to FIG. 1 , which schematically provides a flow chart of the detection recipe setting and optimization method provided by an embodiment of the present invention. As shown in Figure 1, the detection recipe setting and optimization method includes the following steps: S100: Annotate the first data sample to obtain a second data sample; wherein the first data sample includes several pieces of detection result data; the second data sample includes the detection result data and each of the detection results The label corresponding to the data; S200: Obtain the data feature distribution information of the detection object according to the second data sample; S300: Use a preset outlier statistical analysis strategy to perform outlier statistical analysis on the data feature distribution information, obtain defect distribution boundary information, and determine the detection formula according to the preset outlier statistical analysis strategy; S400: Based on the defect distribution boundary information and the preset outlier statistical analysis strategy, set or optimize the values of the detection parameters of the detection formula through reverse derivation.

由此，本發明提供的檢測配方設置與優化方法，所述第一數據樣本包括若干條檢測結果數據，所述檢測結果數據包括大量輔助的調參資訊（比如所述檢測對象的基本資訊和特徵數據資訊，所述特徵數據資訊包括但不限於檢測結果指示的缺陷的灰度、形狀、紋理等資訊），通過數據標注可以區分真缺陷數據和噪擾數據，為後續有效利用歷史資訊進行數據分析和推理從而能夠獲取到準確的先驗知識提供了重要的依據，能夠提高檢測配方的檢測精度。進一步地，本發明提供的檢測配方設置與優化方法，檢測配方的策略及參數設置值是根據缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導得到。由此，本發明通過反向推導能夠同時推理出一套檢測參數（即同時調整出所有參數），參數之間的耦合關係也考慮在內，實現了檢測流程的快速建模，避免了反復調整參數，能夠顯著節約人力和時間成本。而且，針對新工藝缺陷檢測，無需用戶具備算法基礎也能設置或優化檢測配方的策略及檢測參數的取值。Therefore, in the detection recipe setting and optimization method provided by the present invention, the first data sample includes several pieces of detection result data, and the detection result data includes a large amount of auxiliary parameter adjustment information (such as the basic information and characteristics of the detection object). Data information, the characteristic data information includes but is not limited to the grayscale, shape, texture and other information of the defects indicated by the detection results). Through data annotation, real defect data and noise data can be distinguished, and historical information can be effectively used for subsequent data analysis. And reasoning can provide an important basis for obtaining accurate prior knowledge, which can improve the detection accuracy of detection formulas. Furthermore, in the detection recipe setting and optimization method provided by the present invention, the detection recipe strategy and parameter setting values are obtained through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy. Therefore, the present invention can deduce a set of detection parameters at the same time (that is, adjust all parameters at the same time) through reverse derivation. The coupling relationship between parameters is also taken into account, realizing rapid modeling of the detection process and avoiding repeated adjustments. parameters, which can significantly save labor and time costs. Moreover, for new process defect detection, users can set or optimize the detection recipe strategy and detection parameter values without having any algorithm foundation.

需要特別說明的是，所述檢測結果數據為所述檢測對象的歷史檢測結果數據。舉例而言，在初次設置缺陷檢測流程中使用的檢測配方的策略及檢測參數時，可以先隨機或人為選擇檢測配方的策略和檢測參數，獲取一定量的缺陷檢測數據，而該一定量的缺陷檢測數據為檢測結果數據（也即為第一樣本數據）。在對檢測配方的策略及檢測參數的配方優化時，所述檢測結果數據（也即為第一樣本數據）包括待優化的檢測配方歷史檢測的數據全部或其中一部分。為了便於理解和說明，下文所述的檢測結果數據為進行晶圓缺陷的歷史檢測數據，很顯然地，這並非本發明的限制，在其他的實施方式中，本發明提供的檢測配方設置與優化方法也可以適應于初檢測晶圓缺陷的其他的檢測配方，不再一一示例。It should be noted that the detection result data is the historical detection result data of the detection object. For example, when setting up the detection recipe strategy and detection parameters used in the defect detection process for the first time, you can first randomly or artificially select the detection recipe strategy and detection parameters to obtain a certain amount of defect detection data. The detection data is the detection result data (that is, the first sample data). When optimizing the strategy of the detection formula and the detection parameters, the detection result data (that is, the first sample data) includes all or part of the historical detection data of the detection formula to be optimized. For ease of understanding and explanation, the detection result data described below are historical detection data of wafer defects. Obviously, this is not a limitation of the present invention. In other embodiments, the detection recipe setting and optimization provided by the present invention The method can also be adapted to other detection formulas for initial detection of wafer defects, so no examples will be given one by one.

優選地，在其中一種優選實施方式中，所述檢測結果數據包括所述檢測對象的基本資訊和特徵數據資訊；其中，所述特徵數據資訊包括檢測結果在所述檢測對象上的位置資訊，以及所述檢測對象的工藝流程資訊、所述檢測結果的數據資訊的灰度資訊、形狀資訊和紋理資訊中的一種或多種。如本領域技術人員可以理解的，很顯然地，所述檢測結果的數據資訊也必然包括用以指示檢測結果的結論資訊（缺陷數據或非缺陷數據）。為了便於理解，所述檢測結果的數據資訊的具體示例將在下文與檢測結果的圖像資訊對比說明，在此，不再對檢測結果的數據資訊進行示例性闡述。由此可見，所述檢測結果數據包含了所述檢測對象的基本資訊和特徵數據資訊(比如nuisance的灰度、形狀、紋理等資訊)等輔助調參的資訊，且在後續的缺陷分布圖繪製和參數反向推理過程基於所述檢測結果數據，由此，本發明提供的檢測配方設置與優化方法，能夠提高檢測配方的檢測精度。Preferably, in one of the preferred embodiments, the detection result data includes basic information and characteristic data information of the detection object; wherein the characteristic data information includes location information of the detection result on the detection object, and One or more of the process flow information of the detection object, the grayscale information, the shape information and the texture information of the data information of the detection result. As those skilled in the art can understand, it is obvious that the data information of the detection results must also include conclusion information (defective data or non-defective data) used to indicate the detection results. In order to facilitate understanding, specific examples of the data information of the detection results will be described below in comparison with the image information of the detection results. Here, the data information of the detection results will not be exemplified. It can be seen that the detection result data includes the basic information and characteristic data information of the detection object (such as the grayscale, shape, texture and other information of nuisance) and other auxiliary parameter adjustment information, and will be used in the subsequent drawing of the defect distribution map. The parameter reverse reasoning process is based on the detection result data. Therefore, the detection formula setting and optimization method provided by the present invention can improve the detection accuracy of the detection formula.

優選地，在其中一種實施方式中，請參見圖2，其示意性地給出了數據樣本標注方法流程示意圖。從圖2可以看出，步驟S100中，所述對第一數據樣本進行標注，得到第二數據樣本，包括： S110：獲取所述第一數據樣本中每一條檢測結果數據對應的所述檢測對象的基本資訊； S120：對於每一條檢測結果數據，根據所述檢測對象的基本資訊和所述檢測結果在所述檢測對象上的位置資訊，獲取該條檢測結果數據在所述檢測對象上對應的原始資訊； S130：根據所述原始資訊，判斷所述檢測結果的數據資訊標出的缺陷是否為真缺陷，若是，則將該條檢測結果數據標記為真缺陷數據；若否，則將該條檢測結果數據標記為噪擾數據； S140：根據所有的所述檢測結果數據及每條所述檢測結果數據對應的標籤，得到所述第二數據樣本。 Preferably, in one of the implementations, please refer to Figure 2, which schematically shows a schematic flow chart of the data sample annotation method. As can be seen from Figure 2, in step S100, the first data sample is annotated to obtain the second data sample, including: S110: Obtain the basic information of the detection object corresponding to each piece of detection result data in the first data sample; S120: For each piece of detection result data, obtain the original information corresponding to the detection result data on the detection object based on the basic information of the detection object and the location information of the detection result on the detection object; S130: Based on the original information, determine whether the defect marked by the data information of the detection result is a true defect. If so, mark the detection result data as true defect data; if not, mark the detection result data as true defect data. Marked as noisy data; S140: Obtain the second data sample based on all the detection result data and the tag corresponding to each detection result data.

如此配置，本發明提供的檢測配方設置與優化方法，通過對所述第一數據樣本進行標注，可以準確地將檢測結果數據（歷史數據）中真正的缺陷數據和噪擾數據（nusiance，噪聲干擾數據）進行準確地區分，從而為後續獲取數據特徵分布資訊、進而根據所述數據特徵分布資訊獲取缺陷分布邊界資訊進一步進行反向推導提供準確的先驗知識，從而提高檢測配方的檢測精度。With such configuration, the detection recipe setting and optimization method provided by the present invention can accurately classify the real defect data and noise data (nusiance, noise interference) in the detection result data (historical data) by labeling the first data sample. Data) are accurately distinguished, thereby providing accurate prior knowledge for subsequent acquisition of data feature distribution information, and then obtaining defect distribution boundary information based on the data feature distribution information for further reverse derivation, thereby improving the detection accuracy of the detection formula.

需要特別說明的是，本領域的技術人員應該能夠理解，所述特徵數據資訊是對檢測對象執行缺陷檢測的全部檢測結果，包括缺陷數據和非缺陷數據。It should be noted that those skilled in the art should be able to understand that the characteristic data information is all detection results of defect detection on the detection object, including defect data and non-defect data.

作為應用本發明提供的檢測配方設置與優化方法的其中一種優選示例，以下以所述檢測對象為Wafer（晶圓）為例進行說明，很顯然地，所述第一數據樣本為所述Wafer的歷史檢測結果數據。更具體地，所述Wafer的基本資訊包括所述Wafer的編號、包含的Die（裸芯）個數以及每一個Die的基本資訊；所述Die的基本資訊包括該Die的Die編號和圖像資訊。對應地，步驟S120中，所述根據所述檢測對象的基本資訊和所述缺陷在所述檢測對象上的位置資訊，獲取該條檢測結果數據在所述檢測對象上對應的原始資訊，包括： S121：根據所述Wafer的基本資訊，獲取所述Wafer的每一個Die的Die編號及每一所述Die的基本資訊； S122：根據所述檢測結果在所述Die上的位置資訊以及所述Die的圖像資訊，獲取該條檢測結果數據在所述Die上對應的檢測結果的圖像資訊。 As one of the preferred examples of applying the detection recipe setting and optimization method provided by the present invention, the following description takes the detection object as a wafer (wafer) as an example. Obviously, the first data sample is the wafer. Historical test result data. More specifically, the basic information of the Wafer includes the number of the Wafer, the number of Dies (die) contained, and the basic information of each Die; the basic information of the Die includes the Die number and image information of the Die. . Correspondingly, in step S120, the original information corresponding to the detection result data on the detection object is obtained based on the basic information of the detection object and the location information of the defect on the detection object, including: S121: According to the basic information of the Wafer, obtain the Die number of each Die of the Wafer and the basic information of each Die; S122: According to the position information of the detection result on the Die and the image information of the Die, obtain the image information of the detection result corresponding to the detection result data on the Die.

為了便於更準確地理解本發明，以下對所述檢測結果的數據資訊和所述檢測結果的圖像資訊予以解釋說明，所述檢測結果的數據資訊包括在所述檢測結果數據中對所述檢測結果的圖像資訊的描述，而檢測結果的圖像資訊為所述檢測結果的數據資訊在所述檢測對象上對應的原始圖像，換句話說，所述檢測結果的數據資訊包括所述檢測結果的圖像資訊的數據表達。仍以晶圓為檢測對象舉例來說：若所述缺陷為紋理缺陷，則所述檢測結果的數據資訊記錄了所述紋理缺陷的紋理特徵，比如紋理的粗糙度等，而所述檢測結果的圖像資訊為所述紋理缺陷對應的原始圖像，由此，根據檢測結果的圖像資訊，可以對所述檢測結果的圖像資訊對應的所述檢測結果數據進行複判，是真缺陷數據還是噪擾數據。In order to facilitate a more accurate understanding of the present invention, the data information of the detection results and the image information of the detection results are explained below. The data information of the detection results includes the detection results in the detection result data. Description of the image information of the result, and the image information of the detection result is the original image corresponding to the data information of the detection result on the detection object. In other words, the data information of the detection result includes the detection result. Data representation of the resulting image information. Still taking the wafer as the detection object, for example: if the defect is a texture defect, the data information of the detection result records the texture characteristics of the texture defect, such as the roughness of the texture, etc., and the data information of the detection result The image information is the original image corresponding to the texture defect. Therefore, according to the image information of the detection result, the detection result data corresponding to the image information of the detection result can be re-judged as true defect data. Or noisy data.

具體地，請參見圖3，其示意性地給出了本發明一實施方式提供的對數據樣本進行缺陷標注的其中一種界面示意圖。從圖3可以看出，在所述缺陷標注的界面上，共有3個主功能區，包括Wafer展示窗口區、檢測數據列表窗口區以及缺陷（Defect）顯示區。具體地，所述Wafer展示窗口區用於圖示化顯示所述Wafer的基本資訊，包括但不限於Wafer上各個Die在所述Wafer上的位置及所述Die的編號。在所述Wafer展示窗口區下方，用戶可以選擇要進行缺陷標注的Die編號，根據用戶選擇的Die編號，在所述檢測數據列表窗口區會刷新選中的Die編號對應的Die的歷史檢測數據結果。由此，根據所述檢測數據列表窗口區內的檢測結果數據的列表，用戶可以逐條選擇所述檢測結果數據，在所述缺陷顯示區會顯示所述檢測結果數據對應的原始資訊（即所述檢測結果的圖像資訊為所述檢測結果在所述Die上的位置資訊指示的圖像資訊），由此，根據該原始資訊的各種特徵（紋理、大小、彎曲度、形狀等），可以通過人工複判或機器複判等方式進一步確認所述檢測結果的數據資訊指示的缺陷，是否為真缺陷，如果是，則將該條檢測結果數據標記為真缺陷數據（比如將在所述檢測數據列表窗口區中該條檢測結果數據的標籤標記為真缺陷，將人工判斷是否為真實缺陷欄對應的值置為是）；如果否，則將該條檢測結果數據標記為噪擾數據（比如將在所述檢測數據列表窗口區中該條檢測結果數據的標籤標記為假缺陷，將人工判斷是否為真實缺陷欄對應的值置為否）。一直重複上述過程，依次選擇每個Die編號，並依次將當前Die下的每個檢測數據結果進行人工標注，就能完成整個Wafer的檢測數據結果的標注，以此類推，就能將所述第一數據樣本標注，從而獲取第二數據樣本。Specifically, please refer to FIG. 3 , which schematically illustrates one of the interface diagrams for defect marking of data samples provided by an embodiment of the present invention. As can be seen from Figure 3, there are three main functional areas on the defect annotation interface, including the Wafer display window area, the detection data list window area and the defect display area. Specifically, the Wafer display window area is used to graphically display the basic information of the Wafer, including but not limited to the position of each Die on the Wafer and the number of the Die. Below the Wafer display window area, the user can select the Die number to be marked for defects. According to the Die number selected by the user, the historical detection data results of the Die corresponding to the selected Die number will be refreshed in the detection data list window area. . Therefore, according to the list of test result data in the test data list window area, the user can select the test result data one by one, and the original information corresponding to the test result data (that is, all the test result data) will be displayed in the defect display area. The image information of the detection result is the image information indicated by the position information of the detection result on the Die). Therefore, according to various characteristics of the original information (texture, size, curvature, shape, etc.), it is possible to Through manual re-judgment or machine re-judgment, etc., it is further confirmed whether the defect indicated by the data information of the test result is a true defect. If so, the test result data is marked as true defect data (for example, it will be included in the test result). The label of the detection result data in the data list window area is marked as a true defect, and the value corresponding to the manual judgment column of whether it is a real defect is set to yes); if not, the detection result data is marked as noise data (such as Mark the label of the detection result data in the detection data list window area as a false defect, and set the value corresponding to the column of manual judgment whether it is a real defect to No). Repeat the above process, select each Die number in turn, and manually annotate each detection data result under the current Die, then you can complete the annotation of the entire Wafer detection data results, and so on, you can add the above-mentioned first Die number. Label one data sample to obtain a second data sample.

需要特別說明的是，上文雖然以人工標注的方式為例說明所述第一數據樣本的標注方法，但很顯然地，這並非本發明的限制，在其他的實施方式中，也可以通過機器學習等方法進行標注，本發明對此不作限定。進一步地，如前所述，本發明提供的檢測配方設置與優化方法，雖然以晶圓為例作為檢測對象進行說明，但如本領域技術人員所能理解地，這僅是較佳實施方式的示例性說明，而非本發明的限制，在其他的實施方式中，所述檢測對象也可以為除晶圓之外的其他產品，包括但不限於鏡片、顯示屏、3D打印產品等等，不再一一示例說明。It should be noted that although manual labeling is used as an example to illustrate the labeling method of the first data sample, it is obvious that this is not a limitation of the present invention. In other implementations, machines can also be used to label the first data sample. Annotation may be performed by learning or other methods, which is not limited by the present invention. Furthermore, as mentioned above, although the detection recipe setting and optimization method provided by the present invention is explained by taking a wafer as an example as a detection object, as those skilled in the art can understand, this is only a preferred embodiment. To illustrate, but not to limit the present invention, in other embodiments, the detection object may also be other products besides wafers, including but not limited to lenses, display screens, 3D printing products, etc. Explain with examples one by one.

優選地，在其中一種示範性實施方式中，步驟S200中，所述根據所述第二數據樣本，得到所述檢測對象的數據特徵分布資訊，包括： S210：確定特徵數據軸和分割數據軸，並根據所述特徵數據軸和分割數據軸建立特徵空間；其中，所述特徵數據軸代表所述檢測結果數據的特徵數據資訊，所述分割數據軸代表分割特徵資訊；其中，所述分割特徵資訊包括除用於所述特徵數據軸之外的其他特徵數據資訊； S220：根據所述特徵空間對所述第二數據樣本進行排列，得到所述檢測對象的數據特徵分布資訊。 Preferably, in one of the exemplary implementations, in step S200, obtaining the data feature distribution information of the detection object based on the second data sample includes: S210: Determine the characteristic data axis and the segmented data axis, and establish a feature space based on the characteristic data axis and the segmented data axis; wherein the characteristic data axis represents the characteristic data information of the detection result data, and the segmented data axis represents Segmentation feature information; wherein the segmentation feature information includes other feature data information except for the feature data axis; S220: Arrange the second data samples according to the feature space to obtain data feature distribution information of the detection object.

如此配置，本發明提供的檢測配方設置與優化方法，通過所述特徵空間將所述第二數據樣本進行排列，其目的是為了使所述檢測結果數據在特徵空間中的分布呈現出某種趨勢，使真缺陷數據和噪擾數據的區分更加明顯，以便於獲取缺陷分布邊界資訊。So configured, the detection recipe setting and optimization method provided by the present invention arranges the second data samples through the feature space, and the purpose is to make the distribution of the detection result data in the feature space show a certain trend. , making the distinction between true defect data and noise data more obvious, so as to facilitate the acquisition of defect distribution boundary information.

優選地，所述特徵空間包括一個或多個所述特徵數據軸以及一個或多個所述分割數據軸。Preferably, the feature space includes one or more feature data axes and one or more segmentation data axes.

本發明提供的檢測配方與優化方法，所述特徵空間可以包括多個所述特徵數據軸以及多個所述分割數據軸，及所述特徵空間可以為多維特徵空間。比如，特徵數據軸為兩個，其中一個用於代表缺陷的灰度資訊，另一個用於代表缺陷的紋理資訊；分割數據軸的其中一個用於代表所述缺陷的形狀資訊，另一個代表所述缺陷的大小。由此，本發明提供的檢測配方與優化方法，由於參考了所述缺陷的更多的特徵資訊，因此，為進一步提升所述檢測配方的檢測精度奠定了良好的基礎。需要特別說明的是，上述僅是示例性說明而非本發明的限制，在實際應用中，所述特徵數據軸、所述分割數據軸及各自的個數應更根據實際需要合理選擇。In the detection formula and optimization method provided by the present invention, the feature space may include multiple feature data axes and multiple segmentation data axes, and the feature space may be a multi-dimensional feature space. For example, there are two feature data axes, one of which is used to represent the grayscale information of the defect, and the other is used to represent the texture information of the defect; one of the split data axes is used to represent the shape information of the defect, and the other is used to represent all the defects. Describe the size of the defect. Therefore, the detection formula and optimization method provided by the present invention refer to more characteristic information of the defects, thus laying a good foundation for further improving the detection accuracy of the detection formula. It should be noted that the above is only an illustrative description and not a limitation of the present invention. In practical applications, the characteristic data axis, the segmented data axis and their respective numbers should be reasonably selected according to actual needs.

優選地，在其中一種示範性實施方式中，步驟S220中，所述根據所述特徵空間對所述第二數據樣本進行排列，得到所述檢測對象的數據特徵分布資訊，包括： S221：將所述特徵數據軸作為橫軸，將所述分割數據軸作為縱軸，建立直角坐標系； S222：在所述直角坐標系內，在所述橫軸方向按照所述特徵數據軸代表的所述特徵數據資訊的特徵值大小、在所述縱軸方向按照所述分割數據軸代表的所述特徵數據資訊的特徵值大小對所述第二數據樣本進行排列，得到缺陷特徵分布圖。 Preferably, in one of the exemplary implementations, in step S220, the second data samples are arranged according to the feature space to obtain the data feature distribution information of the detection object, including: S221: Use the feature data axis as the horizontal axis and the segmented data axis as the vertical axis to establish a rectangular coordinate system; S222: In the rectangular coordinate system, in the horizontal axis direction, according to the characteristic value size of the characteristic data information represented by the characteristic data axis, and in the vertical axis direction, according to the characteristic value represented by the segmented data axis. The second data samples are arranged according to the characteristic value size of the characteristic data information to obtain a defect characteristic distribution map.

具體地，請參見圖4，其示意性地給出了其中一具體示例的檢測結果數據在二維特徵空間的分布示例圖。從圖4可以看出，該示例為橫軸表示特徵數據軸，縱軸表示分割數據軸形成的二維數據特徵分布圖。即坐標系內每個點的橫坐標表示特徵值大小，縱坐標表示對應的分割特徵值大小，如此，所有檢測結果數據的特徵值構成了整個特徵分布圖。Specifically, please refer to Figure 4, which schematically shows an example diagram of the distribution of detection result data in a two-dimensional feature space of one specific example. As can be seen from Figure 4, in this example, the horizontal axis represents the feature data axis, and the vertical axis represents the two-dimensional data feature distribution map formed by dividing the data axis. That is, the abscissa of each point in the coordinate system represents the size of the feature value, and the ordinate represents the size of the corresponding segmentation feature value. In this way, the feature values of all detection result data constitute the entire feature distribution map.

需要特別說明的是，如前所述，上述示例雖然以二維特徵空間分布為例進行說明，但是，在實際應用中，所述特徵數據軸和所述分割數據軸可以是多維的。即所述分割數據軸可以選擇多個分割值，以將檢測結果數據（即第二樣本數據）分為幾種不同的特徵分布。It should be noted that, as mentioned above, although the above example takes two-dimensional feature space distribution as an example, in actual applications, the feature data axis and the segmentation data axis may be multi-dimensional. That is, multiple segmentation values can be selected for the segmented data axis to divide the detection result data (ie, the second sample data) into several different feature distributions.

進一步地，本發明並不限制特徵空間的具體選取方法，在其中一種實施方式中，可使用特徵選擇算法進行特徵數據軸和分割數據軸的選擇從而自動選擇特徵空間；在其他的實施方式中，也可以手動進行特徵數據軸和分割數據軸的選擇，本發明對此不作任何限定。更具體地，所述特徵數據軸可以表示顏色、紋理、形狀、大小等資訊，所述分割數軸可以是經過訓練的均值圖等資訊。Furthermore, the present invention does not limit the specific selection method of the feature space. In one embodiment, a feature selection algorithm can be used to select the feature data axis and the segmentation data axis to automatically select the feature space; in other embodiments, The feature data axis and segmentation data axis can also be selected manually, and the present invention does not impose any limitation on this. More specifically, the feature data axis can represent information such as color, texture, shape, size, etc., and the segmentation axis can be information such as a trained mean map.

進一步地，作為其中一種優選實施方式，所述特徵空間選擇的標準為：所述分割數據軸可以將不同的工藝區域進行較好的區分，所述特徵數據軸可以使真缺陷數據和噪擾數據（噪聲點）之間有明顯的區別，最終目的是使檢測結果數據在特徵空間中的分布呈現出某種趨勢，使真正的缺陷和噪聲點的區分更為明顯。比如，對於晶圓缺陷的檢測結果數據，如果以特徵數據資訊中的形狀作為特徵數據軸比以特徵數據資訊中的紋理作為特徵數據軸，更能將所述檢測結果數據在所述特徵空間中將真正的缺陷和噪擾數據區分的更為明顯，則以所述特徵數據資訊中的形狀作為特徵數據軸，而不是以所述特徵數據資訊中的紋理作為特徵數據軸。可以理解地，所述特徵數據資訊中的形狀就不再作為所述分割數據軸。Further, as one of the preferred embodiments, the criteria for selecting the feature space are: the segmented data axis can better distinguish different process areas, and the feature data axis can make true defect data and noise data (noise points), the ultimate goal is to make the distribution of detection result data in the feature space show a certain trend, making the distinction between real defects and noise points more obvious. For example, for the detection result data of wafer defects, if the shape in the feature data information is used as the feature data axis rather than the texture in the feature data information as the feature data axis, the detection result data can be better placed in the feature space. To make the distinction between real defects and noise data more obvious, the shape in the feature data information is used as the feature data axis instead of the texture in the feature data information as the feature data axis. It can be understood that the shape in the feature data information is no longer used as the segmentation data axis.

優選地，在其中一種優選實施方式中，請參考圖5，其示意性地給出了本發明一實施方式提供的檢測配方設置與優化方法的流程示意圖。從圖5可以看出，步驟S300中，所述採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，包括：Preferably, in one of the preferred embodiments, please refer to FIG. 5 , which schematically provides a flow chart of a detection recipe setting and optimization method provided by an embodiment of the present invention. It can be seen from Figure 5 that in step S300, the preset outlier statistical analysis strategy is used to perform outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information, including:

判斷是否自動尋找缺陷分布邊界資訊，若是，則根據選擇的離群統計分析模型，對所述離群統計分析模型進行訓練，獲取缺陷分布邊界資訊；若否，則採用數據分割法對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊。Determine whether to automatically find the defect distribution boundary information. If so, train the outlier statistical analysis model according to the selected outlier statistical analysis model to obtain the defect distribution boundary information; if not, use the data segmentation method to analyze the data. Conduct outlier statistical analysis on feature distribution information to obtain defect distribution boundary information.

具體地，請參見圖6，其中，圖6為應用本發明提供的離群統計分析模型得到的缺陷分布邊界資訊示意圖。圖6中，feature1為分割數據軸，feartrue2為特徵數據軸。從圖6可以看出，在該示例中，所述缺陷分布邊界資訊3為一條曲線。由此可見，本發明提供的檢測配方設置與優化方法，通過根據所述檢測結果數據和所述數據特徵分布資訊，確定所述預設離群統計分析策略，並根據確定的所述預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，能夠使得所述缺陷分布邊界資訊能將真缺陷數據2和噪擾數據1較好的進行分離，即所述缺陷分布邊界資訊能夠使得在不產生漏檢缺陷的情況下盡可能的減少過檢問題，以濾除掉更多的噪聲數據。從而能夠保證後續依據所述缺陷分布邊界資訊進行反向推導確定的檢測配方不出現漏檢和過檢，從而提高檢測流程的缺陷檢測精度。Specifically, please refer to FIG. 6 , which is a schematic diagram of defect distribution boundary information obtained by applying the outlier statistical analysis model provided by the present invention. In Figure 6, feature1 is the segmentation data axis, and feartrue2 is the feature data axis. It can be seen from Figure 6 that in this example, the defect distribution boundary information 3 is a curve. It can be seen that the detection formula setting and optimization method provided by the present invention determines the preset outlier statistical analysis strategy based on the detection result data and the data feature distribution information, and determines the preset outlier statistical analysis strategy based on the determined preset outlier. The group statistical analysis strategy performs outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information, which can enable the defect distribution boundary information to better separate the true defect data 2 and the noise data 1, that is, The defect distribution boundary information can reduce over-inspection problems as much as possible without causing missed defects, so as to filter out more noise data. This can ensure that the subsequent inspection formula determined by reverse derivation based on the defect distribution boundary information will not cause missed inspections or over-inspections, thereby improving the defect detection accuracy of the inspection process.

需要特別說明的是，針對於同樣的第二樣本數據使用同樣的特徵空間，若採用的離群統計分析策略不同，則得到的所述缺陷分布邊界資訊可能不相同，由此，後續的反向推導以及檢測配方的策略都與所述離群統計分析策略緊密相關，針對圖6使用的相同的第二樣本數據，如採用數據分割法，則所述缺陷分布邊界資訊的形狀與圖6中完全不同，具體請參見下文的描述，為了避免贅述，此處暫不展開。It should be noted that for the same second sample data using the same feature space, if different outlier statistical analysis strategies are adopted, the defect distribution boundary information obtained may be different. Therefore, the subsequent reverse The strategies for deriving and detecting recipes are closely related to the outlier statistical analysis strategy. For the same second sample data used in Figure 6, if the data segmentation method is used, the shape of the defect distribution boundary information is completely the same as that in Figure 6. Different, please refer to the description below for details. To avoid redundancy, we will not go into details here.

為了便於理解和說明，以下均以二維數據分布為例進行說明，首先，對所述離群統計分析模型予以詳細說明，然後，再對數據分割法進行說明。In order to facilitate understanding and explanation, the following description takes two-dimensional data distribution as an example. First, the outlier statistical analysis model is explained in detail, and then the data segmentation method is explained.

具體地，所述對所述離群統計分析模型進行訓練包括：根據所述檢測結果數據和所述數據特徵分布資訊，對選定的所述離群統計分析模型進行訓練，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第一預設條件。Specifically, training the outlier statistical analysis model includes: training the selected outlier statistical analysis model according to the detection result data and the data feature distribution information until the detection result is obtained. The defect distribution boundary information of the object satisfies the first preset condition.

更具體地，如本領域技術人員可以理解地，可以根據所述檢測結果數據和所述數據特徵分布資訊進行綜合分析進行離群統計分析模型的選擇，所述離群分析統計模型包括但不限於基於統計的離群算法（如3σ原則）、基於距離和鄰近度的聚類算法（如K-means等）、基於密度的離群算法（如DBSCAN等）、基於樹的離群分析算法（如孤立森林等）。需要特別說明的是，算法模型的選擇是非常關鍵的，算法模型不同則意味著離群邊界形狀的不同，一個最優的算法模型能使數據集的訓練既不發生欠擬合也不會出現過擬合。比如，若在所述特徵空間中所述第二樣本數據的分布更接近正態分布，則所述離群分析統計模型優選基於統計的離群算法（如3σ原則），再比如，若在所述特徵空間中所述第二樣本數據的分布真缺陷數據以及噪擾數據之間距離較近，而缺陷數據和噪擾數據之間距離較遠，則所述離群分析統計模型優選基於距離和鄰近度的聚類算法。本領域的技術人員應該能夠據此舉一反三，在此不再一一贅述。More specifically, as those skilled in the art can understand, comprehensive analysis can be performed based on the detection result data and the data feature distribution information to select an outlier statistical analysis model. The outlier analysis statistical model includes but is not limited to Statistics-based outlier algorithms (such as the 3σ principle), distance and proximity-based clustering algorithms (such as K-means, etc.), density-based outlier algorithms (such as DBSCAN, etc.), tree-based outlier analysis algorithms (such as isolated forest, etc.). It should be noted that the choice of algorithm model is very critical. Different algorithm models mean different shapes of outlier boundaries. An optimal algorithm model can make the training of the data set neither underfitting nor outliers occur. Overfitting. For example, if the distribution of the second sample data in the feature space is closer to a normal distribution, the outlier analysis statistical model is preferably based on a statistical outlier algorithm (such as the 3σ principle). For another example, if The distribution of the second sample data in the feature space is close to the true defect data and the noise data, but the distance between the defect data and the noise data is far, then the outlier analysis statistical model is preferably based on the distance sum Proximity clustering algorithm. Those skilled in the art should be able to draw inferences based on this and will not go into details here.

進一步地，本領域的技術人員應該能夠理解，所述離群分析統計模型的目的是找到最優化的邊界結果，在確定離群分析統計模型之後，應該使用所述第二樣本數據對選定的所述離群分析統計模型進行訓練，通過不斷學習和目標優化過程從而使得模型訓練的結果可以找到最佳的所述分割數據軸的拐點以及根據所述特徵數據軸將真缺陷和噪擾數據（干擾噪聲點）進行區別。由此，在所述離群分析統計模型訓練完成之後，得到一個邊界結果（即缺陷分布邊界資訊），請參見圖6，如圖6所示，缺陷分布邊界曲線3（即缺陷分布邊界資訊）能將缺陷數據和噪聲數據較好的進行分離，保證檢測結果不漏檢也不產生過檢的問題。即所述第一預設條件為缺陷分布邊界資訊能夠將所述第二樣本中的標籤為真缺陷數據和標籤為噪擾數據的檢測結果數據進行區分。Further, those skilled in the art should be able to understand that the purpose of the outlier analysis statistical model is to find the optimal boundary result. After determining the outlier analysis statistical model, the second sample data should be used to analyze all selected The outlier analysis statistical model is trained. Through continuous learning and target optimization processes, the model training results can find the best inflection point of the segmented data axis and classify the true defects and noise data (interference data) according to the characteristic data axis. noise points) to distinguish. Therefore, after the training of the outlier analysis statistical model is completed, a boundary result (i.e., defect distribution boundary information) is obtained. Please refer to Figure 6. As shown in Figure 6, defect distribution boundary curve 3 (i.e., defect distribution boundary information) It can better separate defect data and noise data to ensure that the detection results will not miss detection or cause over-inspection problems. That is, the first preset condition is that the defect distribution boundary information can distinguish the detection result data labeled as true defect data and the detection result data labeled as noise data in the second sample.

進一步地，所述採用數據分割法對所述數據特徵分布資訊進行離群統計分析包括：根據所述檢測結果數據和所述數據特徵分布資訊，在所述特徵數據軸和/或所述分割數據軸上獲取至少一個第一分割閾值；並根據所述第一分割閾值獲取所述缺陷邊界資訊，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第二預設條件。Further, using the data segmentation method to perform outlier statistical analysis on the data feature distribution information includes: based on the detection result data and the data feature distribution information, on the feature data axis and/or the segmented data At least one first segmentation threshold is obtained on the axis; and the defect boundary information is obtained according to the first segmentation threshold until the obtained defect distribution boundary information of the detection object satisfies the second preset condition.

作為其中一種優選實施方式，所述數據分割法包括在所述特徵空間採用手動分割的方式，以獲取所述第一分割閾值。如本領域技術人員可以理解地，本發明並不限定所述數據分割法的具體實施方式，在其他的實施方式中，也可以通過數據分割算法來獲取所述第一分割閾值。As one of the preferred embodiments, the data segmentation method includes manually segmenting the feature space to obtain the first segmentation threshold. As those skilled in the art can understand, the present invention is not limited to the specific implementation of the data segmentation method. In other implementations, the first segmentation threshold can also be obtained through a data segmentation algorithm.

為了便於理解和說明，以下以二維數據分布、手動分割為例，對所述數據分割法予以說明如下： S321：根據所述數據特徵分布資訊，以及標籤為真缺陷數據和標籤為噪擾數據的檢測結果數據分布的一致性，確定所述分割數據軸的第一分割閾值。 S322：根據所述數據特徵分布資訊，以及標籤為真缺陷數據和標籤為噪擾數據的檢測結果數據分布的一致性，確定所述特徵數據軸的第二分割閾值； S323：根據所述分割數據軸的第一分割閾值和所述特徵數據軸的第二分割閾值，得到所述檢測對象的缺陷分布邊界資訊。 In order to facilitate understanding and explanation, the following takes two-dimensional data distribution and manual segmentation as an example to explain the data segmentation method as follows: S321: Determine the first segmentation threshold of the segmented data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data. S322: Determine the second segmentation threshold of the feature data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data; S323: Obtain the defect distribution boundary information of the detection object based on the first segmentation threshold of the segmentation data axis and the second segmentation threshold of the feature data axis.

具體地，步驟S321中，以所述數據特徵分布資訊作為輸入，在這個特徵分布圖中對分割數據軸進行分割，分割的標準是檢測結果數據分布的一致性，將具有一致分布的數據作為一個簇，找到簇與簇之間的分割值，使不同工藝的數據能夠進行區分。所述一致分布包括檢測結果數據的特徵數據資訊的分布規律，包括但不限於在特徵空間中的分布密度、空間點的相對位置關係等，以此進行分割軸和特徵軸閾值的確定依據，比如，在其中一個示例中，共設置兩個第一分割閾值segment_value1和segment_value2。Specifically, in step S321, the data feature distribution information is used as input, and the segmented data axis is segmented in this feature distribution map. The segmentation standard is the consistency of the detection result data distribution, and the data with consistent distribution is regarded as a Cluster, find the segmentation value between clusters, so that the data of different processes can be distinguished. The consistent distribution includes the distribution law of the characteristic data information of the detection result data, including but not limited to the distribution density in the characteristic space, the relative position relationship of the spatial points, etc., and is based on this to determine the segmentation axis and the characteristic axis threshold, such as , in one of the examples, two first segmentation thresholds segment_value1 and segment_value2 are set.

對應地，步驟S322中，在特徵分布中對所述特徵數據軸進行第二分割閾值確定。由於在特徵分布中已經對缺陷數據點進行了標注，因此，第二分割閾值確定的原則是讓噪擾數據和真缺陷數據分離的儘量遠，這樣可以在保證檢測結果數據不發生漏檢的同時也盡可能減少過檢的發生。即所述第二預設條件優選為所述缺陷邊界資訊能夠將所述真缺陷數據和所述噪擾數據分離。Correspondingly, in step S322, a second segmentation threshold is determined for the feature data axis in the feature distribution. Since the defect data points have been marked in the feature distribution, the principle of determining the second segmentation threshold is to separate the noise data and the real defect data as far as possible, so as to ensure that the detection result data will not be missed at the same time. Also minimize the occurrence of over-inspections. That is, the second preset condition is preferably that the defect boundary information can separate the true defect data and the noise data.

由此，在分別對分割數據軸的第一分割閾值和所述特徵數據軸的第二分割閾值確定後，即可得到離群統計分析的缺陷分別邊界資訊。下圖仍是以二維特徵數據分布為例，將手動分割出來的缺陷分布邊界資訊進行了展示。在所述分割軸上使用兩個第一分割閾值segment_value1和segment_value2進行了檢測結果數據的分割，將所有的檢測結果數據分為三段不同的分布。在每一個分割閾值的區間，在所述特徵數據軸上使用三條不同的第二分割閾值將真缺陷和噪擾數據進行區分，得到最終的缺陷分布邊界資訊。即所述缺陷分布邊界資訊包括由兩個第一分割閾值segment_value1和segment_value2形成的2條平行於所述特徵數據軸featrue1的直線，以及分別位於特徵數據軸featreu1、所述第一分割閾值segment_value1和segment_value2之間，且分別與所述特徵數據軸featreu1和所述第一分割閾值segment_value1相交的第一線段、所述第一分割閾值segment_value1和segment_value2相交的第二線段以及與所述第一分割閾值segment_value2相交且沿著所述分割數據軸feature2向上延伸的第三直線。Therefore, after the first segmentation threshold of the segmentation data axis and the second segmentation threshold of the feature data axis are determined respectively, the defect separation boundary information of the outlier statistical analysis can be obtained. The figure below still takes the two-dimensional feature data distribution as an example to display the manually segmented defect distribution boundary information. The detection result data is segmented using two first segmentation thresholds segment_value1 and segment_value2 on the segmentation axis, and all the detection result data is divided into three different distributions. In each segmentation threshold interval, three different second segmentation thresholds are used on the feature data axis to distinguish true defects from noise data, and the final defect distribution boundary information is obtained. That is, the defect distribution boundary information includes two straight lines parallel to the feature data axis featureu1 formed by the two first segmentation thresholds segment_value1 and segment_value2, and are respectively located on the feature data axis featreu1 and the first segmentation thresholds segment_value1 and segment_value2. between the first line segment that intersects the feature data axis featreu1 and the first segmentation threshold segment_value1, the second line segment that intersects the first segmentation thresholds segment_value1 and segment_value2, and the first segmentation threshold segment_value2 A third straight line that intersects and extends upward along the segmented data axis feature2.

優選地，在其中一種示範性實施方式中，所述採用預設離群統計分析策略還包括：數據分割和模型學習相結合的離群統計分析策略。所述數據分割和模型學習相結合的離群統計分析策略包括：根據所述數據特徵分布資訊，獲取標籤為真缺陷的所述檢測結果數據在所述分割數據軸上的至少一個第一分割閾值；並根據所述第一分割閾值和所述數據特徵分布資訊，對選定的所述離群統計分析模型進行訓練，直至得到的所述檢測對象的缺陷分布邊界資訊滿足第三預設條件。Preferably, in one of the exemplary implementations, the use of a preset outlier statistical analysis strategy further includes: an outlier statistical analysis strategy that combines data segmentation and model learning. The outlier statistical analysis strategy that combines data segmentation and model learning includes: obtaining at least one first segmentation threshold on the segmentation data axis of the detection result data labeled as a true defect based on the data feature distribution information. ; And according to the first segmentation threshold and the data feature distribution information, train the selected outlier statistical analysis model until the obtained defect distribution boundary information of the detection object meets the third preset condition.

如此配置，本發明提供的檢測配方設置與優化方法，在獲取離群分布邊界資訊時，通過數據分割和模型學習相結合的離群統計分析策略，可以進一步減少機器學習模型訓練的不確定性，使機器學習模型的輸入有一定的約束條件，將手動分割的結果作為約束條件，從而能夠進一步提高缺陷邊界分布資訊獲取的效率。With such configuration, the detection recipe setting and optimization method provided by the present invention can further reduce the uncertainty of machine learning model training through an outlier statistical analysis strategy that combines data segmentation and model learning when obtaining outlier distribution boundary information. The input of the machine learning model has certain constraints, and the results of manual segmentation are used as constraints, which can further improve the efficiency of obtaining defect boundary distribution information.

所述第三預設條件優選為在保證檢測結果數據不發生漏檢的同時也盡可能減少過檢的發生，即所述第二預設條件優選為所述缺陷邊界資訊能夠將所述真缺陷數據和所述噪擾數據分離或所述離群統計分析模型的訓練次數達到預設值。The third preset condition is preferably to ensure that the detection result data does not miss detection while also minimizing the occurrence of over-inspection. That is, the second preset condition is preferably that the defect boundary information can reduce the true defect to The data and the noise data are separated or the number of training times of the outlier statistical analysis model reaches a preset value.

如本領域技術人員可以理解地，與數據分割法不同，採用數據分割和模型學習相結合的離群統計分析策略得到的缺陷分布邊界資訊與上述採用數據分割法得到的缺陷分布邊界資訊不同，採用數據分割和模型學習相結合的離群統計分析策略得到的缺陷分布邊界資訊包括由兩個第一分割閾值segment_value1和segment_value2形成的2條平行於所述特徵數據軸featrue1的直線，以及分別位於特徵數據軸featreu1、所述第一分割閾值segment_value1和segment_value2形成的3個區間的包圍所述真缺陷數據的封閉曲線。由於採用的離群統計分析策略不同，得到的缺陷邊界分布資訊截然不同，但很顯然地，不管採用何種離群統計分析策略，得到的所述缺陷邊界分布資訊均能將所述檢測結果數據中的真缺陷數據和噪擾數據準確地區分。如前所述，基於此，本發明並不限定所述離群統計分析策略的具體實現方式。As those skilled in the art can understand, unlike the data segmentation method, the defect distribution boundary information obtained by using the outlier statistical analysis strategy that combines data segmentation and model learning is different from the defect distribution boundary information obtained by the above-mentioned data segmentation method. The defect distribution boundary information obtained by the outlier statistical analysis strategy that combines data segmentation and model learning includes two straight lines parallel to the feature data axis featureure1 formed by the two first segmentation thresholds segment_value1 and segment_value2, and two straight lines located on the feature data respectively. The three intervals formed by axis featreu1, the first segmentation thresholds segment_value1 and segment_value2 are closed curves surrounding the true defect data. Due to the different outlier statistical analysis strategies used, the defect boundary distribution information obtained is completely different. However, it is obvious that no matter what outlier statistical analysis strategy is used, the defect boundary distribution information obtained can be compared with the detection result data. Accurately distinguish between true defect data and noise data. As mentioned above, based on this, the present invention does not limit the specific implementation of the outlier statistical analysis strategy.

另外，所述數據分割和模型學習相結合的離群統計分析策略中的數據分割法和模型學習的詳細內容，請參見上文中有關數據分割法和離群統計分析模型的詳細說明，為了避免贅述，在此不再詳述。In addition, for details on the data segmentation method and model learning in the outlier statistical analysis strategy that combines data segmentation and model learning, please refer to the detailed description of the data segmentation method and outlier statistical analysis model above. In order to avoid redundancy , will not be described in detail here.

優選地，在其中一種示範性實施方式中，請參見圖7，其示意性地給出了圖1中步驟S400的詳細流程示意圖。從圖7可以看出，步驟S400中，所述根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，確定設置或優化檢測配方的檢測參數的取值，包括： S410：根據所述預設離群統計分析策略，確定反向推導策略； S420：根據所述反向推導策略，確定所述反向推導策略的輸入數據資訊； S430：根據所述輸入數據資訊，確定所述檢測結果數據的數據分布模型； S440：根據所述數據分布模型和所述缺陷分布邊界資訊，確定所述檢測配方的檢測參數； S450：根據所述檢測配方的策略和所述反向推導的輸入數據資訊，設置或優化所述檢測配方的檢測參數的取值。 Preferably, in one of the exemplary implementations, please refer to FIG. 7 , which schematically provides a detailed flow chart of step S400 in FIG. 1 . It can be seen from Figure 7 that in step S400, based on the defect distribution boundary information and the preset outlier statistical analysis strategy, through reverse derivation, the values of the detection parameters for setting or optimizing the detection formula are determined, including : S410: Determine the reverse derivation strategy according to the preset outlier statistical analysis strategy; S420: Determine the input data information of the reverse derivation strategy according to the reverse derivation strategy; S430: Determine the data distribution model of the detection result data according to the input data information; S440: Determine the detection parameters of the detection formula according to the data distribution model and the defect distribution boundary information; S450: Set or optimize the values of detection parameters of the detection recipe according to the strategy of the detection recipe and the input data information of the reverse derivation.

由此，與現有技術中正向的參數設置採用的根據正向調參反饋的結果，進行相應參數的調整（可能只調整一個或兩個檢測參數）相比，本發明提供的檢測配方設置與優化方法，採用逆向推導的方式確定檢測配方策略，並根據缺陷邊界分布資訊來反向推理出所述檢測配方的所有參數設置值（關鍵參數，比如數據密度、數據稀疏距離和/或公差範圍等），將檢測流程的參數之間的耦合關係也考慮在內，從而避免了反復調參過程；而且調參過程根據用戶標注結果，用戶無需具有先驗知識，也可自動推理出一套相對準確的檢測流程的參數，一次性地將所有檢測參數均調到最優水平，在提高檢測流程調參效率的同時，提高了檢測配方的檢測精度。Therefore, compared with the forward parameter setting in the prior art, which adopts the adjustment of corresponding parameters based on the results of forward parameter adjustment feedback (maybe only one or two detection parameters are adjusted), the detection formula setting and optimization provided by the present invention Method, use reverse derivation to determine the detection recipe strategy, and use the defect boundary distribution information to reversely infer all parameter settings of the detection recipe (key parameters, such as data density, data sparse distance and/or tolerance range, etc.) , the coupling relationship between the parameters of the detection process is also taken into account, thereby avoiding the repeated parameter adjustment process; and the parameter adjustment process is based on the user's annotation results, the user does not need to have prior knowledge, and can automatically deduce a relatively accurate set of parameters. For the parameters of the detection process, all detection parameters are adjusted to the optimal level at one time, which not only improves the efficiency of parameter adjustment in the detection process, but also improves the detection accuracy of the detection formula.

更具體地，請參見圖8，其示意性地給出了應用本發明提供的檢測配方設置與優化方法進行反向推導的一具體示例圖。從圖8可以看出，本發明提供的檢測配方設置與優化方法，所述離群統計分析策略、所述反向推導策略以及所述檢測流程的參數設置值是緊密相關的：即所述反向推導的策略和檢測配方的策略與獲取所述缺陷邊界分布資訊的所述離群統計分析策略的核心是一致的。舉例而言，若採用離群分割法作為離群統計分析的策略，則反向推導以及檢測配方的策略的基本原理也應該與所述離群分割法的基本原理一致。More specifically, please refer to FIG. 8 , which schematically shows a specific example of reverse derivation using the detection recipe setting and optimization method provided by the present invention. It can be seen from Figure 8 that in the detection recipe setting and optimization method provided by the present invention, the outlier statistical analysis strategy, the reverse derivation strategy and the parameter setting values of the detection process are closely related: that is, the reverse derivation strategy The core of the strategy of directional derivation and detection of recipes is consistent with the outlier statistical analysis strategy of obtaining the defect boundary distribution information. For example, if the outlier segmentation method is used as the strategy for outlier statistical analysis, the basic principles of the strategy for reverse derivation and detection of recipes should also be consistent with the basic principles of the outlier segmentation method.

為了便於理解本發明，以下分別以採用數據分割法作為離群統計分析策略、基於高斯模型的離群統計分析策略以及機器學習的離群統計分析策略為例，對反向推導獲取檢測配方的參數設置值的過程予以詳細說明。In order to facilitate the understanding of the present invention, the following uses the data segmentation method as the outlier statistical analysis strategy, the outlier statistical analysis strategy based on Gaussian model and the outlier statistical analysis strategy of machine learning as examples to perform reverse derivation to obtain the parameters of the detection formula. The process of setting values is explained in detail.

一、數據分割法，反向推導新的數據流程和參數設置值1. Data segmentation method, reversely deriving new data processes and parameter setting values

在對運用數據分割法的基本原理進行反向推導以獲取檢測配方和參數設置值的具體步驟具體說明之前，先對該方法的核心思想說明如下：Before describing the specific steps of reverse derivation of the basic principles of the data segmentation method to obtain the detection formula and parameter setting values, the core idea of the method is explained as follows:

為了便於理解本發明，請結合圖9，其示意性地給出了本實施例一實施方式提供的其中一種檢測結果數據的數據密度分布示意圖。該方法的基本思想在於通過將特徵分布圖中檢測結果數據的點（所述檢測結果數據的特徵值）密度大於第一閾值的區域定義為正常（normal）區域，即將正常區域表示為和數據密度相關的函數。由此，數據密度data_density大於所述第一閾值的所有數據點（檢測結果數據的特徵值）均是正常的，那麼數據密度data_density就是需要反向推理的其中一項檢測參數。進一步地，數據密度小於或等於所述第一閾值且大於第二閾值的區域定義為噪擾（nuisance）區域，噪擾區域的檢測參數表示處於該區域的檢測結果數據包含噪聲，而這些噪聲是允許的誤差（即由於工藝誤差和噪聲影響會產生噪擾區域），而不屬缺陷數據。即認為噪擾區域是在正常區域的基礎上增加一個公差值（位移參數）用來描述噪擾區域，用下式表示： nuisance_threshold = f1(data_density) （1） In order to facilitate understanding of the present invention, please refer to FIG. 9 , which schematically provides a schematic diagram of the data density distribution of one of the detection result data provided in an implementation manner of this embodiment. The basic idea of this method is to define the area where the density of the detection result data points (the characteristic value of the detection result data) in the feature distribution map is greater than the first threshold as a normal (normal) area, that is, the normal area is expressed as the sum of the data density related functions. Therefore, all data points (feature values of detection result data) whose data density data_density is greater than the first threshold are normal, then data density data_density is one of the detection parameters that requires reverse inference. Further, an area where the data density is less than or equal to the first threshold and greater than the second threshold is defined as a nuisance area. The detection parameters of the nuisance area indicate that the detection result data in this area contains noise, and these noises are Allowable errors (that is, noisy areas due to process errors and noise effects) are not defective data. That is to say, the noise area is considered to be a tolerance value (displacement parameter) added to the normal area to describe the noise area, expressed by the following formula: nuisance_threshold = f1(data_density) (1)

由於在離群統計分析中，已經對真缺陷數據進行了標注（即獲得了噪擾區域和真缺陷區域的邊界），因此，可以根據缺陷邊界分布資訊將位移參數（公差值）進行反向推理得到。將數據密度小於或等於第二閾值（即噪擾區域之外）的區域定位為真缺陷（defect）區域，具體地，可以通過下式表示： boundary_threshold = f2(inspection_data) （2） defect_threshold = f3(boundary_threshold) （3） offset_parameter = abs (defect_threshold - nuisance_threshold) （4） Since the true defect data has been annotated in the outlier statistical analysis (that is, the boundaries between the noise area and the true defect area have been obtained), the displacement parameters (tolerance values) can be reversed based on the defect boundary distribution information. inferred. Locate the area where the data density is less than or equal to the second threshold (that is, outside the noise area) as the true defect area. Specifically, it can be expressed by the following formula: boundary_threshold = f2(inspection_data) (2) defect_threshold = f3(boundary_threshold) (3) offset_parameter = abs (defect_threshold - nuisance_threshold) (4)

式中，boundary threshold為離群統計分析算法得到的缺陷分布邊界結果，defect_threshold為所述缺陷分布邊界boundary_threshold相關的函數，最後利用defect_threshold和nuisance_threshold可以將位移參數offset_parameter）計算出來。In the formula, boundary threshold is the defect distribution boundary result obtained by the outlier statistical analysis algorithm, defect_threshold is a function related to the defect distribution boundary boundary_threshold, and finally the displacement parameter offset_parameter can be calculated using defect_threshold and nuisance_threshold.

根據上述分析可知，作為其中一種優選實施方式，若所述預設離群統計分析策略為數據分割法，則通過以下步驟獲取所述檢測配方的位移參數：步驟A1：根據所述數據分割法，將統計所述檢測對象的檢測結果數據的數據分布密度作為所述反向推導策略。步驟A2：根據所述統計數據分布密度的反向推導策略，將所述檢測對象的所有檢測結果數據作為所述輸入數據資訊。步驟A3：根據所有檢測結果數據，假設所有的所述檢測結果數據的特徵數據資訊的特徵值在特徵空間的數據分布密度分為正常區域、噪擾區域和真缺陷區域；所述正常區域為數據分布密度大於第一密度閾值的區域，噪擾區域為數據密度小於或等於所述第一密度閾值且大於第二密度閾值的區域，真缺陷區域為數據密度小於或等於所述第二密度閾值的區域。步驟A4：根據所有檢測結果數據和所有檢測結果數據的標籤，計算所述第一密度閾值和所述第二密度閾值；其中，所述第一密度閾值大於所述第二密度閾值；步驟A5：根據所述第一密度閾值、所述第二密度閾值和所述缺陷分布邊界資訊，計算所述檢測配方的位移參數。 According to the above analysis, as one of the preferred implementation methods, if the preset outlier statistical analysis strategy is the data segmentation method, the displacement parameters of the detection formula are obtained through the following steps: Step A1: According to the data segmentation method, count the data distribution density of the detection result data of the detection object as the reverse derivation strategy. Step A2: According to the reverse derivation strategy of the statistical data distribution density, use all detection result data of the detection object as the input data information. Step A3: Based on all the detection result data, it is assumed that the data distribution density of the characteristic data information of all the detection result data in the feature space is divided into normal areas, noise areas and true defect areas; the normal area is the data The area where the distribution density is greater than the first density threshold. The noise area is the area where the data density is less than or equal to the first density threshold and greater than the second density threshold. The true defect area is the area where the data density is less than or equal to the second density threshold. area. Step A4: Calculate the first density threshold and the second density threshold according to all detection result data and the labels of all detection result data; wherein the first density threshold is greater than the second density threshold; Step A5: Calculate the displacement parameter of the detection formula according to the first density threshold, the second density threshold and the defect distribution boundary information.

更具體地，為了更清楚地理解本發明，接下來以晶圓宏缺陷檢測為例對採用數據分割法獲取缺陷邊界分布資訊、進行反向推導獲取檢測流程的參數設置值進行詳細說明。More specifically, in order to understand the present invention more clearly, next, taking wafer macro-defect detection as an example, the data segmentation method is used to obtain defect boundary distribution information, and reverse derivation is performed to obtain parameter setting values of the inspection process.

參見圖10，其示意性地給出了本實施例一實施方式提供的標準分割軸的平均灰度級範圍內的真缺陷分布示意圖。如圖10所示，假設在標準（standard）分割軸（即分割數據軸，對應圖中的縱軸Feature2）的每個灰度級範圍內都存在缺陷，並對真缺陷數據和噪擾數據進行了標注。其中，所述標準分割軸為通過N（N可以根據實際需要設定，比如圖10中N=10，本發明對此不作限定）張標準的無缺陷工藝數據圖統計產生的平均值圖，即使用N張標準圖對應像素灰度的平均值作為最終結果。具體地，請參見圖11(a)-圖11(c)以及圖12，其中，圖11(a)為本發明一實施方式提供的多張測試圖示例圖，圖11(b)為圖11(a)中多張測試圖生成的平均值圖示例圖，圖11(c)為圖11(a)中多張測試圖生成的標準差圖示例圖，圖12為應用本發明提供的灰度動態閾值示意圖。圖中，像素點A為測試圖中的一個像素點，像素點A1和像素點A2分別為像素點A在均值圖和標準差圖中對應的像素點。Referring to FIG. 10 , a schematic diagram of true defect distribution within the average gray level range of the standard segmentation axis provided by an implementation of this embodiment is provided. As shown in Figure 10, it is assumed that there are defects in each gray level range of the standard segmentation axis (that is, the segmentation data axis, corresponding to the vertical axis Feature2 in the figure), and the true defect data and noise data are marked. Wherein, the standard dividing axis is an average value graph generated by the statistics of N (N can be set according to actual needs, such as N=10 in Figure 10, the present invention is not limited to this) standard defect-free process data graphs, that is, using The average value of the corresponding pixel grayscales of N standard images is used as the final result. Specifically, please refer to Figure 11(a)-Figure 11(c) and Figure 12. Figure 11(a) is an example of multiple test charts provided by an embodiment of the present invention, and Figure 11(b) is a diagram. Figure 11(a) is an example of the average value chart generated by multiple test charts. Figure 11(c) is an example of the standard deviation chart generated by multiple test charts in Figure 11(a). Figure 12 is provided for applying the present invention. Schematic diagram of grayscale dynamic threshold. In the figure, pixel A is a pixel in the test image, and pixel A1 and A2 are the corresponding pixels of pixel A in the mean map and standard deviation map respectively.

a、選定樣本：選定樣本(如圖11(a)所示)，根據N張所述測試圖統計（訓練）得到平均值圖和標準差圖。a. Selected samples: Select samples (as shown in Figure 11(a)), and obtain the average value map and the standard deviation map based on the statistics (training) of N test charts.

b、確定特徵數據軸和分割數據軸，參見圖10，feature1為圖10中的特徵數據軸，fearture2為圖10中的分割數據軸。根據所述測試圖的灰度值與所述分割數據軸的灰度值，計算得到各個測試圖在特徵數據軸feature1的值，如下式所示： feature1=test-mean （5） b. Determine the feature data axis and segmentation data axis. See Figure 10. Feature1 is the feature data axis in Figure 10, and fearture2 is the segmentation data axis in Figure 10. According to the gray value of the test image and the gray value of the segmented data axis, the value of each test image on the feature data axis feature1 is calculated, as shown in the following formula: feature1=test-mean (5)

式中，feature1為圖10中的特徵數據軸，test為所述測試圖的灰度值，mean為N張所述測試圖統計得到的平均值圖的灰度值。In the formula, feature1 is the feature data axis in Figure 10, test is the gray value of the test image, and mean is the gray value of the average image obtained by statistics of N test images.

如前所述，所述分割數據軸feature2通過下式獲得： feature2=mean （6） As mentioned before, the segmented data axis feature2 is obtained by the following formula: feature2=mean

式中，mean為N張所述測試圖統計得到的平均值圖的灰度值。In the formula, mean is the gray value of the average image obtained by statistics of N test images.

c、假設缺陷分布邊界資訊（閾值）通過下式表示： defect_threshold=mean+/-（sigma*std+gray）（7） c. It is assumed that the defect distribution boundary information (threshold) is expressed by the following formula: defect_threshold=mean+/-（sigma*std+gray）（7）

式中，mean為N張所述測試圖統計得到的平均值圖的灰度值，std為所述測試圖中的其中一個像素點對應的標準差，sigma是標準差的係數，為待求解參數，gray為動態閾值。動態閾值gray相當於上文中的位移參數offset_parameter，可以定義為任意曲線，其和灰度值的平均值mean之間存在如下關係： gray=b+a1*mean + a2*mean^2 + a3*mean^3+……+am*mean^m (8) In the formula, mean is the gray value of the average image obtained by statistics of N test images, std is the standard deviation corresponding to one of the pixels in the test image, sigma is the coefficient of the standard deviation, and is the parameter to be solved , gray is the dynamic threshold. The dynamic threshold gray is equivalent to the displacement parameter offset_parameter mentioned above, which can be defined as any curve. There is the following relationship between it and the average value of gray value: gray=b+a1*mean + a2*mean^2 + a3*mean^3+……+am*mean^m (8)

其中，當只取多項式的前兩項時，動態閾值gray =b+a1*mean為一條直線形式，當繼續取後續的多項式時，變為曲線形式。將多個點的對應值帶入式（7），整理得到： Among them, when only the first two terms of the polynomial are taken, the dynamic threshold gray =b+a1*mean is in the form of a straight line, and when the subsequent polynomials are continued to be taken, it becomes a curve form. Put the corresponding values of multiple points into equation (7), and get:

由此，將反向推理檢測參數的問題轉化為使用最小二乘法求解上述方程組的最優解問題，其中sigma的值即為邊界閾值假設公式中的方差std的係數，而[b a1 a2 … an]為上述要擬合的分段曲線圖中的所有係數。From this, the problem of reverse inference detection parameters is transformed into the problem of using the least squares method to solve the optimal solution of the above equations, where the value of sigma is the coefficient of the variance std in the boundary threshold hypothesis formula, and [b a1 a2 ... an] are all coefficients in the above piecewise curve to be fitted.

d、求解多項式中的各個係數d. Solve each coefficient in the polynomial

對方程組進行求解，得到sigma及[b a1 a2 … an]各個係數的值，即可將算法中涉及的參數全部解析出來。如下式： By solving the system of equations and obtaining the values of sigma and [b a1 a2 ... an] of each coefficient, all parameters involved in the algorithm can be analyzed. As follows:

將上述方程組轉化為矩陣形式，如下所示： A x = b A’ A x= A’ b x =(A’ A )^(-1) * (A’ b) Convert the above system of equations into matrix form as follows: A x = b A’ A x= A’ b x =(A’ A )^(-1) * (A’ b)

由此，x即為最終的解，可通過上述的矩陣運算得到向量： Therefore, x is the final solution, and the vector can be obtained through the above matrix operation:

根據上述向量，可以獲取檢測流程中的標準差的係數sigma，以及動態閾值曲線所需要的多個係數，由此，動態閾值gray的曲線也可得出。因此，在檢測流程中使用下式即可得到測試圖像中每個像素點的真缺陷閾值： defect_threshold =std*sigma+gray （9） According to the above vector, the coefficient sigma of the standard deviation in the detection process can be obtained, as well as the multiple coefficients required for the dynamic threshold curve. From this, the curve of the dynamic threshold gray can also be obtained. Therefore, the true defect threshold of each pixel in the test image can be obtained by using the following formula in the inspection process: defect_threshold =std*sigma+gray (9)

即大於上述閾值defect_threshold的像素點即為正常點，小於或等於閾值defect_threshold的像素點即為缺陷點。That is, pixels greater than the above threshold defect_threshold are normal points, and pixels less than or equal to the threshold defect_threshold are defective points.

具體地，請參見圖11(d)和圖11(e)，其中，圖11(d)為其中一張測試圖的放大示例圖，圖11(e)為使用機器學習算法檢測出來的缺陷位置示意圖。通過對比圖11(d)和圖11(e)不難發現，使用本發明提供的檢測配方設置與優化方法得到的檢測配方，能夠準確地檢測出待檢測對象的真缺陷。Specifically, please refer to Figure 11(d) and Figure 11(e). Figure 11(d) is an enlarged example of one of the test images, and Figure 11(e) is the defect location detected using a machine learning algorithm. Schematic diagram. By comparing Figure 11(d) and Figure 11(e), it is easy to find that the detection formula obtained by using the detection formula setting and optimization method provided by the present invention can accurately detect the true defects of the object to be detected.

二、基於高斯模型的離群統計分析策略，反向推導新的數據流程和參數設置值2. Outlier statistical analysis strategy based on Gaussian model, reversely deriving new data processes and parameter setting values

為了便於理解本發明，在具體說明本發明提供的基於離群學習的反向推導獲取新的數據流程和參數設置值之前，先對基於高斯模型的離群統計分析策略，反向推導新的數據流程和參數設置值的核心思想予以說明。該方法的基本原理為假設在特徵分布圖中所有數據點（檢測結果數據）的分布都服從高斯分布。然後根據離群統計分析中的缺陷（defect）邊界分布資訊反向推理檢測模型(檢測流程的策略)中需要用到的均值和方差以及方差係數等參數，以得到用高斯模型檢測所需的相關參數。與所述數據分割法，反向推導新的數據流程和參數設置值的流程類似，基於高斯模型的離群統計分析策略，反向推導新的數據流程和參數設置值，包括以下步驟：步驟B1：所述預設離群統計分析策略為基於高斯模型的離群統計分析策略；步驟B2：根據所述基於高斯模型的離群統計分析策略，將獲取所述檢測對象的檢測結果數據的高斯分布作為所述反向推導策略，將高斯模型檢測作為檢測配方的策略；步驟B3：根據統計高斯分布的反向推導策略，將所述檢測對象的所有檢測結果數據作為所述輸入數據資訊和所述缺陷分布邊界資訊作為所述輸入數據資訊；步驟B4：根據所有檢測結果數據，假設所有的所述檢測結果數據的特徵數據資訊的特徵值在特徵空間的數據分布密度服從高斯分布；步驟B5：根據所述輸入數據資訊和所述缺陷分布邊界資訊，確定所述高斯模型檢測的參數。 In order to facilitate the understanding of the present invention, before specifically describing the reverse derivation based on outlier learning provided by the present invention to obtain new data processes and parameter setting values, the outlier statistical analysis strategy based on the Gaussian model is first used to reversely deduce new data. The core ideas of the process and parameter setting values are explained. The basic principle of this method is to assume that the distribution of all data points (detection result data) in the feature distribution map obeys Gaussian distribution. Then based on the defect boundary distribution information in the outlier statistical analysis, the parameters such as mean, variance and variance coefficient that need to be used in the detection model (strategy of the detection process) are reversely inferred to obtain the correlation required for Gaussian model detection. parameters. Similar to the data segmentation method and the process of reversely deriving new data processes and parameter settings, the Gaussian model-based outlier statistical analysis strategy to reversely derive new data processes and parameter settings includes the following steps: Step B1: The preset outlier statistical analysis strategy is an outlier statistical analysis strategy based on Gaussian model; Step B2: According to the outlier statistical analysis strategy based on the Gaussian model, use the Gaussian distribution of the detection result data of the detection object as the reverse derivation strategy, and use Gaussian model detection as the detection formula strategy; Step B3: According to the reverse derivation strategy of statistical Gaussian distribution, use all detection result data of the detection object as the input data information and the defect distribution boundary information as the input data information; Step B4: Based on all detection result data, assume that the data distribution density of the characteristic values of all the characteristic data information of the detection result data in the characteristic space obeys Gaussian distribution; Step B5: Determine the parameters of the Gaussian model detection based on the input data information and the defect distribution boundary information.

更具體地，通過下述各函數關係式表示如下： boundary_threshold = f2(inspection_data) （2） 𝜇=𝑓4(inspection_data ) （10） ∑ = f5(inspection_data,，𝜇) （11） ∏ = f6(boundary_threshold，inspection_data，𝜇, ，∑) （12） More specifically, it is expressed as follows through the following functional relationship expressions: boundary_threshold = f2(inspection_data) (2) 𝜇=𝑓4(inspection_data ) ∑ = f5(inspection_data,,𝜇) (11) ∏ = f6(boundary_threshold, inspection_data, 𝜇, ,∑) (12)

式中，boundary_threshold為離群算法得到的缺陷邊界分布結果，可以已經得到這個邊界矩陣boundary_threshold。均值𝜇可以由檢測結果數據得到，是對當前檢測數據圖像求取灰度的平均值而得。方差∑的計算是通過待檢測圖像的像素點的灰度值與均值𝜇相減的平方和，再求平均得到的。權重∏可以表示為boundary_threshold，inspection_data， 𝜇 和 ∑相關聯的函數，它表現為方差∑的係數，根據下式： 𝜇 +∏ * ∑= boundary_threshold （13） In the formula, boundary_threshold is the defect boundary distribution result obtained by the outlier algorithm. This boundary matrix boundary_threshold can already be obtained. The mean value 𝜇 can be obtained from the detection result data, which is obtained by averaging the grayscale of the current detection data image. The variance Σ is calculated by subtracting the sum of squares from the gray value of the pixels of the image to be detected and the mean 𝜇, and then averaging. The weight ∏ can be expressed as a function associated with boundary_threshold, inspection_data, 𝜇 and ∑, which is expressed as the coefficient of the variance ∑, according to the following formula: 𝜇 +∏ * ∑= boundary_threshold (13)

上式中，由於𝜇，方差∑和邊界boundary_threshold都已經計算得到，因此可以解方程得到權重∏。In the above formula, since 𝜇, variance ∑ and boundary_threshold have been calculated, the equation can be solved to obtain the weight ∏.

三、基於機器學習的離群統計分析策略，反向推導新的數據流程和參數設置值3. Outlier statistical analysis strategy based on machine learning, reversely deriving new data processes and parameter setting values

作為優選，在其中一種示範性實施方式中，所述基於機器學習的離群統計分析策略，反向推導新的數據流程和參數設置值，包括以下步驟：步驟C1：所述預設離群統計分析策略為機器學習的離群統計分析策略；步驟C2：根據所述機器學習的離群統計分析策略，將獲取所述檢測對象的檢測結果數據的密度閾值和距離閾值作為所述反向推導策略，將機器學習模型作為檢測配方的策略；步驟C3：根據所述獲取所述檢測對象的檢測結果數據的密度閾值和距離閾值的反向推導策略，將獲取的所述檢測對象的檢測結果數據的密度和距離作為所述輸入數據資訊；步驟C4：根據所有檢測結果數據和所述缺陷邊界分布資訊，反向推導所述機器學習模型的檢測策略的密度參數和距離參數。 Preferably, in one of the exemplary implementations, the outlier statistical analysis strategy based on machine learning to reversely derive new data processes and parameter setting values includes the following steps: Step C1: The preset outlier statistical analysis strategy is a machine learning outlier statistical analysis strategy; Step C2: According to the outlier statistical analysis strategy of machine learning, the density threshold and distance threshold for obtaining the detection result data of the detection object are used as the reverse derivation strategy, and the machine learning model is used as the detection formula strategy; Step C3: According to the reverse derivation strategy for obtaining the density threshold and distance threshold of the detection result data of the detection object, use the obtained density and distance of the detection result data of the detection object as the input data information; Step C4: Based on all detection result data and the defect boundary distribution information, reversely derive the density parameters and distance parameters of the detection strategy of the machine learning model.

如本領域技術人員可以理解地，由於機器學習模型需要制定多個參數，基於機器學習的離群統計分析算法，參數的確定直接影響檢測精度的高低。如:k-means算法中的初始聚類中心，DBSCAN算法中的鄰域和數量閾值等。由此，通過離群統計分析中的缺陷邊界分布資訊（結果）對這些機器學習參數進行反向推理，可以得到具有先驗知識的機器學習模型，進而提高模型檢測的精度。具體地，可以通過以下各式： boundary_threshold = f7(inspection_data) density_parameters = f8(boundary_threshold, inspection_data) distance_parameters = f9(boundary_threshold, inspection_data) As those skilled in the art can understand, since the machine learning model needs to formulate multiple parameters, the determination of the parameters of the outlier statistical analysis algorithm based on machine learning directly affects the detection accuracy. For example: the initial clustering center in the k-means algorithm, the neighborhood and number threshold in the DBSCAN algorithm, etc. Therefore, by conducting reverse reasoning on these machine learning parameters through the defect boundary distribution information (results) in outlier statistical analysis, a machine learning model with prior knowledge can be obtained, thereby improving the accuracy of model detection. Specifically, the following formulas can be used: boundary_threshold = f7(inspection_data) density_parameters = f8(boundary_threshold, inspection_data) distance_parameters = f9(boundary_threshold, inspection_data)

式中，boundary_threshold為離群算法得到的缺陷邊界分布資訊，和檢測結果數據相關，在缺陷邊界分析流程中已經得到。基於距離和密度進行聚類算法重要的兩個參數是密度density_parameters和距離distance_parameters，密度density_parameters和距離distance_parameters來源於檢測結果數據和邊界矩陣，通過反推距離和密度參數，以使得缺陷恰好位於預設閾值之外能夠被檢測到；而正常的像素點則位於密度較大的閾值範圍內，被過濾掉，由此，提高檢測精度。In the formula, boundary_threshold is the defect boundary distribution information obtained by the outlier algorithm, which is related to the detection result data and has been obtained during the defect boundary analysis process. The two important parameters of the clustering algorithm based on distance and density are density density_parameters and distance distance_parameters. Density density_parameters and distance distance_parameters are derived from the detection result data and the boundary matrix. By inverting the distance and density parameters, the defects are exactly located at the preset threshold. can be detected; while normal pixels are located within a threshold range with a larger density and are filtered out, thereby improving detection accuracy.

優選地，其中一種示範性實施方式中，請繼續參見圖1，從圖1可以看出，所述檢測配方設置與優化方法還包括：Preferably, in one of the exemplary implementations, please continue to refer to Figure 1. As can be seen from Figure 1, the detection recipe setting and optimization method also includes:

S500：根據所述檢測配方及所述檢測配方的檢測參數的取值，對待檢測對象進行缺陷分析，得到所述待檢測對象的缺陷數據資訊。S500: Perform defect analysis on the object to be detected according to the detection formula and the values of the detection parameters of the detection formula to obtain defect data information of the object to be detected.

請參見圖13，其示意性地給出了應用本發明提出的檢測配方設置與優化方法得到的檢測流程檢測得到的檢測結果數據與原始檢測流程得到的檢測結果數據的對比示意圖。從圖13可以看出，應用本發明反向推導得到的檢測流程的策略和參數設置值用於檢測過程，nuisance噪聲數據被過濾掉，真缺陷數據（defect缺陷數據）被保留，通過檢測結果數據在特徵空間的分布可以直觀檢驗結果的正確性。Please refer to Figure 13, which schematically shows a comparison diagram of the detection result data obtained by the detection process using the detection recipe setting and optimization method proposed by the present invention and the detection result data obtained by the original detection process. It can be seen from Figure 13 that by applying the strategy and parameter setting values of the detection process obtained by reverse derivation of the present invention for the detection process, the nuisance noise data is filtered out, the true defect data (defect defect data) is retained, and the detection result data is passed The distribution in the feature space can visually test the correctness of the results.

綜上所述，本發明提供的檢測配方設置與優化方法，所述第一數據樣本包括若干條檢測結果數據，所述檢測結果數據包括大量輔助的調參資訊，通過數據標注，為後續有效利用歷史資訊進行數據分析和推理從而能夠獲取到準確的先驗知識提供了重要的依據，能夠提高檢測配方的檢測精度。進一步地，本發明提供的檢測配方設置與優化方法，檢測配方的策略及參數設置值是根據缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導得到。由此，本發明通過反向推導能夠同時推理出一套檢測參數（同時調整出所有參數），參數之間的耦合關係也考慮在內，實現了檢測流程的快速建模；避免了反復調整參數，能夠顯著節約人力和時間成本；而且，針對新工藝缺陷檢測，無需用戶具備算法基礎也能確定檢測流程的策略及參數設置值。To sum up, in the detection recipe setting and optimization method provided by the present invention, the first data sample includes several pieces of detection result data, and the detection result data includes a large amount of auxiliary parameter adjustment information, which can be used effectively for subsequent use through data annotation. Historical information can be used for data analysis and reasoning to obtain accurate prior knowledge, which provides an important basis and can improve the detection accuracy of detection formulas. Furthermore, in the detection recipe setting and optimization method provided by the present invention, the detection recipe strategy and parameter setting values are obtained through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy. Therefore, the present invention can deduce a set of detection parameters at the same time (adjusting all parameters at the same time) through reverse derivation. The coupling relationship between parameters is also taken into account, realizing rapid modeling of the detection process; avoiding repeated adjustment of parameters. , which can significantly save labor and time costs; moreover, for new process defect detection, users can determine the strategy and parameter setting values of the detection process without the need for algorithm foundation.

本發明的再一實施例提供了一種檢測配方設置與優化裝置，具體地，請參見圖14，其示意性地給出了本實施方式提供的檢測配方設置與優化裝置的結構框圖。從圖14可以看出，本實施例提供的檢測配方設置與優化裝置，包括：真缺陷及噪擾標記單元100、特徵分布資訊獲取單元200、缺陷分布邊界獲取單元300和檢測參數設置及優化單元400。Yet another embodiment of the present invention provides a detection recipe setting and optimization device. Specifically, please refer to FIG. 14 , which schematically provides a structural block diagram of the detection recipe setting and optimization device provided by this embodiment. As can be seen from Figure 14, the detection recipe setting and optimization device provided by this embodiment includes: a true defect and noise marking unit 100, a feature distribution information acquisition unit 200, a defect distribution boundary acquisition unit 300, and a detection parameter setting and optimization unit. 400.

具體地，所述真缺陷及噪擾標記單元100，被配置為對第一數據樣本進行標注，得到第二數據樣本；其中，所述第一數據樣本包括若干條檢測結果數據；所述第二數據樣本包括所述檢測結果數據以及每條所述檢測結果數據對應的標籤。所述特徵分布資訊獲取單元200，被配置為根據所述第二數據樣本，得到檢測對象的數據特徵分布資訊。所述缺陷分布邊界獲取單元300，被配置為採用預設離群統計分析策略，對所述數據特徵分布資訊進行離群統計分析，獲取缺陷分布邊界資訊，並用於根據所述預設離群統計分析策略，確定檢測配方。所述檢測參數設置及優化單元400，被配置為根據所述缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導，設置或優化所述檢測配方的檢測參數的取值。Specifically, the true defect and noise marking unit 100 is configured to mark a first data sample to obtain a second data sample; wherein the first data sample includes several pieces of detection result data; and the second The data sample includes the detection result data and the label corresponding to each piece of the detection result data. The feature distribution information acquisition unit 200 is configured to obtain data feature distribution information of the detection object based on the second data sample. The defect distribution boundary acquisition unit 300 is configured to use a preset outlier statistical analysis strategy to perform outlier statistical analysis on the data feature distribution information, obtain defect distribution boundary information, and use it to perform outlier statistical analysis according to the preset outlier statistics. Analyze strategies and determine detection recipes. The detection parameter setting and optimization unit 400 is configured to set or optimize the values of detection parameters of the detection recipe through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy.

優選地，作為其中一種示範性實施方式，所述檢測配方設置與優化裝置還包括檢測配方應用單元500。具體地，所述檢測配方應用單元500被配置為根據所述檢測配方及所述檢測配方的檢測參數的取值，對待檢測對象進行缺陷分析，得到所述待檢測對象的缺陷數據資訊。Preferably, as one of the exemplary implementations, the detection recipe setting and optimization device further includes a detection recipe application unit 500 . Specifically, the detection recipe application unit 500 is configured to perform defect analysis on the object to be detected according to the detection recipe and the values of detection parameters of the detection recipe, and obtain defect data information of the object to be detected.

由於本發明提供的檢測配方設置與優化裝置與上述各實施方式提供的檢測配方設置與優化方法的基本原理類似，因此，為了避免贅述，對上述檢測配方設置與優化裝置實施方式的具體內容介紹的比較粗略，詳細的內容可參見上文有關檢測配方設置與優化方法的詳細說明。進一步地，由於本發明提供的檢測配方設置與優化裝置與上述各實施方式提供的檢測配方設置與優化方法屬同一發明構思，因此，本發明提供的檢測配方設置與優化裝置至少具有與所述檢測配方設置與優化方法相同的有益效果，可以參考上文中的檢測配方設置與優化方法中的相關內容，故對此不再進行贅述。此外，由於本發明中的檢測配方設置與優化裝置與上文所述的檢測配方設置與優化方法屬同一發明構思，因此本文對檢測配方設置與優化裝置的介紹較為簡單，關於是如何的，可以參考上文中的檢測配方設置與優化方法中的相關內容，故對此不再進行贅述。Since the basic principles of the detection recipe setting and optimization device provided by the present invention are similar to the detection recipe setting and optimization methods provided by the above embodiments, in order to avoid redundancy, the specific content of the above detection recipe setting and optimization device implementation is introduced. It is relatively rough. For detailed information, please refer to the detailed description of the detection recipe settings and optimization methods above. Furthermore, since the detection recipe setting and optimization device provided by the present invention and the detection recipe setting and optimization method provided by the above embodiments belong to the same inventive concept, the detection recipe setting and optimization device provided by the present invention at least has the same features as the detection recipe setting and optimization method. The recipe setting and optimization method have the same beneficial effects. You can refer to the relevant content in the detection recipe setting and optimization method above, so this will not be described again. In addition, since the detection formula setting and optimization device in the present invention and the detection formula setting and optimization method described above belong to the same inventive concept, the introduction to the detection formula setting and optimization device in this article is relatively simple. Regarding how, you can Please refer to the relevant content in the detection recipe settings and optimization methods above, so we will not go into details here.

基於同一發明構思，本發明還提供一種電子設備，請參考圖15，其示意性地給出了本發明一實施方式提供的電子設備的方框結構示意圖。如圖15所示，所述電子設備包括處理器601和儲存器603，所述儲存器603上儲存有電腦程式，所述電腦程式被所述處理器601執行時，實現上文所述的檢測配方設置與優化方法。由於本發明提供的電子設備與上文所述的檢測配方設置與優化方法屬同一發明構思，因此其具有上文所述的檢測配方設置與優化方法的所有優點，故對此不再進行贅述。Based on the same inventive concept, the present invention also provides an electronic device. Please refer to FIG. 15 , which schematically shows a block structure diagram of the electronic device provided by an embodiment of the present invention. As shown in Figure 15, the electronic device includes a processor 601 and a storage 603. A computer program is stored on the storage 603. When the computer program is executed by the processor 601, the detection described above is realized. Recipe setting and optimization methods. Since the electronic device provided by the present invention belongs to the same inventive concept as the above-mentioned detection recipe setting and optimization method, it has all the advantages of the above-mentioned detection recipe setting and optimization method, and thus will not be described again.

如圖15所示，所述電子設備還包括通訊接口602和通訊總線604，其中所述處理器601、所述通訊接口602、所述儲存器603通過通訊總線604完成相互間的通訊。所述通訊總線604可以是外設部件互連標準（Peripheral Component Interconnect，PCI）總線或擴展工業標准結構（Extended Industry Standard Architecture，EISA）總線等。該通訊總線604可以分為地址總線、數據總線、控制總線等。為便於表示，圖中僅用一條粗線表示，但並不表示僅有一根總線或一種類型的總線。所述通訊接口602用於上述電子設備與其他設備之間的通訊。As shown in FIG. 15 , the electronic device also includes a communication interface 602 and a communication bus 604 , wherein the processor 601 , the communication interface 602 , and the storage 603 complete communication with each other through the communication bus 604 . The communication bus 604 may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus, etc. The communication bus 604 can be divided into an address bus, a data bus, a control bus, etc. For ease of presentation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus. The communication interface 602 is used for communication between the above electronic device and other devices.

本發明中所稱處理器601可以是中央處理單元(Central Processing Unit，CPU)，還可以是其他通用處理器、數字訊號處理器(Digital Signal Processor，DSP)、專用集成電路 (Application Specific Integrated Circuit，ASIC)、現成可編程門陣列 (Field-Programmable Gate Array，FPGA)或者其他可編程邏輯器件、分立門或者晶體管邏輯器件、分立硬體組件等。通用處理器可以是微處理器或者該處理器也可以是任何常規的處理器等，所述處理器601是所述電子設備的控制中心，利用各種接口和線路連接整個電子設備的各個部分。The processor 601 in the present invention can be a central processing unit (CPU), or other general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), or an application specific integrated circuit (Application Specific Integrated Circuit). ASIC), off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general processor may be a microprocessor or the processor may be any conventional processor, etc. The processor 601 is the control center of the electronic device and uses various interfaces and lines to connect various parts of the entire electronic device.

所述儲存器603可用於儲存所述電腦程式，所述處理器601通過運行或執行儲存在所述儲存器603內的電腦程式，以及調用儲存在儲存器603內的數據，實現所述電子設備的各種功能。The storage 603 can be used to store the computer program. The processor 601 implements the electronic device by running or executing the computer program stored in the storage 603 and calling the data stored in the storage 603. various functions.

所述儲存器603可以包括非易失性和/或易失性儲存器。非易失性儲存器可包括只讀儲存器(ROM)、可編程ROM(PROM)、電可編程ROM(EPROM)、電可擦除可編程ROM(EEPROM)或閃存。易失性儲存器可包括隨機存取儲存器(RAM)或者外部高速緩沖儲存器。作為說明而非局限，RAM以多種形式可得，諸如靜態RAM(SRAM)、動態RAM(DRAM)、同步DRAM(SDRAM)、雙數據率SDRAM(DDRSDRAM)、增強型SDRAM(ESDRAM)、同步鏈路(Synchlink)DRAM(SLDRAM)、儲存器總線(Rambus)直接RAM(RDRAM)、直接儲存器總線動態RAM(DRDRAM)、以及儲存器總線動態RAM(RDRAM)等。The storage 603 may include non-volatile and/or volatile storage. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

本發明還提供了一種可讀儲存介質，所述可讀儲存介質內儲存有電腦程式，所述電腦程式被處理器執行時可以實現上文所述的檢測配方設置與優化方法。由於本發明提供的可讀儲存介質與上文所述的檢測配方設置與優化方法屬同一發明構思，因此其具有上文所述的檢測配方設置與優化方法的所有優點，故對此不再進行贅述。The present invention also provides a readable storage medium. A computer program is stored in the readable storage medium. When the computer program is executed by a processor, the above-mentioned detection recipe setting and optimization method can be realized. Since the readable storage medium provided by the present invention and the detection recipe setting and optimization method described above belong to the same inventive concept, it has all the advantages of the detection recipe setting and optimization method described above, so this will not be discussed further. Repeat.

本發明實施方式的可讀儲存介質，可以採用一個或多個電腦可讀的介質的任意組合。可讀介質可以是電腦可讀訊號介質或者電腦可讀儲存介質。電腦可讀儲存介質例如可以是但不限於電、磁、光、電磁、紅外線或半導體的系統、裝置或器件，或者任意以上的組合。電腦可讀儲存介質的更具體的例子（非窮舉的列表）包括：具有一個或多個導線的電連接、便攜式電腦硬盤、硬盤、隨機存取儲存器（RAM）、只讀儲存器（ROM）、可擦式可編程只讀儲存器（EPROM或閃存）、光纖、便攜式緊湊磁盤只讀儲存器（CD-ROM）、光儲存器件、磁儲存器件、或者上述的任意合適的組合。在本文中，電腦可讀儲存介質可以是任何包含或儲存程式的有形介質，該程式可以被指令執行系統、裝置或者器件使用或者與其組合使用。The readable storage medium in the embodiment of the present invention may be any combination of one or more computer-readable media. The readable medium may be a computer-readable signal medium or a computer-readable storage medium. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared or semiconductor system, device or device, or any combination thereof. More specific examples (non-exhaustive list) of computer-readable storage media include: an electrical connection having one or more conductors, a portable computer hard drive, a hard drive, random access memory (RAM), read only memory (ROM) ), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. As used herein, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in combination with an instruction execution system, device, or device.

電腦可讀的訊號介質可以包括在基帶中或者作為載波一部分傳播的數據訊號，其中承載了電腦可讀的程式碼。這種傳播的數據訊號可以採用多種形式，包括但不限於電磁訊號、光訊號或上述的任意合適的組合。電腦可讀的訊號介質還可以是電腦可讀儲存介質以外的任何電腦可讀介質，該電腦可讀介質可以發送、傳播或者傳輸用於由指令執行系統、裝置或者器件使用或者與其結合使用的程式。A computer-readable signal medium may include a data signal that carries computer-readable program code in baseband or as part of a carrier wave. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, device, or device .

可以以一種或多種程式設計語言或其組合來編寫用於執行本發明操作的電腦程式碼，所述程式設計語言包括面向對象的程式設計語言-諸如Java、Smalltalk、C++，還包括常規的過程式程式設計語言-諸如“C”語言或類似的程式設計語言。程式碼可以完全地在用戶電腦上執行、部分地在用戶電腦上執行、作為一個獨立的軟體套件執行、部分在用戶電腦上部分在遠程電腦上執行、或者完全在遠程電腦或伺服器上執行。在涉及遠程電腦的情形中，遠程電腦可以通過任意種類的網路——包括區域網路（LAN）或廣域網路（WAN）連接到用戶電腦，或者可以連接到外部電腦（例如利用網際網路服務提供商來通過網際網路連接）。Computer code for performing operations of the present invention may be written in one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional procedural programming, or a combination thereof. Programming language - such as "C" or a similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In situations involving remote computers, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (e.g., using an Internet service provider to connect via the Internet).

綜上所述，與現有技術相比，本發明提供的檢測配方設置與優化方法、裝置、電子設備和儲存介質具有以下優點：所述第一數據樣本包括若干條檢測結果數據，所述檢測結果數據包括輔助的調參資訊（比如所述檢測對象的基本資訊和特徵數據資訊，所述特徵數據資訊包括但不限於檢測結果指示的缺陷的灰度、形狀、紋理等資訊），通過數據標注可以區分真缺陷數據和噪擾數據，為後續有效利用歷史資訊進行數據分析和推理從而能夠獲取到準確的先驗知識提供了重要的依據，能夠提高檢測配方的檢測精度。進一步地，本發明提供的檢測配方設置與優化方法，檢測配方的策略及檢測參數的取值是根據缺陷分布邊界資訊和所述預設離群統計分析策略，通過反向推導得到。由此，本發明通過反向推導能夠同時推理出一套檢測參數（同時調整出所有參數），參數之間的耦合關係也考慮在內，實現了檢測配方的快速建模；避免了反復調整參數，能夠顯著節約人力和時間成本；而且，針對新工藝缺陷檢測，無需用戶具備算法基礎也能確定檢測配方的策略及檢測配方的檢測參數的取值。To sum up, compared with the existing technology, the detection recipe setting and optimization method, device, electronic equipment and storage medium provided by the present invention have the following advantages: the first data sample includes several detection result data, and the detection result The data includes auxiliary parameter adjustment information (such as the basic information and characteristic data information of the detection object. The characteristic data information includes but is not limited to the grayscale, shape, texture and other information of the defects indicated by the detection results). Through data annotation, the data can be Distinguishing between true defect data and noise data provides an important basis for subsequent effective use of historical information for data analysis and reasoning to obtain accurate prior knowledge, which can improve the detection accuracy of detection formulas. Furthermore, in the detection recipe setting and optimization method provided by the present invention, the detection recipe strategy and detection parameter values are obtained through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy. Therefore, the present invention can deduce a set of detection parameters at the same time (adjusting all parameters at the same time) through reverse derivation. The coupling relationship between parameters is also taken into account, realizing rapid modeling of detection formulas; avoiding repeated adjustment of parameters. , which can significantly save labor and time costs; moreover, for new process defect detection, the user can determine the strategy of the detection formula and the values of the detection parameters of the detection formula without having any algorithm foundation.

應當注意的是，在本文的實施方式中所揭露的裝置和方法，也可以通過其他的方式實現。以上所描述的裝置實施方式僅僅是示意性的，例如，附圖中的流程圖和框圖顯示了根據本文的多個實施方式的裝置、方法和電腦程式產品的可能實現的體系架構、功能和操作。在這點上，流程圖或框圖中的每個方框可以代表一個模組、程式或代碼的一部分，所述模組、代碼段或代碼的一部分包含一個或多個用於實現規定的邏輯功能的可執行指令，所述模組、代碼段或代碼的一部分包含一個或多個用於實現規定的邏輯功能的可執行指令。也應當注意，在有些作為替換的實現方式中，方框中所標注的功能也可以以不同於附圖中所標注的順序發生。例如，兩個連續的方框實際上可以基本並行地執行，它們有時也可以按相反的順序執行，這依所涉及的功能而定。也要注意的是，框圖和/或流程圖中的每個方框、以及框圖和/或流程圖中的方框的組合，可以用於執行規定的功能或動作的專用的基於硬體的系統來實現，或者可以用專用硬體與電腦指令的組合來實現。It should be noted that the devices and methods disclosed in the embodiments of this article can also be implemented in other ways. The above-described device implementations are only illustrative. For example, the flowcharts and block diagrams in the accompanying drawings illustrate possible implementation architectures, functions, and implementations of devices, methods, and computer program products according to various embodiments of this document. operate. In this regard, each block in the flowchart or block diagram may represent a module, program, or portion of code that contains one or more logic for implementing the specified Functional executable instructions. The module, code segment or part of the code contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It is also noted that each block in the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be configured to perform special purpose hardware-based hardware to perform the specified functions or actions. system, or can be implemented using a combination of dedicated hardware and computer instructions.

另外，在本文各個實施方式中的各功能模組可以集成在一起形成一個獨立的部分，也可以是各個模組單獨存在，也可以兩個或兩個以上模組集成形成一個獨立的部分。In addition, each functional module in each embodiment of this article can be integrated together to form an independent part, each module can exist alone, or two or more modules can be integrated to form an independent part.

上述描述僅是對本發明較佳實施方式的描述，並非對本發明範圍的任何限定，本發明領域的普通技術人員根據上述揭示內容做的任何變更、修飾，均屬本發明的保護範圍。顯然，本領域的技術人員可以對本發明進行各種改動和變形而不脫離本發明的精神和範圍。這樣，倘若這些修改和變形屬本發明及其等同技術的範圍之內，則本發明也意圖包括這些改動和變形在內。The above description is only a description of the preferred embodiments of the present invention, and does not limit the scope of the present invention in any way. Any changes or modifications made by those of ordinary skill in the field of the present invention based on the above disclosure fall within the scope of the present invention. Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations are within the scope of the present invention and equivalent technologies, the present invention is also intended to include these modifications and variations.

S100~S500:步驟 1:噪擾數據 2:真缺陷數據 3:缺陷分布邊界曲線 segment_value1、segment_value2:第一分割閾值 A、A1、A2:像素點 100:真缺陷及噪擾標記單元 200:特徵分布資訊獲取單元 300:缺陷分布邊界獲取單元 400:檢測參數設置及優化單元 500:檢測配方應用單元 601:處理器 602:通訊接口 603:儲存器 604:通訊總線 S100~S500: steps 1: Noisy data 2: True defect data 3: Defect distribution boundary curve segment_value1, segment_value2: first segmentation threshold A, A1, A2: pixels 100: True defect and noise marking unit 200: Feature distribution information acquisition unit 300: Defect distribution boundary acquisition unit 400: Detection parameter setting and optimization unit 500: Detection formula application unit 601: Processor 602: Communication interface 603:Storage 604: Communication bus

圖1為本發明一實施方式提供的檢測配方設置與優化方法的流程示意圖；圖2為本發明一實施方式提供的數據樣本標注方法流程示意圖；圖3為本發明一實施方式提供的對數據樣本進行缺陷標注的其中一種界面示意圖；圖4為應用本發明的其中一具體示例的檢測結果數據在二維特徵空間的分布示例圖；圖5為本發明一實施方式提供的離群統計分析原理示意圖；圖6為應用本發明提供的離群統計分析模型得到的缺陷分布邊界資訊示意圖；圖7為圖1中步驟S400的詳細流程示意圖；圖8為應用本發明提供的檢測配方設置與優化方法進行反向推導的一具體示例圖；圖9為本發明一實施方式提供的其中一種檢測結果數據的數據密度分布示意圖；圖10為本發明一實施方式提供的標準分割軸的平均灰度級範圍內的真缺陷數據分布示意圖；圖11(a)為本發明一實施方式提供的多張測試圖示例圖；圖11(b)為圖11(a)中多張測試圖生成的均值圖示例圖；圖11(c)為圖11(a)中多張測試圖生成的標準差圖示例圖；圖11(d)為其中一張測試圖的放大示例圖；圖11(e)為使用機器學習配方檢測出來的缺陷位置示意圖；圖12為應用本發明提供的灰度動態閾值示意圖；圖13為應用本發明提供的檢測配方設置與優化方法得到的檢測配方檢測得到的檢測結果數據與原始檢測配方得到的檢測結果數據的對比示意圖；圖14為本發明一實施方式中的檢測配方設置與優化裝置的結構框圖；圖15為本發明一實施方式中的電子設備的方框結構示意圖。 Figure 1 is a schematic flow chart of a detection recipe setting and optimization method provided by an embodiment of the present invention; Figure 2 is a schematic flow chart of a data sample labeling method provided by an embodiment of the present invention; Figure 3 is a schematic diagram of an interface for defect marking of data samples provided by an embodiment of the present invention; Figure 4 is an example diagram showing the distribution of detection result data in a two-dimensional feature space in one specific example of applying the present invention; Figure 5 is a schematic diagram of the principle of outlier statistical analysis provided by an embodiment of the present invention; Figure 6 is a schematic diagram of defect distribution boundary information obtained by applying the outlier statistical analysis model provided by the present invention; Figure 7 is a detailed flow diagram of step S400 in Figure 1; Figure 8 is a specific example diagram of reverse derivation using the detection formula setting and optimization method provided by the present invention; Figure 9 is a schematic diagram of the data density distribution of one of the detection result data provided by an embodiment of the present invention; Figure 10 is a schematic diagram of true defect data distribution within the average gray level range of the standard segmentation axis provided by an embodiment of the present invention; Figure 11(a) is an example of multiple test charts provided by an embodiment of the present invention; Figure 11(b) is an example of the mean graph generated from multiple test images in Figure 11(a); Figure 11(c) is an example of the standard deviation chart generated from multiple test charts in Figure 11(a); Figure 11(d) is an enlarged example of one of the test images; Figure 11(e) is a schematic diagram of defect locations detected using machine learning recipes; Figure 12 is a schematic diagram of the grayscale dynamic threshold provided by the application of the present invention; Figure 13 is a schematic diagram comparing the detection result data obtained by applying the detection formula setting and optimization method provided by the present invention and the detection result data obtained by the original detection formula; Figure 14 is a structural block diagram of a detection recipe setting and optimization device in an embodiment of the present invention; FIG. 15 is a schematic block structure diagram of an electronic device in an embodiment of the present invention.

S100~S500:步驟 S100~S500: steps

Claims

A detection recipe setting and optimization method, which is characterized by including: Annotate the first data sample to obtain a second data sample; wherein, the first data sample includes several pieces of detection result data; the second data sample includes the detection result data and the corresponding data of each piece of detection result data. label; According to the second data sample, obtain the data characteristic distribution information of the detection object; Using a preset outlier statistical analysis strategy, perform outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information; and determine the detection formula according to the preset outlier statistical analysis strategy; According to the defect distribution boundary information and the preset outlier statistical analysis strategy, the values of the detection parameters of the detection formula are set or optimized through reverse derivation.

The detection recipe setting and optimization method as described in claim 1, wherein the detection result data includes basic information and characteristic data information of the detection object; wherein the characteristic data information includes the detection result on the detection object. Position information, as well as one or more of the process flow information of the detection object, the grayscale information, shape information and texture information of the detection result; Annotating the first data sample to obtain the second data sample includes: Obtain the basic information of the detection object corresponding to each piece of detection result data in the first data sample; For each piece of detection result data, obtain the original information corresponding to the detection result data on the detection object based on the basic information of the detection object and the location information of the detection result on the detection object; According to the original information, it is judged whether the defect marked by the data information of the detection result is a real defect. If so, the detection result data is marked as true defect data; if not, the detection result data is marked as Noisy data; The second data sample is obtained based on all the detection result data and the label corresponding to each piece of detection result data.

The detection recipe setting and optimization method as described in claim 2, wherein the detection object includes a wafer; the basic information of the wafer includes the number of the wafer, the number of Dies included, and the basic information of each Die; The basic information of Die includes the Die number and image information of the Die; Obtaining the original information corresponding to the detection result data on the detection object based on the basic information of the detection object and the location information of the detection result on the detection object includes: According to the basic information of the Wafer, obtain the Die number of each Die of the Wafer and the basic information of each Die; According to the position information of the detection result on the Die and the image information of the Die, the image information of the detection result corresponding to the detection result data on the Die is obtained.

The detection recipe setting and optimization method as described in claim 1, wherein the data feature distribution information of the detection object is obtained according to the second data sample, including: Determine the characteristic data axis and the segmentation data axis, and establish a feature space based on the characteristic data axis and the segmentation data axis; wherein the characteristic data axis represents the characteristic data information of the detection result data, and the segmentation data axis represents the segmentation feature Information; wherein the segmentation feature information includes other feature data information in addition to the feature data axis; Arrange the second data samples according to the feature space to obtain data feature distribution information of the detection object.

The detection recipe setting and optimization method as described in claim 4, wherein the feature space includes one or more feature data axes and one or more segmentation data axes.

The detection recipe setting and optimization method as described in claim 4, wherein the second data samples are arranged according to the feature space to obtain the data feature distribution information of the detection object, including: Use the feature data axis as the horizontal axis and the segmented data axis as the vertical axis to establish a rectangular coordinate system; In the rectangular coordinate system, in the horizontal axis direction, the feature value size of the feature data information represented by the feature data axis, and in the vertical axis direction, according to the feature data represented by the segmented data axis. The second data samples are arranged according to the characteristic value size of the information to obtain a defect characteristic distribution map.

The detection recipe setting and optimization method as described in claim 6, wherein the preset outlier statistical analysis strategy is used to perform outlier statistical analysis on the data feature distribution information to obtain defect distribution boundary information, including: Determine whether to automatically find the defect distribution boundary information. If so, train the outlier statistical analysis model according to the selected outlier statistical analysis model to obtain the defect distribution boundary information; if not, use the data segmentation method to analyze the data. Conduct outlier statistical analysis on feature distribution information to obtain defect distribution boundary information; Wherein, training the outlier statistical analysis model includes: training the selected outlier statistical analysis model according to the detection result data and the data feature distribution information until the detection result is obtained. The defect distribution boundary information of the object satisfies the first preset condition; The use of data segmentation method to perform outlier statistical analysis on the data feature distribution information includes: based on the detection result data and the data feature distribution information, on the feature data axis and/or the segmented data axis Obtain at least one first segmentation threshold; and acquire the defect boundary information according to the first segmentation threshold until the obtained defect distribution boundary information of the detection object satisfies the second preset condition.

The detection recipe setting and optimization method as described in claim 7, wherein the segmented data axis represents process flow information; and based on the detection result data and the data characteristic distribution information, the characteristic data axis and/or The segmented data axis is threshold segmented until the obtained defect distribution boundary information of the detection object meets the second preset condition, including: Determine the first segmentation threshold of the segmented data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data; Determine the second segmentation threshold of the feature data axis based on the data feature distribution information and the consistency of the data distribution of the detection results labeled as true defect data and labeled as noise data; According to the first segmentation threshold of the segmentation data axis and the second segmentation threshold of the feature data axis, the defect distribution boundary information of the detection object is obtained.

The detection recipe setting and optimization method as described in claim 7, wherein the use of a preset outlier statistical analysis strategy also includes: an outlier statistical analysis strategy that combines data segmentation and model learning; The outlier statistical analysis strategy that combines data segmentation and model learning includes: obtaining at least one first segmentation threshold on the segmentation data axis of the detection result data labeled as a true defect based on the data feature distribution information. ; And according to the first segmentation threshold and the data feature distribution information, train the selected outlier statistical analysis model until the obtained defect distribution boundary information of the detection object meets the third preset condition.

The detection recipe setting and optimization method as described in claim 1, wherein the detection parameters of the detection recipe are set or optimized through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy. The values include: Determine a reverse derivation strategy according to the preset outlier statistical analysis strategy; According to the reverse derivation strategy, determine the input data information of the reverse derivation strategy; Determine the data distribution model of the detection result data according to the input data information; Determine the detection parameters of the detection formula according to the data distribution model and the defect distribution boundary information; According to the strategy of the detection recipe and the input data information of the reverse derivation, the values of the detection parameters of the detection recipe are set or optimized.

The detection recipe setting and optimization method as described in claim 10, wherein the preset outlier statistical analysis strategy is a data segmentation method; According to the data segmentation method, the data distribution density of the detection result data of the detection object is counted as the reverse derivation strategy; According to the reverse derivation strategy of the statistical data distribution density, all detection result data of the detection object are used as the input data information; According to all detection result data, it is assumed that the data distribution density of the characteristic data information of all the detection result data in the feature space is divided into normal areas, noise areas and true defect areas; the normal area is a data distribution density greater than The area of the first density threshold, the noise area is the area where the data density is less than or equal to the first density threshold and greater than the second density threshold, and the true defect area is the area where the data density is less than or equal to the second density threshold; Calculate the first density threshold and the second density threshold according to all detection result data and labels of all detection result data; wherein the first density threshold is greater than the second density threshold; Calculate the displacement parameter of the detection formula according to the first density threshold, the second density threshold and the defect distribution boundary information.

The detection recipe setting and optimization method as described in claim 10, wherein the preset outlier statistical analysis strategy is an outlier statistical analysis strategy based on a Gaussian model; According to the outlier statistical analysis strategy based on the Gaussian model, the Gaussian distribution of the detection result data of the detection object is obtained as the reverse derivation strategy, and Gaussian model detection is used as the detection formula strategy; According to the reverse derivation strategy of statistical Gaussian distribution, all detection result data of the detection object are used as the input data information and the defect distribution boundary information is used as the input data information; According to all detection result data, it is assumed that the data distribution density of the feature values of all the feature data information of the detection result data in the feature space obeys Gaussian distribution; According to the input data information and the defect distribution boundary information, the parameters of the Gaussian model detection are determined.

The detection recipe setting and optimization method as described in claim 10, wherein the preset outlier statistical analysis strategy is a machine learning outlier statistical analysis strategy; According to the outlier statistical analysis strategy of machine learning, the density threshold and distance threshold for obtaining the detection result data of the detection object are used as the reverse derivation strategy, and the machine learning model is used as the strategy of detection formula; According to the reverse derivation strategy of obtaining the density threshold and distance threshold of the detection result data of the detection object, the obtained density and distance of the detection result data of the detection object are used as the input data information; Based on all inspection result data and the defect boundary distribution information, the density parameters and distance parameters of the inspection strategy of the machine learning model are reversely derived.

The detection recipe setting and optimization method as described in any one of requests 1 to 13, which also includes: According to the detection formula and the values of the detection parameters of the detection formula, defect analysis of the object to be detected is performed to obtain defect data information of the object to be detected.

A detection formula setting and optimization device, which is characterized by including: The true defect and noise marking unit is configured to mark the first data sample to obtain a second data sample; wherein the first data sample includes several pieces of detection result data; the second data sample includes the detection result data Result data and labels corresponding to each test result data; The feature distribution information acquisition unit is configured to obtain the data feature distribution information of the detection object based on the second data sample; The defect distribution boundary acquisition unit is configured to use a preset outlier statistical analysis strategy to perform outlier statistical analysis on the data feature distribution information, obtain defect distribution boundary information, and is used to perform outlier statistical analysis according to the preset outlier statistical analysis strategy. Determine the test formula; The detection parameter setting and optimization unit is configured to set or optimize the values of detection parameters of the detection formula through reverse derivation based on the defect distribution boundary information and the preset outlier statistical analysis strategy.

The detection recipe setting and optimization device as described in claim 15, which also includes: The detection recipe application unit is configured to perform defect analysis on the object to be detected according to the detection formula and the values of detection parameters of the detection formula, and obtain defect data information of the object to be detected.

An electronic device, characterized in that it includes a processor and a storage, and a computer program is stored on the storage. When the computer program is executed by the processor, the method described in any one of claims 1 to 14 is realized. Test recipe settings and optimization methods.

A readable storage medium, characterized in that a computer program is stored in the readable storage medium. When the computer program is executed by a processor, the detection recipe setting and optimization described in any one of claims 1 to 14 are realized. method.