KR20210017431A

KR20210017431A - Apparatus and method for generating control data of polishing process and control device comprising the same

Info

Publication number: KR20210017431A
Application number: KR1020190096657A
Authority: KR
Inventors: 김창욱; 김신영
Original assignee: 연세대학교 산학협력단
Priority date: 2019-08-08
Filing date: 2019-08-08
Publication date: 2021-02-17
Also published as: KR102279045B1

Abstract

The present invention provides an apparatus and a method for generating process control information and a process control apparatus including the same, which improve the yield of a semiconductor production process and improve the reliability of a process. According to an embodiment of the present invention, the apparatus for generating process control information required for a substrate manufacturing process comprises: an input unit to receive target scale information associated with a scale of areas belonging to a substrate to be manufactured, and process parameter information for at least one piece of equipment for manufacturing the substrate; a generation network to generate first process control information required for a process for manufacturing the substrate by using a plurality of layers based on the target scale information and the process parameter information; and a determination network to derive evaluation results for the first process control information by learning based on the first process control information and second process control information which is training data.

Description

Process control information generating apparatus, method, and process control device including the same {Apparatus and method for generating control data of polishing process and control device comprising the same}

본 발명은 공정 제어정보 생성 장치, 방법 및 이를 포함하는 공정 제어장치에 관한 것이다.The present invention relates to an apparatus and a method for generating process control information, and a process control apparatus including the same.

반도체 제조 공정은 매우 복잡한 물리적 화학적 반응과 장비 상태에 따른 비선형적인 공정 프로세스의 변화(process variation) 때문에 균일한 공정 결과를 지속적으로 얻기가 쉽지 않다. 정밀한 공정 제어를 위해 1990년대에 제시된 run-to-run (R2R) control은 매 run (a single wafer 또는 wafer lot 등 가공 단위)마다 process variation를 추정하고 이를 입력에 반영하여 공정 결과가 타깃으로부터 벗어나지 않도록 한다. In the semiconductor manufacturing process, it is difficult to consistently obtain uniform process results due to very complex physical and chemical reactions and non-linear process variations according to equipment conditions. For precise process control, the run-to-run (R2R) control proposed in the 1990s estimates the process variation for each run (processing unit such as a single wafer or wafer lot) and reflects it in the input so that the process result does not deviate from the target. do.

공정 프로세스의 변화의 유형에 해당하는 프로세스 드리프트(process drift)는 장비의 노후화에 따라 서서히 발생하는 프로세스 성능의 악화를 뜻하며, 또 다른 유형인 프로세스 시프트(shift)는 급격한 프로세스 성능의 변화, 예를 들어 장비의 소모품 교체로 인한 프로세스의 급격한 성능 개선을 뜻한다. Process drift, which is a type of process change, refers to a slow deterioration in process performance as equipment ages. Another type, process shift, is a rapid change in process performance, for example. It means a rapid performance improvement of the process by replacing the consumables of the equipment.

전통적으로 R2R control은 선형 시스템 제어에 많이 쓰이는 exponential weighted moving average (EWMA) 방식과 double EWMA 방식을 사용해 왔다. EWMA 방식은 과거 run들의 제어 결과를 가중 평균해 process drift를 추정하는데, 계산 복잡성이 낮아 R2R controller에 우선적으로 고려되어 왔다. Double EWMA 방식은 공정 결과를 추정하는 EWMA filer와 process drift를 추정하는 EWMA filter의 복합 사용으로 process drift에 대처한다.Traditionally, R2R control has used the exponential weighted moving average (EWMA) method and the double EWMA method, which are widely used for linear system control. The EWMA method estimates process drift by weighted average of the control results of past runs, and has been considered preferentially in the R2R controller due to its low computational complexity. The Double EWMA method copes with process drift by using a combination of an EWMA filer that estimates process results and an EWMA filter that estimates process drift.

반도체 제조에 필요한 많은 단위 공정 중 chemical mechanical planarization (CMP) 공정은 process drift가 뚜렷하게 발생하여 R2R control이 반드시 필요하다. 이밖에 3개의 EWMA filter를 사용한 R2R control 모델이 double EWMA 모델 대비 우수한 제어 성능을 보임을 실험을 통해 증명한 연구도 있다. 하지만 이런 EWMA 모델들은 비선형 process drift를 가진 시스템에 적용하기 어려워, drift가 급격하게 변하는 현상이 없을 시에만 사용하도록 권고하고 있다.Among the many unit processes required for semiconductor manufacturing, the chemical mechanical planarization (CMP) process has a distinct process drift, so R2R control is essential. In addition, there is a study that proved through experiment that the R2R control model using three EWMA filters shows superior control performance compared to the double EWMA model. However, these EWMA models are difficult to apply to systems with nonlinear process drift, so it is recommended to use only when the drift does not change rapidly.

Neural network-based R2R (NNR2R) control 방법은 공정 인자들이 복잡한 상관관계를 가진 비선형 시스템의 제어에 매우 유용하다. Neural network는 시스템의 입출력 데이터만을 가지고 network 가중치 조정하기에 다양한 형태의 함수를 학습할 수 있어 비선형 시스템을 모델링하고 제어하는데 효과적이다. CMP 공정에서 NNR2R controller를 사용하여 wafer의 두께가 균일한 profile을 갖도록 입력을 제어하는 것이 가능함을 입증했다. Neural network-based R2R (NNR2R) control method is very useful for controlling nonlinear systems where process factors have complex correlations. Neural network is effective in modeling and controlling nonlinear systems because it can learn various types of functions because it adjusts network weights with only the input/output data of the system. In the CMP process, it was demonstrated that it is possible to control the input so that the thickness of the wafer has a uniform profile by using the NNR2R controller.

NNR2R controller는 2개의 neural network인 process model과 control model로 구성되며, 여기서 process model은 virtual metrology (VM)을 의미한다. VM 모델은 실제 계측에 필요한 비용과 시간을 줄여주거나, process drift를 신속하고 정확하게 감지할 수 있으며, 불량 wafer를 조기에 발견하여 최종 생산량을 향상 시킨다. 이러한 이점으로 최근 기계 학습 방법을 활용하여 VM의 예측 능력을 극대화하기 위한 연구들이 지속되고 있다. VM 연구에는 process drift를 근사하기 위해 neural network을 결합한 VM 모델 구조를 제안한 연구와 에칭 공정의 trace data를 CNN에 사용한 VM 모델링 연구, 그리고 Gaussian process regression 기반 VM 모델을 통해 R2R 제어 성능을 향상시킨 연구 등이 있다. 현재까지 VM 모델에 대한 연구가 활발히 진행되고 있지만 R2R control model과 VM 모델을 통합할 때는 주의가 필요하다. R2R controller의 성능은 VM 모델의 예측 error와 control model의 error에 의해 결정되므로 단일 R2R control model에 비해 error가 높다. 특히 반도체 제조공정처럼 여러 가지 입력 요인이 intercorrelated되어 있고 비선형적인 process drift 현상을 가지는 경우에는 VM 모델의 예측 정확도가 높지 않으면 우수한 제어 성능을 보장하기가 어렵다.The NNR2R controller consists of two neural networks, a process model and a control model, where the process model means virtual metrology (VM). The VM model can reduce the cost and time required for actual measurement, detect process drift quickly and accurately, and improve final production by detecting defective wafers early. Due to this advantage, recent studies have been conducted to maximize the predictive ability of VMs using machine learning methods. In the VM study, a study that proposed a VM model structure combining neural networks to approximate process drift, a VM modeling study using trace data of the etching process in CNN, and a study that improved R2R control performance through a Gaussian process regression-based VM model. There is this. Until now, research on the VM model has been actively conducted, but care must be taken when integrating the R2R control model and the VM model. The performance of the R2R controller is determined by the prediction error of the VM model and the error of the control model, so the error is higher than that of a single R2R control model. In particular, when various input factors are intercorrelated like a semiconductor manufacturing process and a nonlinear process drift phenomenon is present, it is difficult to guarantee excellent control performance unless the prediction accuracy of the VM model is high.

한국 공개 특허 제10-2019-0041908호 (공개)Korean Patent Publication No. 10-2019-0041908 (Public)

본 발명은 상기 전술한 종래의 문제점을 해결하기 위해 반도체 제조업에서 웨이퍼의 계측, R2R 제어, 이상 탐지 등을 아우르는 시스템으로 반도체 생산공정의 수율 향상과 공정의 신뢰성을 향상시키는, 공정 제어정보 생성 장치, 방법 및 이를 포함하는 공정 제어장치를 제공한다.The present invention is a system encompassing wafer measurement, R2R control, abnormality detection, etc. in the semiconductor manufacturing industry in order to solve the above-described conventional problems, and an apparatus for generating process control information, which improves the yield and reliability of the semiconductor production process, It provides a method and a process control device including the same.

상기 목적을 달성하기 위한 본 발명의 실시예에 따른 공정 제어정보 생성 장치는, 제조하고자 하는 기판에 속하는 영역들의 스케일과 관련된 목표 스케일 정보 및 상기 기판을 제조하기 위한 적어도 하나의 장비에 대한 공정 파라미터 정보를 입력 받는 입력부, 상기 목표 스케일 정보 및 상기 공정 파라미터 정보를 기초로 복수개의 레이어들을 이용하여 상기 기판을 제조하기 위한 공정에 필요한 제1 공정 제어정보를 생성하는 생성망 및 상기 제1 공정 제어정보 및 트레이닝 데이터인 제2 공정 제어정보를 기초로 학습함에 따라 상기 제1 공정 제어정보에 대한 평가 결과를 도출하는 판별망을 포함할 수 있다.The apparatus for generating process control information according to an embodiment of the present invention for achieving the above object includes target scale information related to the scale of regions belonging to a substrate to be manufactured and process parameter information for at least one equipment for manufacturing the substrate. A generation network for generating first process control information necessary for a process for manufacturing the substrate by using a plurality of layers based on the input unit receiving the input, the target scale information and the process parameter information, and the first process control information, and It may include a discrimination network that derives an evaluation result for the first process control information by learning based on the second process control information that is training data.

상술한 목적을 달성하기 위한 본 발명의 다른 실시예에 따른 공정 제어정보 생성 방법은, 제조하고자 하는 기판에 속하는 영역들의 스케일과 관련된 목표 스케일 정보 및 상기 기판을 제조하기 위한 적어도 하나의 장비에 대한 공정 파라미터 정보를 입력 받는 단계, 상기 목표 스케일 정보 및 상기 공정 파라미터 정보를 기초로 복수개의 레이어들을 이용하여 상기 기판을 제조하기 위한 공정에 필요한 제1 공정 제어정보를 생성하는 단계 및 상기 제1 공정 제어정보 및 트레이닝 데이터인 제2 공정 제어정보를 기초로 학습함에 따라 상기 제1 공정 제어정보에 대한 평가 결과를 도출하는 단계를 포함할 수 있다.A method of generating process control information according to another embodiment of the present invention for achieving the above object includes target scale information related to the scale of regions belonging to a substrate to be manufactured, and a process for at least one equipment for manufacturing the substrate. Receiving parameter information, generating first process control information necessary for a process for manufacturing the substrate using a plurality of layers based on the target scale information and the process parameter information, and the first process control information And deriving an evaluation result for the first process control information by learning based on the second process control information that is training data.

상술한 목적을 달성하기 위한 본 발명의 또 다른 실시예에 따른 공정 제어 장치는, 폴리싱하고자 하는 기판에 속하는 영역들의 스케일과 관련된 목표 스케일정보를 입력 받아, 상기 기판을 폴리싱하기 위한 적어도 하나의 장비의 압력 설정 정보 및 상기 장비의 폴리싱 시간과 관련한 시간 설정 정보를 포함하는 공정 제어정보를 생성하는 공정 제어정보 생성부 및 상기 공정 제어정보를 기반으로 상기 장비의 공정조건을 제어하는 제어부;를 포함하되, 상기 제어부는 상기 제어된 장비의 공정조건 및 상기 장비의 누적 사용시간을 상기 공정 제어정보 생성부로 전달할 수 있다.In order to achieve the above object, a process control apparatus according to another embodiment of the present invention receives target scale information related to scales of regions belonging to a substrate to be polished, and includes at least one device for polishing the substrate. Including; a process control information generating unit for generating process control information including pressure setting information and time setting information related to the polishing time of the equipment, and a control unit for controlling the process conditions of the equipment based on the process control information; The control unit may transmit the process conditions of the controlled equipment and the accumulated use time of the equipment to the process control information generation unit.

또한 본 발명은 상기한 방법에 따른 공정 제어정보 생성 방법을 실행시키는 컴퓨터로 판독 가능한 기록매체에 저장된 컴퓨터 프로그램을 제안한다.In addition, the present invention proposes a computer program stored in a computer-readable recording medium for executing the method for generating process control information according to the above method.

본 발명의 실시예에 따른 공정 제어정보 생성 장치, 방법 및 이를 포함하는 공정 제어장치는 장비의 노후화에 의해 발생하는 프로세스 성능 악화에 따른 프로세스 변화를 정확하게 추정하여 웨이퍼를 효과적으로 폴리싱하기 위한 제어값들을 제공할 수 있는 효과가 있다The apparatus and method for generating process control information and a process control apparatus including the same according to an embodiment of the present invention provide control values for effectively polishing wafers by accurately estimating process changes caused by process performance deterioration caused by equipment aging. There is an effect that can be done

도1은 일 실시예에 따른 기판(웨이퍼) 제조 공정에 사용되는 부품 및 장비를 도시한 도면이다.
도2는 본 발명의 일 실시예에 따른 공정 제어 장치의 구성을 개략적으로 도시한 블록도이다.
도3은 본 발명의 일 실시예에 따른 공정 제어정보를 출력하기 위한 학습과정을 설명하기 위해 도시한 개념도이다.
도4는 본 발명의 일 실시예에 따른 생성망의 구성을 개략적으로 도시한 블록도이다.
도5는 본 발명의 일 실시예에 따른 드리프트 특징정보 생성부를 설명하기 위해 도시한 도3의 확대도이다.
도6은 본 발명의 일 실시예에 따른 공정 제어 장치의 구성을 개략적으로 도시한 블록도이다.
도7은 성능 평가를 위한 지표를 도시한 도면이다.
도8은 도7에서의 두 비교 모델 별로 hyper-parameter를 BO로 최적화하여 예측 성능을 높이는 실험을 수행한 결과이다.
도9은 hyper-parameter 최적화를 위한 실험 결과를 나타낸 도면이다.
도10는 hyper-parameter 최적화를 위한 실험 결과를 나타낸 도면이다.
도11은 R2R 제어 알고리즘을 실험한 결과를 나타낸 도면이다.1 is a diagram illustrating components and equipment used in a substrate (wafer) manufacturing process according to an exemplary embodiment.
2 is a block diagram schematically showing the configuration of a process control apparatus according to an embodiment of the present invention.
3 is a conceptual diagram illustrating a learning process for outputting process control information according to an embodiment of the present invention.
4 is a block diagram schematically showing the configuration of a generation network according to an embodiment of the present invention.
5 is an enlarged view of FIG. 3 illustrating a drift characteristic information generation unit according to an embodiment of the present invention.
6 is a block diagram schematically showing the configuration of a process control apparatus according to an embodiment of the present invention.
7 is a diagram showing an index for performance evaluation.
FIG. 8 is a result of an experiment to improve prediction performance by optimizing a hyper-parameter to BO for each of the two comparison models in FIG. 7.
9 is a diagram showing an experiment result for hyper-parameter optimization.
10 is a diagram showing an experiment result for hyper-parameter optimization.
11 is a diagram showing a result of an experiment of an R2R control algorithm.

첨부한 도면을 참조하여 본 발명의 바람직한 실시예를 설명함으로써, 본 발명을 상세히 설명한다. 그러나, 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며, 설명하는 실시예에 한정되는 것이 아니다. 그리고, 본 발명을 명확하게 설명하기 위하여 설명과 관계 없는 부분은 생략되며, 도면의 동일한 참조부호는 동일한 부재임을 나타낸다. The present invention will be described in detail by describing a preferred embodiment of the present invention with reference to the accompanying drawings. However, the present invention may be implemented in various different forms, and is not limited to the described embodiments. Further, in order to clearly describe the present invention, parts not related to the description are omitted, and the same reference numerals in the drawings indicate the same members.

명세서 전체에서, 어떤 부분이 어떤 구성요소를 "포함" 한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라, 다른 구성요소를 더 포함할 수 있는 것을 의미한다. 또한, 명세서에 기재된 "...부", "...기", "모듈", "블록"등의 용어는 적어도 하나의 기능이나 동작을 처리하는 단위를 의미하며, 이는 하드웨어나 소프트웨어 또는 하드웨어 및 소프트웨어의 결합으로구현될 수 있다.Throughout the specification, when a certain part "includes" a certain component, it means that other components may be further included, rather than excluding other components unless otherwise stated. In addition, terms such as "... unit", "... group", "module", and "block" described in the specification mean a unit that processes at least one function or operation, which is hardware, software, or hardware. And software.

이하, 본 발명의 일 실시예를 첨부된 도면들을 참조하여 상세히 설명한다. 또한, 본 발명을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략할 수 있다.Hereinafter, an embodiment of the present invention will be described in detail with reference to the accompanying drawings. In addition, in describing the present invention, when it is determined that a detailed description of a related known configuration or function may obscure the subject matter of the present invention, a detailed description thereof may be omitted.

반도체 사업에서는 단위 면적당 더 많은 반도체 소자를 만들기 위해 미세 회로 레이어를 웨이퍼(wafer)에 적층하는 구조가 필요하게 되었다. 하지만 미세 회로 패턴을 형성하는 lithography 공정과 필요한 회로 패턴을 제외한 나머지 레이어 부분을 제거하는 에칭(etching) 공정이 반복되면서 회로가 생성된 높은 부분과 나머지 낮은 부분의 표면 단차가 발생된다. 표면 단차는 후속 레이어의 회로 패턴을 형성하는 lithograpy 공정에 해상도에 치명적인 요소로 작용하여 반도체 소자의 불량을 야기하게 된다. 따라서 적층 구조를 구현하기 위해서는 웨어퍼의 국부적인 평탄화 및 미세 회로 패턴에 필요한 광역적인 평탄화가 가능한 CMP 공정이 필요하게 되었다. CMP(chemical-mechanical polishing) 공정은 도1과 같은 로터리(rotaty) 장비를 사용한다. 이 장비는 웨이퍼를 장착하는 캐리어(carrier)와 폴리싱 패드(polishing pad)를 장착하는 플래턴(platen), 패드(pad)를 청소하는 드레서(dresser), 그리고 연마 슬러리(slurry)가 공급되는 장치로 구성될 수 있다. CMP 공정은 연마 입자가 포함된 슬러리와 웨이퍼 표면의 소재 사이에서 발생하는 화학적 작용과 폴리싱 패드와 웨이퍼의 직접적인 접촉에 의한 기계적 작용을 통해 웨이퍼의 필름을 평탄화한다. CMP의 소모성 부품(폴리싱 패드, 리테이너 링(retainer ring), 멤브래인(membrane), 및 드레서)을 사용하는 대표적 공정이다. 소모성 부품은 수명에 따라 소재제거율(material removal rate)을 달리하게 만드는 드리프트(drift)를 발생시키고, 또한 장비의 소모성 부품을 교체하면 제거율이 개선되는 프로세스 시프트(process shift)도 발생한다. 이런 프로세스의 다양한 변화 때문에 CMP 공정은 제어하기 힘든 대표적인 비선형 시스템이다.In the semiconductor business, a structure in which microcircuit layers are stacked on a wafer is required to make more semiconductor devices per unit area. However, as the lithography process of forming the microcircuit pattern and the etching process of removing the remaining layer portions except for the necessary circuit pattern are repeated, a surface level difference between the high portion where the circuit is generated and the remaining low portion occurs. The surface level difference acts as a critical factor for the resolution in the lithograpy process of forming the circuit pattern of the subsequent layer, causing a defect in the semiconductor device. Therefore, in order to implement a stacked structure, a CMP process that enables local flattening of the wearer and global flattening required for fine circuit patterns is required. The chemical-mechanical polishing (CMP) process uses rotary equipment as shown in FIG. 1. This equipment is a device that supplies a carrier for wafers, a platen for a polishing pad, a dresser for cleaning the pad, and a polishing slurry. Can be configured. In the CMP process, a film of a wafer is flattened through a chemical action occurring between a slurry containing abrasive particles and a material on the wafer surface, and a mechanical action caused by direct contact between the polishing pad and the wafer. It is a typical process using CMP's consumable parts (polishing pads, retainer rings, membranes, and dressers). Consumable parts generate a drift that causes the material removal rate to vary depending on their lifespan, and a process shift that improves the removal rate occurs when the consumable parts of the equipment are replaced. Because of the various variations in these processes, the CMP process is a typical nonlinear system that is difficult to control.

CMP 공정의 품질은 웨이퍼에 남은 소재의 두께이다. Rotary CMP 장비는 웨이퍼의 영역별 두께를 제어하기 위해 웨이퍼에 맞닿아 있는 캐리어의 멤브래인이 각 영역별로 다른 압력을 가할 수 있도록 설계되어 있어 영역별로 소재의 두께를 관리할 수 있다. 도1에 도시된 Z1~Z7은 압력이 가해지는 웨이퍼의 영역들(zones)을 나타낸다.The quality of the CMP process is the thickness of the material remaining on the wafer. Rotary CMP equipment is designed so that the membrane of the carrier in contact with the wafer can apply a different pressure to each area to control the thickness of each area of the wafer, so it is possible to manage the thickness of the material by area. Z1 to Z7 shown in FIG. 1 denote zones of the wafer to which pressure is applied.

CMP 공정 제어를 위해서는 제어 변수들(control variables)과 부품 파라미터들(equipment parameters)이 필요하다. 제어 변수로는 플래턴(platen)과 캐리어(carrier)의 상대속도, 폴리싱 시간, 슬러리 유량, 드레서(dresser)의 속도, 그리고 웨이퍼(기판)의 영역별로 패드에 누르는 압력 등이 있지만, 실제 생산 공정에서 영역의 압력과 폴리싱 시간을 제외한 제어 변수들은 기존 사용 값에서 변화량이 크지 않도록 엄격하게 통제되어 있다. 부품 파라미터들은 소모품의 수명을 기록하며 프로세스 변화 모델링(process variation modeling)을 위한 정보로 활용된다. Control variables and equipment parameters are required for CMP process control. Control variables include the relative speed of the platen and carrier, polishing time, slurry flow rate, dresser speed, and pressure applied to the pad by region of the wafer (substrate), but the actual production process Except for the pressure and polishing time in the area, the control variables are strictly controlled so that the amount of change is not large from the existing value. Part parameters record the life of the consumable and are used as information for process variation modeling.

이하에서는 전술한 바와 같은 비선형 시스템인 CMP 공정에 최적화하여 다양한 환경 변화에도 불구하고 최적의 웨이퍼(기판)의 영역별 원하는 두께를 구현할 수 있도록 하는 본 발명의 실시예에 따른 공정 제어 정보 생성 장치, 공정 제어 장치 및 방법의 구성을 관련된 도면을 참조하여 상세히 설명한다. Hereinafter, the apparatus and process for generating process control information according to an embodiment of the present invention, which optimizes the CMP process, which is a nonlinear system as described above, so that the desired thickness for each region of the wafer (substrate) can be realized despite various environmental changes. The configuration of the control device and method will be described in detail with reference to the related drawings.

본 발명의 실시예에 따른 공정 제어 정보 생성장치는 매 공정(run)마다 웨이퍼(기판)의 각 영역별로 타겟 두께를 얻기 위해 부품 파라미터들을 고려하여 영역의 압력과 폴리싱 시간을 결정하는 제어 모델을 수립하는 것을 목적으로 한다.The apparatus for generating process control information according to an embodiment of the present invention establishes a control model that determines the pressure and polishing time of the region in consideration of component parameters in order to obtain a target thickness for each region of the wafer (substrate) for each run. It aims to do.

본 발명의 실시예에 따른 공정 제어 장치는 입력부, 공정 제어 정보 생성부, 제어부, 및 업데이트부를 포함하여 구성될 수 있다. 상기 공정 제어 장치는 이미 이전에 기계학습이 완료됨에 따라 내부의 각 레이어의 필터 계수들이 설정된 상태로 실제 공정에 적용되었을 때 이용되는 장치인 것이다. 이와 같은 공정 제어 장치는 공정에 적용되기 전 학습 과정에 적용되는 공정 제어정보 생성 장치에 대하여 설명한 후 자세하게 설명하도록 한다.A process control apparatus according to an embodiment of the present invention may include an input unit, a process control information generation unit, a control unit, and an update unit. The process control device is a device that is used when the filter coefficients of each internal layer are set and applied to an actual process as machine learning has already been completed. Such a process control device will be described in detail after the process control information generating device applied to the learning process before being applied to the process is described.

도2는 본 발명의 일 실시예에 따른 공정 제어정보 생성 장치의 구성을 개략적으로 도시한 블록도이다. 도2를 참조하면, 본 발명의 실시예에 따른 공정 제어정보 생성 장치는, 입력부(100), 학습부(200), 제어부(300) 및 업데이트부(400)를 포함하여 구성될 수 있다.2 is a block diagram schematically showing a configuration of an apparatus for generating process control information according to an embodiment of the present invention. Referring to FIG. 2, an apparatus for generating process control information according to an embodiment of the present invention may include an input unit 100, a learning unit 200, a control unit 300, and an update unit 400.

본 발명의 실시예에 따른 입력부(100)는 제조하고자 하는 기판에 속하는 영역들의 스케일과 관련된 목표 스케일 정보 및 기판(이하, 웨이퍼)을 제조하기 위한 적어도 하나의 부품에 대한 공정 파라미터 정보를 입력 받을 수 있다. The input unit 100 according to an embodiment of the present invention may receive target scale information related to the scale of regions belonging to the substrate to be manufactured and process parameter information for at least one component for manufacturing the substrate (hereinafter, referred to as wafer). have.

여기서, 목표 스케일 정보란 웨이퍼를 폴리싱 하는데 있어서 달성하고자 하는 웨이퍼의 영역별 목표 두께 값을 의미하는 것으로서 사용자로부터 입력받을 수 있고, 공정 파라미터 정보는 도1에 도시된 바와 같은 웨이퍼를 폴리싱하기 위해 사용되는 부품들이 웨이퍼를 폴리싱하는데 사용된 누적 시간을 의미할 수 있다.Here, the target scale information means a target thickness value for each area of the wafer to be achieved in polishing the wafer, and can be input from the user, and the process parameter information is used to polish the wafer as shown in FIG. This can mean the cumulative time the parts have been used to polish the wafer.

일 실시예에 따른 입력부(100)는 도2에 도시된 바와 같이, 별도로 마련되어 목표 두께 값 및 부품의 누적 사용시간에 대한 정보를 입력 받아, 생성망(210) 및 판별망(220)으로 전달하도록 구현될 수도 있고, 입력부(100) 없이 바로 입력 정보들(목표 두께 값, 부품 사용 누적 시간 등) 각각이 바로 생성망(210) 또는 판별망(220)으로 입력되어 학습될 수도 있다.The input unit 100 according to an embodiment is provided separately, as shown in FIG. 2, to receive information on the target thickness value and the cumulative use time of the part, and transmit it to the generation network 210 and the discrimination network 220. It may be implemented, and each of the input information (target thickness value, accumulated part usage time, etc.) may be directly input to the generation network 210 or the discrimination network 220 for learning without the input unit 100.

그리고, 본 발명의 실시예에 따른 학습부(200)는 상술한 기계 학습이 완료된 이후 실제 공정에 적용시키기 위한 공정 제어 장치의 공정 제어정보 생성부의 각 레이어의 필터계수를 설정하기 위해서 기계 학습하는 구성으로, 본 발명의 학습부(200)는 생성망(210) 및 판별망(220)으로 구성될 수 있다.In addition, the learning unit 200 according to an embodiment of the present invention performs machine learning in order to set the filter coefficient of each layer of the process control information generation unit of the process control device to apply to the actual process after the above-described machine learning is completed. As such, the learning unit 200 of the present invention may be composed of a generation network 210 and a discrimination network 220.

생성망(210)은 입력부(100)에 입력된 스케일 정보, 및 공정 파라미터 정보를 기초로 복수개의 레이어들을 이용하여 웨이퍼를 제조하기 위한 공정에 필요한 제1 공정제어 정보를 생성할 수 있다.The generation network 210 may generate first process control information necessary for a process for manufacturing a wafer by using a plurality of layers based on scale information input to the input unit 100 and process parameter information.

판별망(220)은 제1 공정 제어정조 및 트레이닝 데이터로서 실제 기판 제조공정에 적용되었던 제2 공정 제어정보를 기초로 생성망(210)에 의해 생성된 제1 공정 제어정보에 대한 평가 결과를 도출할 수 있다.The discrimination network 220 derives an evaluation result for the first process control information generated by the generation network 210 based on the second process control information applied to the actual substrate manufacturing process as first process control adjustment and training data. can do.

도2와 같은 구조의 공정 제어정보 생성 장치는 매 공정(run) 과정이 완료될 때마다 업데이트부(400)를 이용하여 최근 가공된 공정 데이터(run data)를 입력부 및 학습부로 전달함에 따라 학습할 수 있다. The process control information generating apparatus having the structure as shown in FIG. 2 can learn by transmitting the recently processed process data to the input unit and the learning unit using the update unit 400 whenever each run process is completed. I can.

여기서, 본 발명의 학습부(200)는 생성망(210)과 생성망 학습을 경쟁을 통해 도와주는 판별망(220)으로 구성된 LSGAN(Least Squares Generative Adversarial Network)이다. 위에서 설명한 종래의 NNR2R(neural network-based run to run control)이 VM(virtual metrology) 모델을 이용하여 제어 값을 찾는 간접 제어 방식인 반면에, 본 발명의 생성망(210)은 제어-출력간 관계를 판별망(220)과의 경쟁을 통해 학습하여 R2R 제어에 직접 사용한다. 따라서, 학습된 생성망(210)은 목표 출력값(목표 스케일 정보)을 입력 받으면, 달성하기 위한 공정 제어정보(제어 값)을 출력할 수 있다.Here, the learning unit 200 of the present invention is a Least Squares Generative Adversarial Network (LSGAN) composed of a generation network 210 and a discrimination network 220 that helps the generation network learning through competition. While the conventional neural network-based run to run control (NNR2R) described above is an indirect control method that finds a control value using a virtual metrology (VM) model, the generation network 210 of the present invention has a control-output relationship. Is learned through competition with the discriminant network 220 and used directly for R2R control. Accordingly, when the learned generation network 210 receives the target output value (target scale information), it may output process control information (control value) to be achieved.

본 발명의 생성망(200)은 아래 <수학식1>과 같은 제어 함수를 이용하여 공정 제어정보를 생성할 수 있다.The generation network 200 of the present invention may generate process control information using a control function as shown in Equation 1 below.

여기서, u_k는 생성망(210)으로부터 출력되는 출력 제어 값이고, y는 출력 벡터(output vector), u는 제어 벡터(control vector), n은 생성망(210)에서 참조되는 가공된 공정(run)의 수를 의미한다. 생성망(210)은 업데이트부(400)로부터 가장 최근의 공정 제어 히스토리 정보(제어값) 그리고 출력 데이터와 폴리싱 공정에 이용되는 부품들의 누적 사용시간을 제공 받아, 최종 폴리싱되는 웨이퍼의 영역별 출력 두께값(y_k)과 사용자로부터 입력부(100)로 입력된 목표 두께값(y^target)이 같아지도록 하는 공정 제어정보(u_k)를 도출할 수 있다. Here, u _k is an output control value output from the generation network 210, y is an output vector, u is a control vector, and n is a processed process referenced by the generation network 210 ( It means the number of run). The generation network 210 receives the most recent process control history information (control value) from the update unit 400, output data, and the cumulative use time of parts used in the polishing process, and the output thickness for each area of the final polished wafer It is possible to derive process control information u _k that makes the value y _k and the target thickness y ^target input to the input unit 100 from the user become the same.

이때, 공정 제어 히스토리 정보는 공정의 시퀀스 정보, 각각의 장비를 제어하기 위한 제어 파라미터 정보를 포함할 수 있다. 또한, 공정 제어 히스토리 정보는 상기 시퀀스에서의 목표 스케일 정보를 더 포함할 수 있다. 여기에서 제어 파라미터 정보는, 예를 들어, 과거 복수번의 공정들 각각에서의 제어값, 즉 과거 공정에서 설정되었던 부품의 압력값 및 폴리싱 시간, 생성망의 각 레이블의 필터 계수(가중치), 최종 폴리싱된 영역별 두께값들을 포함하는 웨이퍼 폴리싱 공정에서의 모든 파라미터 정보들을 의미할 수 있다. In this case, the process control history information may include process sequence information and control parameter information for controlling respective equipment. In addition, the process control history information may further include target scale information in the sequence. Here, the control parameter information is, for example, the control value of each of the past multiple processes, that is, the pressure value and polishing time of the part set in the past process, the filter coefficient (weight) of each label of the generation network, and the final polishing. This may mean all parameter information in the wafer polishing process including thickness values for each region.

상기와 같은 공정 제어정보를 이용하여 생성망(210)이 제1 공정 제어정보를 생성하기 위한 제어 함수는 아래 <수학식2>와 같이 정의될 수 있다.A control function for generating the first process control information by the generation network 210 using the process control information as described above may be defined as shown in Equation 2 below.

여기서, G(·)는 생성망(210)에서 이용하는 함수이고,

는 생성망(210)이 생성하는 공정 제어정보이며, Φ는 공정 파라미터 정보인 부품의 누적 사용시간이고, W_drift 및 W_R2R은 생성망에서의 학습 가중치(레이어들의 필터계수)이다. Here, G(·) is a function used in the generation network 210,

Is the process control information generated by the generating network 210, Φ is the cumulative use time of the part, which is the process parameter information, and W _drift and W _R2R are the learning weights (filter coefficients of layers) in the generating network.

CMP 공정의 프로세스 드리프트(process drift)에 대처하기 위해서는 본 발명의 공정 제어정보 생성 장치에서의 매 공정(run)이 완료될 때마다 최근 공정의 공정 파라미터 정보을 기반으로 생성망(210)의 가중치(W_drift)를 갱신할 수 있다. 이어서, 생성망(210)과 판별망(220)의 경쟁적 학습을 통해 목표 스케일 정보(목표 두께 값)을 달성하기 위한 가중치(W_R2R)를 갱신하고 학습이 완료되면 목표 두께 값(y^target)을 달성하기 위한 공정 제어정보(u_k)를 출력할 수 있다.In order to cope with the process drift of the CMP process, the weight (W) of the generation network 210 is based on the process parameter information of the latest process whenever each run in the process control information generation apparatus of the present invention is completed. _drift ) can be updated. Subsequently, the weight (W _R2R ) for achieving target scale information (target thickness value) is updated through competitive learning of the generation network 210 and the discrimination network 220, and when the learning is completed, the target thickness value (y ^target ) is Process control information (u _k ) to achieve can be output.

종래의 신경망인 GAN은 [0, 1] 범위 내에서 정의된 임의의 latent 벡터(z)에 대응되는 출력 값이 실제 출력 값과 같아지도록 학습이 진행된다. 반면에 본 발명의 공정 제어정보 생성 장치에서의 현재 공정의 제어 변수들은 이전 공정들에서의 제어변수들과 시간적으로 종속적인 관계에 있다. 따라서, 본 발명의 공정 제어정보 생성 장치는 종래의 GAN과 같이 latent 벡터(z)에 대한 제어 변수 값들(출력 값)을 학습하지 않고, 이전 공정들에서의 공정 제어 변수들에 대한 특징정보를 기초로 하여 현재 공정에서의 공정 제어변수 값을 예측할 수 있도록 한다.The GAN, which is a conventional neural network, is trained so that an output value corresponding to an arbitrary latent vector (z) defined within [0, 1] is the same as an actual output value. On the other hand, the control variables of the current process in the process control information generating apparatus of the present invention have a temporally dependent relationship with the control variables of previous processes. Accordingly, the apparatus for generating process control information of the present invention does not learn control variable values (output values) for the latent vector z like a conventional GAN, but based on feature information on process control variables in previous processes. This makes it possible to predict the value of the process control variable in the current process.

도3은 본 발명의 일 실시예에 따른 공정 제어정보를 출력하기 위한 학습과정을 설명하기 위해 도시한 개념도이고, 도4는 본 발명의 일 실시예에 따른 생성망(210)의 구성을 개략적으로 도시한 블록도이다.3 is a conceptual diagram illustrating a learning process for outputting process control information according to an embodiment of the present invention, and FIG. 4 is a schematic diagram of a configuration of a generating network 210 according to an embodiment of the present invention. It is a block diagram shown.

도3 및 도4에 도시된 바와 같이, 본 발명의 생성망(210)은 드리프트 특징정보 생성부(211), FC 특징정보 생성부(212), 및 출력부(213)를 더 포함하여 구성될 수 있다. FC 특징정보 생성부(212)에 현재 공정에서의 부품 누적 사용시간에 대한 정보(11)와 웨이퍼의 영역별 목표 두께값(12)을 입력하면, 학습된 FC 특징정보 생성부(212)는 공정 제어정보인 영역별 폴리싱 시간과 폴리싱 압력값을 출력한다. 프로세스의 변화들을 공정 제어에 반영하기 위해 생성망(210)은 FC 특징정보 생성부(212) 앞에 드리프트 특징정보 생성부(211)를 구성한다. 여기서, 본 발명의 드리프트 특징정보 생성부(211)는 CNN(Convolution Neural Network)일 수 있다. 드리프트 특징정보 생성부(211)는 매 공정(run)이 진행됨에 따라 슬라이딩 윈도(sliding window)를 이용하여 최근 처리되었던 과거 공정들(runs)의 데이터를 업데이트부(400)로부터 전달 받아 공정 제어 히스토리 정보를 업데이트 할 수 있다. 그 후, FC 특징정보 생성부(212)는 업데이트된 공정 제어 히스토리 정보를 기반으로 드리프트 특징정보 생성부(211)로부터 출력되는 드리프트 특징 정보를 입력 받아, 판별망(220)과 경쟁적 학습을 진행하고, 이와 같은 학습이 완료되면 폴리싱을 위한 부품을 제어하기 위한 공정 제어정보(제어 값)를 출력할 수 있다.3 and 4, the generation network 210 of the present invention may further include a drift characteristic information generation unit 211, an FC characteristic information generation unit 212, and an output unit 213. I can. When the information 11 on the cumulative use time of parts in the current process and the target thickness value 12 for each area of the wafer are input to the FC feature information generator 212, the learned FC feature information generator 212 Outputs the control information, polishing time and polishing pressure value for each area. In order to reflect process changes to process control, the generation network 210 configures a drift characteristic information generation unit 211 in front of the FC characteristic information generation unit 212. Here, the drift characteristic information generation unit 211 of the present invention may be a convolution neural network (CNN). As each run proceeds, the drift characteristic information generation unit 211 receives data of the past processes that have been recently processed using a sliding window from the update unit 400 to provide a process control history. Information can be updated. After that, the FC characteristic information generation unit 212 receives the drift characteristic information output from the drift characteristic information generation unit 211 based on the updated process control history information, and performs competitive learning with the discrimination network 220. , When such learning is completed, process control information (control value) for controlling a part for polishing may be output.

상술한 내용을 도3 내지 도5를 참조하여 각 구성에 대한 동작을 순서에 따라 보다 상세하게 설명한다. 도5는 본 발명의 일 실시예에 따른 드리프트 특징정보 생성부를 설명하기 위해 도시한 도3의 확대도이다. 도5를 참조하면, 본 발명의 실시예에 따른 드리프트 특징 정보 생성부(211)는 인코딩부(510), 제2 특징 맵 생성부(520) 및 제3 특징 맵 생성부(530)를 더 포함하여 구성될 수 있다.The above-described contents will be described in more detail with reference to FIGS. 3 to 5 in order to explain the operation of each component. 5 is an enlarged view of FIG. 3 illustrating a drift characteristic information generation unit according to an embodiment of the present invention. Referring to FIG. 5, the drift feature information generation unit 211 according to an embodiment of the present invention further includes an encoding unit 510, a second feature map generator 520, and a third feature map generator 530. It can be configured.

본 발명의 입력부(100)는 공정 파라미터 정보인 부품 누적 사용시간에 대한 정보(11), 웨이퍼의 각 영역에 대한 목표 두께 값(12), 및 공정 제어 히스토리 정보와 스케일 히스토리 정보(13)를 입력 받는다. The input unit 100 of the present invention inputs information 11 on the cumulative use time of parts, which are process parameter information, a target thickness value 12 for each area of the wafer, and process control history information and scale history information 13 Receive.

그리고, 본 발명의 실시예에 따른 드리프트 특징정보 생성부(211)의 인코딩부(510)는 이전에 제조된 웨이퍼들 각각의 시퀀스(sequence) 정보, 웨이퍼들을 제조할 때 사용되었던 부품에 대한 공정 파라미터 정보, 이전에 제조된 웨이퍼들 각각을 제조할 때 소요되었던 폴리싱 시간, 및 폴리싱 압력값을 포함하는 공정 제어 히스토리 정보(13)를 이용하여 제1 특징 맵(511)을 생성할 수 있다. In addition, the encoding unit 510 of the drift characteristic information generation unit 211 according to an embodiment of the present invention includes sequence information of each of previously manufactured wafers, and process parameters for parts used when manufacturing the wafers. The first feature map 511 may be generated by using the information, the polishing time required to manufacture each of the previously manufactured wafers, and the process control history information 13 including the polishing pressure value.

일 실시예에 따른 제1 특징 맵(511)은 웨이퍼별 각 시퀀스 정보(세로 축)에 따른 상기 공정 파라미터 정보, 폴리싱 시간, 폴리싱 압력, 및 폴리싱된 웨이퍼의 영역별 출력 두께 값(가로 축)들을 표현한 것으로서, 0 내지 1 사이의 특징 값으로 표현된 특징 맵일 수 있다.The first feature map 511 according to an embodiment includes the process parameter information, polishing time, polishing pressure, and output thickness values (horizontal axis) for each region of the polished wafer according to each sequence information (vertical axis) for each wafer. As expressed, it may be a feature map expressed as a feature value between 0 and 1.

그리고, 본 발명의 실시예에 따른 제2 특징 맵 생성부(520)는 제1 필터(521)를 이용하여 제1 특징 맵(511)(컨볼루션 레이어)을 필터링함에 따라, 동일 공정과정에서의 공정 파라미터 정보들간의 연관성을 고려한 제2 특징 맵을 생성할 수 있다. In addition, the second feature map generation unit 520 according to an embodiment of the present invention filters the first feature map 511 (convolution layer) using the first filter 521, A second feature map may be generated in consideration of correlation between process parameter information.

일 예로, 제2 특징 맵 생성부(520)는 1 X 2 크기의 제1 필터(521)를 이용하여 제1 특징 맵(511)을 컨볼루션 필터링함에 따라 웨이퍼들 간의 공정 변화와 관련된 제2 특징 맵을 생성할 수 있다. As an example, the second feature map generation unit 520 convolutionally filters the first feature map 511 using the first filter 521 having a size of 1 X 2, so that a second feature related to a process change between wafers You can create a map.

또한, 본 발명의 실시예에 따른 제3 특징 맵 생성부(530)는 제2 필터(531)(컨볼루션 레이어)를 이용하여 제1 특징 맵(511)을 컨볼루션 필터링함에 따라, 동일한 공정 파라미터 정보를 기준으로 연속된 공정 과정들간의 연관성을 고려한 제3 특징 맵을 생성할 수 있다.In addition, the third feature map generator 530 according to an embodiment of the present invention convolutionally filters the first feature map 511 using the second filter 531 (convolution layer), so that the same process parameters Based on the information, a third feature map may be generated in consideration of correlation between successive process processes.

일 예로, 제3 특징 맵 생성부(530)는 2 X 1 크기의 제2 필터(531)를 이용하여 제1 특징 맵(511)을 컨볼루션 필터링함에 따라 공정 제어 히스토리 정보들 간의 공정 변화와 관련된 제3 특징 맵들을 생성할 수 있다.As an example, the third feature map generator 530 convolutionally filters the first feature map 511 using a second filter 531 having a size of 2×1, Third feature maps may be generated.

CNN인 드리프트 특징정보 생성부(211)는 복수개의 컨볼루션 레이어(convolution layer)들과 풀링 레이어(pooling layer)들을 적층한 구조이다. 드리프트 특징정보 생성부(211)는 제1 필터 및 제2 필터를 이용하여, 컨볼루션 레이어인 제2 특징 맵 및 제3 특징 맵을 생성하며, 풀링 레이어들은 제2 특징 맵 및 제3 특징 맵에서 대표적인 특징값을 선택하여 각각의 특징 맵의 크기를 줄인다. 이러한 방법으로 드리프트 특징정보 생성부(211)는 process drift를 고려한 드리프트 특징정보를 생성하여, FC 특징정보 생성부(212)로 전달한다. 이렇게 생성된 드리프트 특징정보는 FC 특징정보 생성부(212)의 FC 레이어(fully connected layer)에 전달되어 이미지 인식이나 분류에 사용된다.The drift characteristic information generation unit 211, which is a CNN, has a structure in which a plurality of convolution layers and pooling layers are stacked. The drift feature information generation unit 211 generates a second feature map and a third feature map, which are convolutional layers, using the first filter and the second filter, and the pooling layers are in the second feature map and the third feature map. Select a representative feature value to reduce the size of each feature map. In this way, the drift characteristic information generation unit 211 generates drift characteristic information in consideration of process drift, and transmits it to the FC characteristic information generation unit 212. The drift feature information generated in this way is transmitted to the FC layer (fully connected layer) of the FC feature information generator 212 and is used for image recognition or classification.

즉, 도3에 도시된 바와 같은 최근 n개의 과거에 폴리싱된 공정 제어정보들은 2차원 공정 이미지로 생각할 수 있다. 제1 필터(521)는 동일 공정에서의 변수들 간의 연관성을 고려하여 특징 맵을 생성하는 것이라면, 제2 필터(531)는 연속된 공정들간의 상관성을 고려한 특징 맵을 생성하는 것일 수 있다. That is, the process control information polished in the past n pieces as shown in FIG. 3 can be considered as a two-dimensional process image. The first filter 521 may generate a feature map in consideration of correlation between variables in the same process, while the second filter 531 may generate a feature map in consideration of correlation between successive processes.

제2 특징 맵과 제3 특징 맵 각각은 아래 <수학식3>에 의해 생성될 수 있다.Each of the second feature map and the third feature map may be generated by Equation 3 below.

여기서,

는 1X2 크기의 제1 필터를 사용한 제2 특징 맵의 노드(i, j)이며,

는 2X1 크기의 제2 필터를 사용한 제3 특징 맵의 노드(i, j)이다. α는 활성 함수이고, H와 W는 입력 데이터의 열과 행의 크기이며, p와 q는 각 필터의 행과 열이고, s는 각 필터의 이동 간격인 stride length를 의미한다. 또한,

는 입력 데이터의 (si+p, sj+q) 좌표를 의미하고, W_p,q와 b는 각 필터의 p와 q에 위치하는 가중치(weight)와 바이어스(bias)를 의미한다. 또한, 위 <수학식2>에서의 W_drift는 컨볼루션 레이어들의 가중치들을 의미한다.here,

Is the node (i, j) of the second feature map using the first filter of 1X2 size,

Is the node (i, j) of the third feature map using the second filter of 2X1 size. α is the activation function, H and W are the sizes of the columns and rows of the input data, p and q are the rows and columns of each filter, and s is the stride length, which is the moving interval of each filter. Also,

Denotes the (si+p, sj+q) coordinates of the input data, and W _p,q and b denote the weights and biases located at p and q of each filter. In addition, W _drift in Equation 2 above means weights of convolutional layers.

본 발명의 실시예에 따른 FC 특징정보 생성부(212)는 도3에 도시된 바와 같이, 드리프트 특징정보 생성부(211)로부터 출력된 드리프트 특징정보와 입력부(100)로부터 공정 파라미터 정보인 부품 누적 사용시간에 대한 정보 및 영역별 목표 두께 값에 대한 정보를 입력 받는다. 입력된 정보들 각각에 대하여 인코딩하여 특징정보(특징 벡터 또는 특징 행렬)로 변환하고, 변환된 특징 정보들을 적어도 컨볼루션 레이어들을 이용하여 컨볼루션 필터링함에 따라 FC 특징정보를 생성할 수 있다. 예컨대, FC 특징정보는 특징 벡터 또는 특징 벡터들로 이루어진 특징 행렬로 생성될 수 있고, FC 특징정보는 생성된 FC 특징정보를 디코딩하여 부품들 각각에 적용 제어할 폴리싱 시간과 영역별 압력값을 포함하는 제1 공정 제어정보(15)를 출력할 수 있다.As shown in FIG. 3, the FC characteristic information generation unit 212 according to an embodiment of the present invention accumulates the drift characteristic information output from the drift characteristic information generation unit 211 and the process parameter information from the input unit 100. Receive information on usage time and target thickness value for each area. Each of the input information may be encoded and converted into feature information (a feature vector or a feature matrix), and the converted feature information may be convolutionally filtered using at least convolutional layers to generate FC feature information. For example, the FC feature information may be generated as a feature vector or a feature matrix consisting of feature vectors, and the FC feature information includes a polishing time to be applied to each part by decoding the generated FC feature information and a pressure value for each region. The first process control information 15 may be output.

그리고, 이렇게 FC 특징정보 생성부(212)로부터 출력된 제1 공정 제어정보(15)는 팔별망(220)으로 전달된다. Then, the first process control information 15 output from the FC characteristic information generation unit 212 is transmitted to the armbyeol network 220.

도3에 도시된 바와 같은, 판별망(220)은 상기 제1 공정 제어정보(15) 및 트레이닝 정보로서 실제 폴리싱 시간 및 영역별 압력값을 포함하는 실제 웨이퍼 제조공정에 적용되었던 제2 공정 제어정보(14)를 입력 받을 수 있다.As shown in FIG. 3, the discrimination network 220 includes the first process control information 15 and the second process control information applied to the actual wafer manufacturing process including the actual polishing time and pressure values for each region as training information. (14) can be entered.

제2 공정 제어정보(14)는 도면에는 별도로 도시하지 않았으나, 본 발명의 공정 제어정보 생성 장치 내에 포함된 DB에 저장될 수 있고, DB는 생성망(210)에서 제1 공정 제어정보(15)를 출력하면 저장하고 있던 제2 공정 제어정보(14)를 판별망(220)으로 전달할 수 있다.The second process control information 14 is not separately shown in the drawings, but may be stored in a DB included in the process control information generating apparatus of the present invention, and the DB is the first process control information 15 in the generation network 210. When is outputted, the stored second process control information 14 can be transmitted to the discrimination network 220.

제2 공정 제어정보(14)는 과거 웨이퍼들 각각을 폴리싱 할 때의 부품의 사용시간, 부품의 압력 값 등을 포함하는 실제 정보로서, 업데이트부(400)에 의해 전달 받은 입력부(100)가 DB로 저장되었던 정보일 수 있다.The second process control information 14 is actual information including the use time of the part when polishing each of the past wafers, the pressure value of the part, etc., and the input unit 100 received by the update unit 400 is It may be information that has been stored as.

웨이퍼의 공정이 완료될 때마다 해당 공정의 제어정보(제어값)과 그에 따라 폴리싱된 웨이퍼의 영역별 두께 값(출력값) 그리고, 부품의 사용시간은 업데이트부(400)에 저장된다.Whenever the wafer process is completed, the control information (control value) of the process, the thickness value (output value) for each region of the polished wafer accordingly, and the usage time of the component are stored in the update unit 400.

이에 따라, 판별망(220)은 제1 공정 제어정보(15) 및 제2 공정 제어정보(14)를 인코딩하여 특징정보로 변환하고, 변환된 특징정보를 컨볼루션 레이어들을 이용하여 상기 제1 공정 제어정보는 0이 되도록, 제2 공정 제어정보는 1이 되도록 학습함에 따라 각 공정 제어정보에 판별을 수행할 수 있다. Accordingly, the discrimination network 220 converts the first process control information 15 and the second process control information 14 into feature information, and converts the converted feature information into the first process using convolution layers. As the control information becomes 0 and the second process control information becomes 1, it is possible to determine each process control information.

즉, 판별망(220)은 각 공정 제어정보를 입력 받아, 내부의 컨볼루션 레이어들을 이용하여 각각의 공정 제어정보에 대하여 판별함으로써 0 내지 1 사이의 값을 도출할 수 있다. 생성망(210)은 판별망(220)으로부터 제1 공정 제어정보(15)에 대한 출력값이 1에 수렴할 수 있도록 이후 학습단계에서는 내부 레이어들의 필터 계수(가중치)들을 변경하여 설정할 수 있다.That is, the discrimination network 220 may derive a value between 0 and 1 by receiving each process control information and discriminating for each process control information using internal convolutional layers. The generation network 210 may change and set filter coefficients (weights) of the inner layers in a later learning step so that the output value of the first process control information 15 from the discrimination network 220 converges to 1.

상술한 바와 같은, 생성망(210)과 판별망(220)으로 구성된 본 발명의 학습부(200)는 LSGANs의 최소 제곱법을 기반으로 하는 손실 함수(least square loss function)을 사용하여 역전파(back propagation)을 통해 학습되며, 학습에 필요한 손실 함수는 아래 <수학식4>와 같다.As described above, the learning unit 200 of the present invention composed of the generation network 210 and the discrimination network 220 uses the least square loss function based on the least squares method of LSGANs to backpropagation ( back propagation), and the loss function required for learning is as shown in Equation 4 below.

여기서,

과

은 판별망(220)과 생성망(210)의 손실(loss)이며, p_d와 p_g는 제2 공정 제어정보(u_k)와 생성망(210)에 의해 생성된 제1 공정 제어정보(

)의 분포이다. 판별망(220)은 제2 공정 제어정보(u_k)에 대해 D(u_k)=1, 제1 공정 제어정보 D(

)=0이 되도록 학습한다. 반면, 생성망(210)은 제2 공정 제어정보(u_k)와 아주 유사한 정보를 생성하여 D(

)=1이 되도록 학습한다. 궁극적으로 학습이 성공적으로 완료되면 분포 p_g는 p_d와 같아지게 된다.here,

and

Is the loss of the discrimination network 220 and the generation network 210, and p _d and p _g are the second process control information (u _k ) and the first process control information generated by the generation network 210 (

) Is the distribution. The discrimination network 220 is D(u _k )=1 for the second process control information (u _k ), and the first process control information D(

Learn to become )=0. On the other hand, the generation network 210 generates information very similar to the second process control information (u _k ),

Learn to be )=1. Ultimately, upon successful completion of training, the distribution p _g becomes equal to p _d .

본 발명의 공정 제어정보 생성장치가 LSGANs의 학습방식을 사용하는 이유는 생성망(210)이 학습한 다차원 제어 공간에서의 분포 p_g가 실제 분포인 p_d와 같아지도록 하는 것이 제어 성능을 높이기 때문이다. The reason why the process control information generation apparatus of the present invention uses the learning method of LSGANs is that the control performance is improved by making the distribution p _g in the multidimensional control space learned by the generation network 210 equal to the actual distribution p _d. to be.

이론적으로 생성망(210)과 판별망(220)의 손실 함수는 학습을 통해 수렴이 가능하지만, 실제로는 신경망의 구조적 복잡성 때문에 수렴이 보장되지 않는다. 이 문제를 해결하기 위해 본 발명의 학습부(200)는 베이지안 최적화 모델(Bayesian optimization)을 기반으로 필터 계수들(hyper-parameters)을 조정하여 모델의 구조를 다르게 함으로써 학습이 수렴하게 될 수 있다. 본 발명의 학습부(200)는 베이지안 최적화 모델(BO)을 기반으로 하기 때문에 반복 검색을 통해 매 학습(iteration)마다 솔루션을 지능적으로 샘플링하기 때문에 빠른 시간 내에 우수한 모델 성능을 보장하는 필터 계수(가중치)들을 찾을 수 있다.Theoretically, the loss functions of the generation network 210 and the discriminant network 220 can converge through learning, but in reality, convergence is not guaranteed due to the structural complexity of the neural network. To solve this problem, the learning unit 200 of the present invention adjusts the filter coefficients (hyper-parameters) based on the Bayesian optimization model to change the structure of the model, so that learning may converge. Since the learning unit 200 of the present invention is based on the Bayesian optimization model (BO), it intelligently samples the solution for each iteration through iterative search, so filter coefficients (weighted values) that guarantee excellent model performance in a short time. ) Can be found.

아래 <수학식5>는 본 발명의 학습부(200)는 베이지안 최적화 모델(BO)을 기반으로 필터 계수들(hyper-parameter)를 조정하는 알고리즘에 대한 것이다.<Equation 5> below is for an algorithm for adjusting the filter coefficients (hyper-parameter) based on the Bayesian optimization model (BO) in the learning unit 200 of the present invention.

여기서, t는 hyper-parameter solution, T는 solution space, c(t)는 학습 완료 후의 비용 함수(cost function)을 의미한다. 위 <수학식5>의 t^*는 모델의 성능지표를 오차(error)라 하면, 오차가 가장 낮은 모델을 구성하는 hyper-parameter solution으로 정의될 수 있다.Here, t denotes a hyper-parameter solution, T denotes a solution space, and c(t) denotes a cost function after completion of learning. T ^* in Equation 5 above can be defined as a hyper-parameter solution constituting the model with the lowest error, if the performance indicator of the model is an error.

본 발명의 학습부(200)는 베이지안 최적화 모델(BO)을 적용하기 위해 아래 <수학식6>과 같은 네트워크 함수(c(t)_LRC)를 정의한다.The learning unit 200 of the present invention defines a network function c(t) _{LRC as} shown in Equation 6 below in order to apply the Bayesian optimization model BO.

여기서, m은 LSGANs의 학습에 소요되는 에폭(epochs) 수를 의미하고, 각 에폭에서 판별망(220)과 생성망(210)이 배치(batch) 수만큼 번갈아 학습된다.

와

는 에폭(i)에서 판별망(220)과 생성망(210)의 손실을 뜻하며, l은 LSGANs의 학습이 안정되는 에폭 시점을 의미한다. LSGANs을 학습시킬 때 초기 에폭에서는 d_loss와 g_loss의 진동이 심하기 때문에 c(t)_URC 값의 오류를 줄이기 위해 학습이 안정화되는 l 시점 이후의 에폭을 선택한다. 본 발명의 학습부의 전체 손실을 줄이고 d_loss와 g_loss의 적절한 균형을 맞추기 위해 d_loss와 g_loss의 합과 d_loss와 g_loss 의 절대 차이를 곱하고 에폭 수인 m-l을 나눈 평균값을 c(t)_URC로 정의한다. c(t)_URC의 값이 작을수록 생성망(210)이나 판별망(220)의 손실차이가 줄어든다는 의미로 생성망과 판별망의 어느 한쪽으로 치우치지 않는 상태에 도달한다고 볼 수 있다.Here, m denotes the number of epochs required for learning LSGANs, and in each epoch, the discriminant network 220 and the generation network 210 are alternately learned as many as the number of batches.

Wow

Denotes the loss of the discriminant network 220 and the generation network 210 in the epoch (i), and l denotes the epoch time at which the learning of LSGANs is stable. When learning LSGANs, the vibrations of d_loss and g_loss are severe in the initial epoch, so to reduce the error of the c(t) _URC value, the epoch after the point l when the learning is stabilized is selected. In order to reduce the total loss of the learning unit of the present invention and balance d_loss and g_loss, the average value obtained by multiplying the sum of d_loss and g_loss by the absolute difference between d_loss and g_loss and dividing the epoch number ml is defined as c(t) _URC . As the value of c(t) _URC decreases, the difference in loss between the generation network 210 and the discrimination network 220 decreases, and it can be seen that the state is not biased toward either the generation network or the discrimination network.

Solution(t)에 대해 c(t)_URC는 학습이 완료되어야만 관측이 가능하므로 비용이 많이 든다. 따라서, solution space에서 효율적인 샘플링이 필요하다. BO는 c(t)_URC가 Gaussian process prior를 가진다고 가정한다. 초기에 BO는 하나의 solution을 임의로 선택하여 학습부(200)를 구성하고, 트레이닝 정보(14)를 통해 c(t)_URC를 평가한다. 관측된 데이터를 가지고 있는 BO는 포스테리어(posterior)를 계산하여 c(t)_URC을 재추정하며, 업데이트된 c(t)_URC에 기반한 획득 함수(acquisition function)을 통해 solution space에서 샘플링할 다음 지점을 선택한다. 이후 학습 종료의 조건이 만족될 때까지 샘플링을 반복하여 가장 좋은 solution(t^*)를 찾는다.For solution(t), c(t) _URC is expensive because it can be observed only after learning is completed. Therefore, efficient sampling is required in the solution space. BO assumes that c(t) _URC has a Gaussian process prior. Initially, the BO configures the learning unit 200 by randomly selecting one solution, and evaluates c(t) _URC through the training information 14. The BO with observed data re-estimates c(t) _URC by calculating the posterior, and the next point to be sampled in the solution space through an acquisition function based on the updated c(t) _URC . Choose After that, sampling is repeated until the condition of completion of learning is satisfied to find the best solution (t ^* ).

다시 도2를 참조하면, 이에 따라 본 발명의 실시예에 따른 제어부(300)는 생성망(210)으로부터 출력된 제1 공정 제어정보(u^k)를 전달 받아, 웨이퍼를 폴리싱하기 위한 부품들 각각에 제어 정보(폴리싱 시간 및 폴리싱을 위한 압력값)(y^k)을 입력하고, 이때 제어부(300)는 부품들로 전달한 제어 정보(y^k)을 업데이트부(400)에도 전달할 수 있다. 즉, 웨이퍼의 공정이 완료될 때마다 해당 공정의 제어정보(제어값)가 업데이트부(400)에 저장될 수 있고, 더불어 상기 제어정보를 기반으로 폴리싱된 웨이퍼의 영역별 두께 값(출력값)과 부품의 사용시간 까지 업데이트부(400)에 저장될 수 있다.Referring back to FIG. 2, accordingly, the control unit 300 according to an embodiment of the present invention receives the first process control information u ^k output from the generation network 210, and each of the parts for polishing the wafer. Control information (polishing time and pressure value for polishing) (y ^k ) is input to the controller 300, and at this time, the controller 300 may transmit control information (y ^k ) transmitted to the parts to the update unit 400. That is, whenever the wafer process is completed, the control information (control value) of the process may be stored in the update unit 400, and the thickness value (output value) for each region of the wafer polished based on the control information It may be stored in the update unit 400 until the use time of the part.

이에 따라, 본 발명의 실시예에 따른 업데이트부(400)는 저장된 제어정보(run data)를 학습부(200)의 이후 학습을 위하여 입력부(100)로 전달함으로써 업데이트 할 수 있다.Accordingly, the update unit 400 according to an embodiment of the present invention may update the stored control information (run data) by transmitting the stored control information (run data) to the input unit 100 for subsequent learning by the learning unit 200.

다음으로, 도2 내지 도5를 참조하여 설명한 공정 제어정보 생성 장치를 통해 학습된 공정 제어정보 생성부를 이용하여 현장 웨이퍼 폴리싱 공정에서 실시간으로 공정을 제어하는 공정 제어 장치에 대하여 설명한다. 도6은 본 발명의 일 실시예에 따른 공정 제어 장치의 구성을 개략적으로 도시한 블록도이다. Next, a process control device that controls a process in real time in an on-site wafer polishing process using the process control information generation unit learned through the process control information generation device described with reference to FIGS. 2 to 5 will be described. 6 is a block diagram schematically showing the configuration of a process control apparatus according to an embodiment of the present invention.

도6에 도시된 바와 같은 본 발명의 실시예에 따른 공정 제어 장치(600)는, 입력부(610), 공정 제어정보 생성부(620), 제어부(630), 및 업데이트부(640)를 포함하여 구성될 수 있다. 공정 제어 장치(600)의 입력부(610), 제어부(630), 및 업데이트부(640)는 도2에 도시된 입력부(100), 제어부(300), 및 업데이트부(400)의 기술적 동작과 실질적으로 유사하므로, 아래에서는 공정 제어정보 생성부(620), 및 제어부(630)를 위주로 설명한다.The process control apparatus 600 according to the embodiment of the present invention as shown in FIG. 6 includes an input unit 610, a process control information generation unit 620, a control unit 630, and an update unit 640. Can be configured. The input unit 610, the control unit 630, and the update unit 640 of the process control device 600 are substantially related to the technical operation of the input unit 100, the control unit 300, and the update unit 400 shown in FIG. Since it is similar, the process control information generation unit 620 and the control unit 630 will be mainly described below.

본 발명의 실시예에 따른 공정 제어정보 생성부(620)는, 폴리싱하고자 하는 웨이퍼에 속하는 영역들의 스케일과 관련된 목표 스케일 정보를 입력 받아, 웨이퍼를 폴리싱하기 위한 적어도 하나의 장비의 압력 설정정보 및 장비의 폴리싱 시간과 관련한 시간설정 정보를 포함하는 현장 공정 제어를 위한 공정 제어 특징값을 생성할 수 있다.The process control information generation unit 620 according to an embodiment of the present invention receives target scale information related to the scale of regions belonging to the wafer to be polished, and pressure setting information and equipment of at least one equipment for polishing the wafer. It is possible to generate a process control feature value for on-site process control including time setting information related to the polishing time of.

또한, 제어부(630)는, 공정 제어정보 생성부(620)로부터 생성된 공정 제어 특징값을 기반으로 장비의 공정조건을 제어할 수 있다.In addition, the controller 630 may control a process condition of the equipment based on the process control feature value generated from the process control information generation unit 620.

공정 제어정보 생성부(620)는, 본 발명의 공정 제어정보 생성 장치의 학습과정을 통해 최적화된 가중치로 정해진 레이어들의 필터 계수들로 설정되어, 실시간 입력값들을 입력 받아 공정 제어 특징값을 생성함으로써, 제어부(630)는 생성된 공정 제어 특징값을 토대로 폴리싱을 위한 압력 값과 폴리싱 시간으로 장비들을 제어할 수 있다. The process control information generation unit 620 is set with filter coefficients of layers determined with an optimized weight through the learning process of the process control information generating apparatus of the present invention, receives real-time input values and generates process control feature values. , The control unit 630 may control the equipment based on the generated process control feature value with a pressure value for polishing and a polishing time.

여기에서, 상기 공정 제어정보 생성부(620)로 입력되는 입력값들은 위에서 설명한 공정 제어정보 생성 장치의 생성망에 입력되는 입력 정보(값)들과 동일할 수 있다. Here, input values input to the process control information generation unit 620 may be the same as input information (values) input to the generation network of the process control information generation apparatus described above.

또한, 본 발명의 공정 제어정보 생성부(620)는 공정 제어 정보인 폴리싱을 위한 압력 값 및 폴리싱 시간으로 디코딩되기 전의 특징 벡터 또는 특징 행렬 형태인 공정 제어 특징값을 생성하여 출력하여 제어부(630)에서 상기 공정 제어 특징값을 디코딩함에 따라 압력 값과 폴리싱 시간을 획득하여, 획득된 압력 값과 폴리싱 시간으로 장비를 제어하도록 구현될 수 있고, 또 다른 예로, 상기 공정 제어 특징값을 생성하여 공정 제어 특징값을 디코딩하여 상기 압력 값과 폴리싱 시간을 제어부로 출력 전달하도록 구현될 수도 있다. In addition, the process control information generation unit 620 of the present invention generates and outputs a process control feature value in the form of a feature vector or a feature matrix before being decoded with a pressure value for polishing, which is process control information, and a polishing time, and the controller 630 By decoding the process control feature value, the pressure value and the polishing time may be obtained, and the equipment may be controlled using the obtained pressure value and the polishing time. As another example, the process control feature value is generated to control the process. It may be implemented to decode the feature value and output the pressure value and the polishing time to the controller.

R2R 제어기(controller)의 성능은 제어 오차에 기반해 측정하는데 여기서, 제어 오차는 각 공정(run)에 대해서 학습부의 출력과 실제 출력의 절대 차이를 의미한다. 본 발명에서는 각 제어 변수들별로 (SD)와 mean absolute error(MAE)를 계산하여 도7과 같이 모델들의 성능을 평가하였다. 도7에서 p는 제어 변수들(control variable), n은 테스트 데이터 크기, MAE는 학습된 모델이 실제 값을 얼마나 정확히 예측하는지를 나타내는 지표이며, 는 학습된 모델의 예측 정확도와 변동성을 함께 가늠할 수 있는 지표이다. 두 지표 모두 낮은 값을 기록할수록 제어 모델의 성능이 높다는 것을 나타내고 있다.The performance of the R2R controller is measured based on a control error. Here, the control error refers to the absolute difference between the output of the learning unit and the actual output for each run. In the present invention, (SD) and mean absolute error (MAE) were calculated for each control variable to evaluate the performance of the models as shown in FIG. 7. In FIG. 7, p denotes control variables, n denotes test data size, MAE denotes how accurately the trained model predicts the actual value, and denotes the prediction accuracy and variability of the trained model. It is an indicator. Both indicators indicate that the lower the value is, the higher the performance of the control model is.

도8은 도7에서의 두 비교 모델 별로 hyper-parameter를 BO로 최적화하여 예측 성능을 높이는 실험을 수행한 결과이다. NNR2R(A)는 두 개의 신경망을 사용하고, 하나는 VM 모델로 활용하며, 다른 하나는 R2R controller로 사용한다. NNR2R(A)은 매 공정이 끝난 후 슬라이딩 윈도에 의해 선택된 최신 공정 데이터를 이용해 VM 모델을 학습하고 학습된 VM 모델에 기반해 R2R controller를 학습한다. 따라서 NNR2R(A)의 R2R controller의 성능은 VM 모델의 성능에 크게 좌우된다. 본 발명에서는 VM 모델의 성능을 개선하기 위해 학습부에 적용한 프로세스 변화의 특징 추출을 위한 CNN을 VM 모델 에도 적용하였다. FIG. 8 is a result of an experiment to improve prediction performance by optimizing a hyper-parameter to BO for each of the two comparison models in FIG. 7. NNR2R(A) uses two neural networks, one as a VM model, and the other as an R2R controller. NNR2R(A) learns a VM model using the latest process data selected by a sliding window after each process is finished, and learns an R2R controller based on the learned VM model. Therefore, the performance of the R2R controller of NNR2R(A) largely depends on the performance of the VM model. In the present invention, in order to improve the performance of the VM model, the CNN for extracting the features of the process change applied to the learning unit is also applied to the VM model.

Hyper-parameter 최적화에서 BO의 연속 횟수는 100회, NNR2R(A)의 네트워크 함수는

로 설정하였다. 여기서

은 학습 에폭의 수를 말하며 20회를 설정했고, l은 NNR2R의 학습이 안정화되는 에폭 시점을 의미하며 5회를 설정했다.

와

는 epoch i에서 hyper-parameters solution(t)을 사용한 VM과 R2R controller의 각각 mean squared error를 뜻한다. 본 발명의 학습부(B)의 경우, NNR2R(A)과 동일하게 BO 연속 횟수는 100회, m=20, l=5로 설정했다. 도8은 최적화 실험 결과를 보여준다. 도8과 같은 실시예에서 학습주기(learning rate)는 0.000001부터 0.01사이이며, CNN 특징맵의 개수는 4개에서 300개 사이이고, 각 FC 레이어(fully connected layer)의 은닉 노드의 개수는 10개에서 400개 사이이며, 풀링 레이어의 타입은 average pooling과 max pooling이다.In hyper-parameter optimization, the number of consecutive BOs is 100, and the network function of NNR2R(A) is

Was set to. here

Is the number of learning epochs, and 20 is set, and l is the epoch time when NNR2R learning is stabilized, and 5 times is set.

Wow

Means each mean squared error of VM and R2R controller using hyper-parameters solution(t) in epoch i. In the case of the learning unit (B) of the present invention, similarly to the NNR2R(A), the number of consecutive BOs was set to 100 times, m=20, l=5. 8 shows the results of the optimization experiment. In the embodiment shown in FIG. 8, the learning rate is between 0.000001 and 0.01, the number of CNN feature maps is between 4 and 300, and the number of hidden nodes in each FC layer is 10. It is between 400 and the pooling layer types are average pooling and max pooling.

도9은 학습부의 hyper-parameters를 랜덤으로 찾는 방식의 알고리즘(a)과 BO 방식의 알고리즘(b)으로 최적화한 후의 각 학습 에폭별 생성망의 손실과 판별망의 손실을 보여주는 도면이다. 도9의 (a)는 생성망과 판별망의 손실이 학습 도중에 지속적으로 진도하며 수렴하지 않는 현상을 보여준다. 이러한 현상은 GANs의 학습과정에서 나타나는 대표적인 문제점으로 판별망과 생성망이 번갈아 가며 학습하므로 서로를 속고 속이는 과정에서 전역 해로 수렴할 수 없는 상황이다. 따라서 생성망은 실제 데이터의 분포를 모두 학습하지 못하여 R2R controller로 사용하기 부적합하다. 도9의 (b)는 hyper-parameters를 BO로 최적화한 후의 생성망과 판별망의 손실 그래프로 손실이 어느 한쪽으로 치우치지 않고 생성망과 판별망이 안정되게 학습되었음을 보여준다. 이러한 안정적인 손실 분포는 R2R controller의 재현성을 확보해주고 새로운 데이터의 빠른 학습을 가능하게 한다.9 is a diagram showing the loss of the generation network for each learning epoch and the loss of the discriminant network after optimizing with the algorithm (a) of randomly finding hyper-parameters of the learning unit and the algorithm (b) of the BO method. Fig. 9(a) shows a phenomenon in which the loss of the generation network and the discriminant network continuously progresses during learning and does not converge. This phenomenon is a typical problem that occurs in the learning process of GANs. Since the discriminant network and the generation network learn alternately, they cannot converge to the global solution in the process of deceiving each other. Therefore, the generation network is not suitable for use as an R2R controller because it cannot learn all the distribution of actual data. 9B is a graph of the loss of the generation network and the discriminant network after optimizing the hyper-parameters to BO, and shows that the generation network and the discriminant network are stably learned without biasing the loss to either side. This stable loss distribution secures the reproducibility of the R2R controller and enables fast learning of new data.

도10는 hyper-parameter 최적화를 위한 실험 결과를 나타낸 도면이다. 1,450개의 실험 데이터에 대한 hyper-parameter 최적화 알고리즘을 적용한 후의 학습부의 성능 결과를 보여준다. 도10에서 "Random"과 "BO"는 각각 hyper-parameters에 도8의 random values와 BO results를 각각 사용했을 때의 성능 측정값이다. 8개의 control parameter의 결과를 보면 hyper-parameter 최적화 후 성능이 큰 폭으로 개선되는 것을 확인할 수 있었다. MAE측면에서 각 파라미터별 적게는 약 26.4%, 많게는 약 63.8% 정도 개선된 성능을 보였고, +SD의 측면에서도 약 31.2%에서 80.3%까지의 성능 개선이 있었다. 도8과 도9의 결과로부터, BO가 찾은 hyper-parameter solution은 학습부의 생성망과 판별망의 능력에 적절한 균형을 이루는 안정적인 학습을 가능하게 함을 알 수 있다. 또한 BO 결과를 적용한 생성망은 낮은 제어 오차와 변동성을 가진 R2R controller로 사용 가능함을 확인할 수 있다.10 is a diagram showing an experiment result for hyper-parameter optimization. The performance result of the learning unit after applying the hyper-parameter optimization algorithm to 1,450 experimental data is shown. In FIG. 10, "Random" and "BO" are performance measurements when the random values and BO results of FIG. 8 are respectively used for hyper-parameters. Looking at the results of eight control parameters, it was confirmed that the performance improved significantly after hyper-parameter optimization. In terms of MAE, each parameter showed improved performance by about 26.4% and up to 63.8%, and in terms of +SD, there was a performance improvement from about 31.2% to 80.3%. From the results of FIGS. 8 and 9, it can be seen that the hyper-parameter solution found by the BO enables stable learning that balances the capabilities of the generating network and the discriminant network of the learning unit. In addition, it can be seen that the generation network applying the BO result can be used as an R2R controller with low control error and variability.

도11은 실험용 웨이퍼 공정의 제어 오차를 공정 시퀀스 플롯(run sequence plot)으로 표현한 결과다. 이 플롯에는 CMP 장비 부품의 교체에 의해 발생하는 프로세스 시프트(process shift)가 1회 포함된다. 도11의 (a)와 (b)의 왼쪽 y축은 각각 폴리싱 시간들 (

)의 절대 제어 오차의 합과 압력값들(

)의 절대 제어 오차의 합이며, 오른쪽 y축은 부품들의 정규화된 수명을 나타내며 1의 값에 도달하면 교체에 의해 0에서부터 증가함을 확인할 수 있다. 11 is a result of expressing a control error of an experimental wafer process in a run sequence plot. This plot contains one process shift caused by replacement of CMP equipment components. The y-axis on the left in (a) and (b) of Fig. 11 are the polishing times (

) Of the sum of absolute control errors and pressure values (

) Is the sum of the absolute control errors, and the y-axis on the right represents the normalized life of the parts, and when it reaches the value of 1, it can be seen that it increases from 0 by replacement.

이상에서 설명한 본 발명의 실시예를 구성하는 모든 구성요소들이 하나로 결합하거나 결합하여 동작하는 것으로 기재되어 있다고 해서, 본 발명이 반드시 이러한 실시예에 한정되는 것은 아니다. 즉, 본 발명의 목적 범위 안에서라면, 그 모든 구성요소들이 하나 이상으로 선택적으로 결합하여 동작할 수도 있다. 또한, 그 모든 구성요소들이 각각 하나의 독립적인 하드웨어로 구현될 수 있지만, 각 구성요소들의 그 일부 또는 전부가 선택적으로 조합되어 하나 또는 복수개의 하드웨어에서 조합된 일부 또는 전부의 기능을 수행하는 프로그램 모듈을 갖는 컴퓨터 프로그램으로서 구현될 수도 있다. 또한, 이와 같은 컴퓨터 프로그램은 USB 메모리, CD 디스크, 플래쉬 메모리 등과 같은 컴퓨터가 읽을 수 있는 기록매체(Computer Readable Media)에 저장되어 컴퓨터에 의하여 읽혀지고 실행됨으로써, 본 발명의 실시예를 구현할 수 있다. 컴퓨터 프로그램의 기록매체로서는 자기 기록매체, 광 기록매체 등이 포함될 수 있다.Even if all the components constituting the embodiments of the present invention described above are described as being combined into one or operating in combination, the present invention is not necessarily limited to these embodiments. That is, within the scope of the object of the present invention, all the constituent elements may be selectively combined and operated in one or more. In addition, although all of the components can be implemented as one independent hardware, a program module that performs some or all functions combined in one or more hardware by selectively combining some or all of the components. It may be implemented as a computer program having In addition, such a computer program is stored in a computer readable media such as a USB memory, a CD disk, a flash memory, etc., and is read and executed by a computer, thereby implementing an embodiment of the present invention. The recording medium of the computer program may include a magnetic recording medium or an optical recording medium.

이상의 설명은 본 발명의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위 내에서 다양한 수정, 변경 및 치환이 가능할 것이다. 따라서, 본 발명에 개시된 실시예 및 첨부된 도면들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예 및 첨부된 도면에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구 범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리 범위에 포함되는 것으로 해석되어야 할 것이다.The above description is merely illustrative of the technical idea of the present invention, and those of ordinary skill in the technical field to which the present invention belongs can make various modifications, changes, and substitutions within the scope not departing from the essential characteristics of the present invention. will be. Accordingly, the embodiments disclosed in the present invention and the accompanying drawings are not intended to limit the technical idea of the present invention, but to explain, and the scope of the technical idea of the present invention is not limited by these embodiments and the accompanying drawings. . The scope of protection of the present invention should be interpreted by the following claims, and all technical ideas within the scope equivalent thereto should be construed as being included in the scope of the present invention.

Claims

An input unit for receiving target scale information related to the scale of regions belonging to the substrate to be manufactured and process parameter information for at least one device for manufacturing the substrate;
A generation network that generates first process control information necessary for a process for manufacturing the substrate by using a plurality of layers based on the target scale information and the process parameter information; And
A process control information generating apparatus required for a substrate manufacturing process comprising a discrimination network for deriving an evaluation result for the first process control information by learning based on the first process control information and second process control information, which is training data. .

The method of claim 1,
The scale information is thickness information of each area belonging to the substrate,
The process parameter information is information on the cumulative use time of the equipment,
The first process control information and the second process control information includes a polishing time and a polishing pressure value required in a single operation process.

The method of claim 1, wherein the generation network,
A first feature map is generated using previously stored process control history information related to the process control of the substrate, and the first feature map is subjected to convolutional filtering using filters of different scales. The apparatus for generating process control information necessary for a substrate manufacturing process, further comprising: a drift characteristic information generation unit that generates information.

The method of claim 3, wherein the generation network,
A FC feature information generator configured to receive the drift feature information, the scale information, and the process parameter information and perform convolution filtering using at least one convolution layer to generate FC feature information; and
A process control information generating apparatus required for a substrate manufacturing process, characterized in that decoding the FC characteristic information and outputting the first process control information.

The method of claim 3, wherein the drift characteristic information generation unit,
A second feature map generator configured to generate a second feature map in consideration of a correlation between the process parameter information in the same process by filtering the first feature map using a first filter; And
By filtering the first feature map using a second filter having a scale different from that of the first filter, a third feature map is generated based on the same process parameter information in consideration of correlation between successive process processes. The apparatus for generating process control information required for a substrate manufacturing process, further comprising: a feature map generator.

The method of claim 1,
And an update unit for receiving the second process control information, process control history information in a previous process of manufacturing the substrate, and scale history information for each area of the substrate manufactured in the previous process and transmitting the received information to the input unit. An apparatus for generating process control information necessary for a substrate manufacturing process.

The method of claim 1,
Wherein the generation network changes a filter coefficient of at least one of the plurality of layers in consideration of an evaluation result derived from the discrimination network.

The method of claim 1,
And the generation network and the discrimination network use a least square loss function based on a least squares method.

The method of claim 1,
The generation network and the discrimination network change and set filter coefficients to be changed based on a Bayesian optimization model.

Receiving target scale information related to the scale of regions belonging to the substrate to be manufactured and process parameter information for at least one equipment for manufacturing the substrate;
Generating first process control information necessary for a process for manufacturing the substrate by using a plurality of layers based on the target scale information and the process parameter information; And
Deriving an evaluation result for the first process control information by learning based on the first process control information and second process control information, which is training data, and generating process control information necessary for a substrate manufacturing process.

The method of claim 10,
The scale information is thickness information of each area belonging to the substrate,
The process parameter information is information on the cumulative use time of the equipment,
The first process control information and the second process control information include a polishing time and a polishing pressure value required in a single operation process.

The method of claim 10, wherein the generation network,
A first feature map is generated using previously stored process control history information related to the process control of the substrate, and the first feature map is subjected to convolutional filtering using filters of different scales. Generating information; And
Receiving the drift characteristic information, the scale information, and the process parameter information and performing convolution filtering using at least one convolution layer to generate FC characteristic information for the first process control information; further comprising ,
The method for generating process control information required for a substrate manufacturing process, characterized in that the FC characteristic information is decoded and the first process control information is output.

Receive target scale information related to the scale of regions belonging to the substrate to be polished, and perform on-site process control including pressure setting information of at least one equipment for polishing the substrate and time setting information related to the polishing time of the equipment. A process control information generation unit that generates a process control feature value for; And
Process control device comprising; a control unit for controlling the process condition of the equipment based on the process control characteristic value.

The method of claim 13,
The control unit transfers the process conditions of the controlled equipment and the accumulated use time of the equipment to the process control information generation unit,
The target scale information is information on a target thickness of each region belonging to the substrate.

The method of claim 13, wherein the process control information generation unit,
A first feature map is generated using previously stored process control history information related to the process control of the substrate, and the first feature map is subjected to convolutional filtering using filters of different scales. A drift feature information generation unit that generates information; And
An FC feature information generator configured to receive the drift feature information, the scale information, and the process parameter information and perform convolution filtering using at least one convolution layer to generate FC feature information for the first process control information; Including more,
The process control device, characterized in that for generating the process control information by decoding the FC characteristic information.

The method of claim 15, wherein the drift characteristic information generation unit,
A second feature map generator configured to generate a second feature map in consideration of a correlation between the process parameter information in the same operation process by filtering the first feature map using a first filter; And
By filtering the first feature map using a second filter having a scale different from that of the first filter, a third feature map that considers the relationship between successive operation processes based on the same factory parameter information is generated. The process control device further comprising a; feature map generator.

A computer program stored in a computer-readable recording medium in order to execute the process control information generating method according to any one of claims 10 to 12 on a computer.