TW201123254A - Cell based vector format - Google Patents

Cell based vector format Download PDF

Info

Publication number
TW201123254A
TW201123254A TW99116035A TW99116035A TW201123254A TW 201123254 A TW201123254 A TW 201123254A TW 99116035 A TW99116035 A TW 99116035A TW 99116035 A TW99116035 A TW 99116035A TW 201123254 A TW201123254 A TW 201123254A
Authority
TW
Taiwan
Prior art keywords
data
grid
pattern
pattern data
array
Prior art date
Application number
TW99116035A
Other languages
Chinese (zh)
Inventor
Marco Jan-Jaco Wieland
De Peut Teunis Van
Mark Hoving
Edwin Hakkennes
Nol Venema
Original Assignee
Mapper Lithography Ip Bv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mapper Lithography Ip Bv filed Critical Mapper Lithography Ip Bv
Publication of TW201123254A publication Critical patent/TW201123254A/en

Links

Abstract

A method for defining features for writing on a target using a lithography process. The method comprises defining an array of cells, the features occupying one or more of the cells, and describing for each cell any corners of the features that fall within the cell. The corner may be described by a corner position, a first vector, and a second vector, the two vectors originating from the position.

Description

201123254 y 六、發明說明: 【發明所屬之技術領域】 本發明是關於無遮罩(maskless)帶電粒子微影設借, 尤其關於數據途徑、用於實施修正的方法以及心此類設 備的掃描方法β 【先前技術】 用於積體電路的設計是典型為以電腦可讀檔案表示。 GDS-II檔案格式(用於圖形數據信號的gds標準)是資料庫 播案格式’其為用於積體電路或1(:佈局原圖的數據交換的 微影工業標準。對於使用遮罩的微影機器,典型為使用 GDS-Π檔案以製造其接著由微影機器所使用的遮罩或一組 遮罩。對於無遮軍的微影機器,GDS_n標案是電子式處理 以使其成為適用於控制微影機器的格式。對於帶電粒子微 〜機器GDS-II檔案是轉換成為用於控制在微影製程中所 使用的帶電粒子束的一組控制信號。 可使用預處理單元來處理GDS-Π檔案以產生用於目前 的微影系統的中間數據。視架構選項而定,此中間數據是 位元映像(bitmap)格式或是以向量袼式的區域描述。目前的 微影系統是使用中間數據以使用大量的電子束來將圖型 (pattern)寫入晶圓。 必廣界定數據途徑的架構來實施其能夠以最低成本而 放大到王域的同谷置所需的所有特徵。對於全域的高容量 機器所需的數據途徑特徵含有其為工具校準與製程變化所 201123254 需的不同型式的修正 【發明内容】 本發明是在隨附的申請專利範圍中所描述。 上的種用於定義使用微影處理以寫入在目標 的特徵的方法。該種方法包含·定義 特徵佔用一或多個抵.ιν „ > β等 ,及,對於各個格,描述其屬; 格内的特徵的任何稜角。 ,、屬於。玄 稜角可由棱角位罟、货 s 置第一向1與第二向量所描述,二 個向1疋起自該位置。棱 座標所描述。各個6旦了 “ 個座標及/或由直角 位碼所描述。里可由其指定對於該向量的方向的方 二向::I被疋義為由當從第一向量以預定方向移動到第 二:::該等向量與格邊界所界定的區域,預定方向是 堵如順時針方向。虛擬棱 且士. 文月r對於其部分屬於一袼内而不 具有在该格内的任何稜角 彼…“ 硬角的特徵所定義。虛擬稜角可由其 彼此方位為180度的第-與第二向量所描述。 邊界量可作選擇以僅具有平行於格邊界或垂直於格 ===/或僅具有平行於格邊界、垂直於格邊界、或 對於格邊界為μ度的方向。 可定義最小特徵間距,且噠笙 ,^^ 等格可具有尺寸為等於或 小於最小特徵間距。該等格可具 -fc- -to 寸為等於或小於2的 +方根乘以最小特徵間距的一 — » , ^ ^ ^ 了弋義最小特徵間距為 尺寸4於或大㈣等格的尺寸的平方根。 201123254 位在對於格邊界為45度的邊緣的 部分特徵,可定義甚,ο 〇度的迹緣的特徵或 心我最小特徵間距 該等格的尺寸以2㈣^ 4尺寸為4於或大於 曰伽技&攻 的千方根。對於各個格可定義最大數 目個稜角。各個袼可一 或夕個特徵及/或一或多個特徵 H分4㈣以含對於晶圓的部分場域的圖型數 據、或晶圓的場域條帶的圖型數據。 ^另個觀點’本發明包含—種處理用於微影法的圖 里f的方法’該種方法包含:以向量格式提供圖型數據; 將向置圖型數據變;,、,A i w 、產生基於格的格式的圖型數據;以 將4基於格的圖型數據柵格化以產生用於該微影法的 一階圖型數據。該基於格的圖型數據可包含描述其佔用格 陣列的-或多個格之特徵的格數據,該格數據對於各個格 描述其屬於該格内的特徵的任何棱角。將該基於格的圖型 數據柵格化可在實行該微影法時以即時處理所實行。將該 基於格的圖型數據栅格化可包含:將該基於格的圖型數據 轉列以產生多階圖型數據;以及,將該多階圖型數據遞色 以產生該一階圖型數據。 在又一個觀點’本發明提出一種使用帶電粒子微影機 器根據圖型數據以將晶圓曝光的方法,帶電粒子微影機器 產生複數個帶電粒子小束以將晶圓曝光,該種方法包含: 以向量格式提供圖型數據;將向量圖型數據變換以產生基 於格的格式的圖型數據;將該基於格的圖型數據栅格化以 產生二階圖型數據;將該二階圖型數據串流到小束熄滅器 陣列以將帶電粒子微影機器所產生的小束接通及切斷;以 I Si 5 201123254 及,基於該二階圖型數擄以將該等小束接通及切斷。 該基於格的圖型數掳可包含描述其佔用格陣列的一或 多個格之特徵的格數據,該格數據對於各個格描述其屬於 該格内的特徵的任何稜角。將該基於格的圖型數據柵格化 可在微影機器正將晶圓曝光時以即時處理所實行。將該基 於格的圖型數據柵格化可包含:將該基於格的圖型數據轉 列以產生多階圖型數據;以及,將該多階圖型數據遞色以 產生該二階圖型數據。 【實施方式】 下文僅為舉例且關於圖式而說明本發明的種種實施 例 帶電粒子微影系統 圖1是顯示-種帶電粒子微影系统1〇〇的概念圖,系 統100是分為三個高階的子系統:晶圓定位系統1〇1、電子 光學柱⑽、與數據途徑1G3。晶圓定位㈣1()1將晶圓以 x方向在電子光學柱102之下方移動。晶圓定位系'統101被 供、·。來自數據途徑103的同步信號以將晶圓對準由電子光 學柱10 2所產生的電子小束。 一土圖2A顯示一種帶電粒子微影系統1〇〇的實施例的簡化 不意圖,其顯示電子光學柱1()2 @細節。舉例來說,此類 微影系統是描述於美@專㈣6,897 458號第UK鄭 遽、第7,〇19,908號、第7,〇84 414號與第7 129 5〇2號、美 國專利申請案公告第2〇〇7/〇〇64213號以及共同申請中的美 201123254 國專利申請案序號第61/03 1,573號、第61/031,594號、第 61/045,243 號、第 61/055,839 號、第 61/058,596 號與第 6 1/101,682號,此等美國專利前案均為讓渡給本申請案的所 有者且其整體為以參照方式而納入於本文。 在圖2A所示的實施例中,微影系統包含帶電粒子源 110,例如.用於產生擴展電子束13〇的電子源。擴展電子 束130撞擊於孔隙陣列丨丨丨,其將部分的束阻斷以產生複數 個小束13 1。該系統產生許多個小束,較佳為在約丨〇,〇〇〇 到1,000,000個小束的範圍中。 電子小束131通過其將電子小束131聚焦的聚光透鏡 陣列112。小束13 1是由準直器透鏡系統丨13所準直。準直 的電子小束通過XY偏轉器陣列丨14、第二孔隙陣列U5、 與第二聚光透鏡陣列116〇造成的小束132接著通過束熄滅 器(blanker)陣列i丨7,其包含複數個熄滅器以將小束的一或 多者偏轉。小束通過面鏡148且到達束光闌(st〇p)陣列118, 其具有複數個孔隙。小束熄滅器陣列117與束光闌陣列ιΐ8 是一起操作以藉由將小束阻斷或讓其通過而將小束接通或 切斷。小束熄滅器陣列117可將小束偏轉以使得其將不會 通過在束光闌陣列118 <中的對應孔隙而是將被阻斷。若 小束熄滅器陣歹"17未將小束偏轉,則其將通過在束光闌 陣列118之中的對應孔隙。未偏轉的小束通過束光闌陣列、 且通過束偏轉器陣列119與投射透鏡陣列12〇。 —束偏轉器陣歹〇19提供各個小束133在乂及/或γ方向 (實質垂直於未偏轉小束的方向)的偏轉,以將小束掃描遍及 201123254 目標104的表面。此偏轉不同於小束熄滅器陣列所使用以 將小束接通或切斷的偏轉。接著,小纟133通過投射透鏡 陣列12G且投射到目標1G4之上。投射透鏡配置較佳提供 約100到500倍的縮小。小束133撞擊於目標ι〇4的表面 上,目私104是疋位在晶圓定位系統丨〇丨的活動台上。對 於微影應用,目標通常包含其提供帶電粒子敏感層或抗钱 劑(resist)層的晶圓。 在圖2A所示的代表圖是極為簡化。在較佳實施例中, 單個電子束是首先分割成為多個較小的子束,其接著分為 更多個小束。此類系統是描述於美國專利申請案序號第 61/045,243號,其整體為以參照方式而納入於本文。 在此系統中,各個子束是分為若干個小束,其可視為 圖型束。在一個實施例中,各個子束是分為其排列在7χ7 陣列中的49個小束。小束熄滅器陣列較佳包含一個孔,其 具有對於各個小束的關聯熄滅器電極,以致能各個個別小 束的通/斷(on/off)切換。圖3與4顯示小束熄滅器陣列的一 部分,其就每個圖型束為具有9個小束的實施例而言,各 群組的小束為排列於3χ3陣列。在圖型束中的小束排列及 寫入策略是例如描述於美國專利申請案序號第61/〇58,596 號,其整體為以參照方式而納入於本文。 束偏轉器陣列與投射透鏡陣列是對於各個圖型束較佳 為僅包括一個孔與透鏡(例如:對於構成一個圖型束的各群 級49個小束為一個孔或透鏡)。小束是典型為在其寫入單條 帶(stripe)之群組中而組合(交插/多工)。 201123254 數據途徑架構 數據途徑103之一個實施例的簡化方塊圖是顯示在圖 2B之中,且數據途徑的一部分亦出現在圖2a之中。小束 熄滅器陣列Π7的切換是經由數據途徑所控制。預處理單 元140接收其描述將由微影機器製造的裝置佈局的資訊。 此資訊是典型以GDS-II檔案格式所提供。預處理單元實行 GDS-II檔案的一連串變換來產生通/斷控制信號以控制小束 熄滅器陣列11 7。 控制信號是傳送到電光轉換裝置143 (諸如:雷射二極 體)以將電氣控制信號轉換為光學信號。光學控制信號是透 過光纖145所導引。在光纖輸出處的光束146是透過陣列 的透鏡147而導引到有孔的面鏡148之上。從該鏡,光束 是反射到束熄滅器陣列117的底側上◊個別的光束是指向 到束熄滅器陣列117的底側上的複數個光電轉換裝置(諸 如:光電二極體)。較佳而言,小束熄滅器陣列上具有對於 每個光纖145的光電二極體》光電二極體操作以致動個別 的束熄滅器電極來控制小束13 2的偏轉以將個別的小束接 通或切斷。 用於控制個別小束總滅器電極的控制信號是較佳為經 多工’使得各個光束146載有用於一個通道的控制信號, 該通道包含其共用一個光纖與光電二極體的若干個小束。 多工光朿是由光電二極體所接收且轉換為電氣信號。小束 熄滅器陣列117包括邏輯操作,用於將光電二極體所接收 的控制信號解多工以導出用於個別控制若干個小束熄滅器 201123254 電極的控制信號。在較佳實施例中,用於控制一個圖型束 的49個小束的個別控制信號是時間多工以供在單個光纖上 ' 為由小束媳滅器陣列上的單個光電二極體所接收》 除了多工以外,小束控制信號亦可為以框(fume)配置 來供傳輸且可具有同步位元與附加編碼以改良傳輸,例 如:使用編碼技術以達成頻繁的信號轉變,防止以耦合 方式使用雷射二極體與光電二極體。藉由強制轉變,時: ^是自動分佈在光學信號卜圖12顯示小束控制信號的 實例,其具有對於(一個圖型束的)49個小束的定框、同步 位元、與多工控制位元。 在較為接近晶圓處,使用束偏轉器陣列ιΐ9來將電子 束在y方向偏轉(並且在,方向少量偏轉)以達成遍及晶圓 ^曰表面的電子小束掃描。在所述的實施例中,晶圓ι〇4是 由甜圓定位系統1 〇 1在方向檣 械切動,且電子小束是 在實質垂直於X方向的y方向掃描遍及晶圓。當寫入數據, :束是在y方向緩慢偏轉(相較於返舰 =束是車快速移動回到yr_起始位置(此稱= 束偏轉器陣列119接收來自數 訊。 的時序與同步資 微少==可分為若干個通道。通道是從預處理單元到 氣至光學轉換器(例如:雷射二極例中’通道包含電 唬的單個錢及光學至t4# 束控Μ ㈣換蝴如··光電二極體)。可 10 201123254 指定此通道以傳送對於單個圖型束的控制信號,單個圖型 束包含若干個個別小束(例如:構成一個圖型電子束的49 個小束)。可使用一個圖型束以寫入在晶圓上的單個條帶。 在此配置中,通道代表數據途徑構件,其專用以控制包含 多個小束(例如:49個小束)的一個圖型束且載有小束控制 信號以根據圖型數據來寫入一個條帶。子通道代表其專用 以控制在圖型束内的單個小束的數據途徑構件。 數據途彳i盧揮 數據途徑101將佈局數據變換成為用於控制電子小束 .的通/斷信號。如上所述,此變換可在預處理單元140實行, 預地理單元140實行在典型為以GDUJ或類似楷案开》式的 佈局數據上的一連串變換。此處理是典型包括:平坦化 (flattening)/預處理、柵格化(rasterizati〇n)及多工步驟。 平坦化/預處理步驟將佈局數據格式變換成為劑量映 射。劑量映射以向量格式與關聯的劑量率值來描述在晶圓 上的區域。此步驟可包括諸如鄰近效應修正的一些預處 理。因為預處理的複雜度,此步驟較佳為離線實行。拇格 化步驟將劑量映射變換成為串流的控制(通/斷)信號。多工 步驟是根據多工方案將小束控制信號封裝。 用於在微影機器中寫入晶圓的方半可田…_卜 日日圓的万沄可用以下步驟的順 序而概略描述。晶圓1〇4是安梦*曰ffi中从么“ 疋文哀在日日0疋位系統丨〇丨的台 上,柱ί〇2維持於真空條件, 圭 J采被校準。晶圓被機械 式對準,且按照場域(field)的對準(偏移)被計算。晶圓是由 該台在+x方向移動且該柱開妒宜筮 狂间始寫入第一%域。當小束熄滅 201123254 器陣列的前導列的孔為通過一個場埴臬 旬域邊界,偏移修正是對 於下個場域所設置。因此,當第一個 每域為仍在寫入時, 微影系統將開始寫入下一個場域。力穿 仗馬入一列中的最後一 個場域之後’將移動該台以將晶圓上 圓的下—列的場域定位 在小束熄滅器陣列的下方。當該台在 世x方向移動時,將開 始新的運行。掃描偏轉方向較佳為不變。 修正 由數據途徑所實行的數據處理可提供對小束控制信號 的-些不同調整以作成種種型式的修正與補償。舉例來 說’此等修正與補償可包括鄰近修正與抗㈣加熱修正, 以補償其為使用抗蝕劑性質的結果所發生的效應。數據調 整亦可包括其為設計來補償在微影機器中所發生的誤差或 失效之修正。 在帶電粒子微影機II的較佳實施例中,微影機器並未 内建任何設施以供調整個別電子小束來修正在小束位置、 尺寸、電流、或束的其他特性之誤差。缺失為諸如··小束 的失準或失效、低或高的小束電流、小束的不正確偏轉。 此類缺失可為在微影機器之製造中的缺陷或容許度變化、 阻斷小束或變成帶電及偏轉小束的污物或灰塵、機器構件 中的失效或劣化、等等之結果。微影機器省略用於作成對 J束的個別修正之修正透鏡或電路’以避免在納入附加的 構件到電子光學柱來作成實際束修正所涉及的附加複雜度 與成本,且避免由於納入此類附加構件所需要於該柱尺寸 的增大。然而,小束控制信號的調處及/或晶圓的附加掃描 12 201123254 可補償此等型式的問題。在數據途徑所發生的失效亦可為 由控制信號的調處且連同重新掃描晶圓所修正。用於作成 此等修正的種種方法是於下文所描述。 冗餘掃描 工述的帶電粒子 ^夕 .....。%双佩迷徑τ的 介夕個光纖與雷射二極體、對於各個圖型束的許多個靜電 透鏡與偏轉器以及在小束熄滅器陣列中的極多個媳滅器元 件。極有可能的是,失效可發生在有些此等構件中或是有 2此等構件將會劣化或由污染物所影響而使其不在規格内 實行。為了儘可能延長在系統維護間的時間,可週期 订檢查以判別失效或不合規格的小束或數據通道。此檢查 ^各個晶圓掃描前、在晶圓的各個第一次掃描前或在二 二他便利時刻而實行。檢查可包括一或多 例來說’其包括如在共同申請中的美國專利申請案序於; 61/122,591號所述,該件美國專利中請案 ^ u 式而納入於本文》冗餘掃描的主要目 ·、、 的失效,由於在柱中的失效部分之更換耗在 '、可使用冗餘掃描以對付在數據途徑中的失效 說,在-個通道中的失效光纖或雷射^例來 掃描期間將該通道切斷且使用另-個通道餘 失效通道所寫人的條帶而作修正。 寫人其將要由 在失效或不合規格的小束為偵 其將要由料束所曝光的條帶為未寫入。’可=斷小束以使 掃描(稱為冗餘掃描)以寫人在第—:接者使用第二次 人知描期間省略的晶圓 201123254 條帶。在諸如上述的圖型小束系統中,可切斷其包括失效 或不合規格的小束之完整通道,且將不會寫入其將要由該 通道的小束所曝光的晶圓場域的完整條帶。在實行整個曰 圓的第一次掃描後,可接著實行冗餘掃描以填補遺漏條帶 (及對於具有失效小束的其他通道的任何其他遺漏條帶)。 對於冗餘掃描,晶圓是在第一次掃描後而返回到起始 位置,而且移位到其確保適當作用通道為可用於寫入遺漏 條帶的位置。對於冗餘掃描的圖型數據較佳為在第一次掃 描期間而準備於微影系統中,致使冗餘掃描能在第一次掃 描完成後而儘快開始。較佳為不具有在第一次掃描結束與 冗餘掃描開始之間的顯著延遲,故對於冗餘掃描的數據較 佳為快速可用在適當節點上。 微影機器較佳為能夠在一個掃描中寫入連續的線内 (m-Iine)場域,且以平行於機械掃描的χ方向的二個方向 (即· -X與+χ方向)寫入。該機器亦較佳包括備用束(或圖型 束)’其通常位在該柱的邊緣。 曰為了由適當作用通道在冗餘掃描期間寫入遺漏條帶, 可關㈣柱在y方向及/或χ方向移位(偏移)其對應於 數目的量而直到具有適當作用小束的通道為定位以寫 ^遺漏條帶位置。此較佳為在台上的晶圓的機械偏移所達 成。為了較佳處理所有種類的誤差 後一個通道的失效),可能需要斜私梦 ’、破 7此需要對於第一次與第二次掃描的 移。 多次掃描 201123254 在多次掃描,,實施例中,m , 土 亦可使用第-a & 中對作用小束以及缺陷小束 掃描作用。在多次…! 而仍然達成冗餘 條帶的_ Ap八 田日日圓的第一次掃描是寫入場域 rt 第二次掃描是寫人條帶的其餘部分,造成 掃描或四次掃描等亦可將此原理擴展到三次 曝光的總計時間且減少 用於將曰曰圓 掃描方式為較b 4產1。因此次掃描或兩次 率是=Γ掃描與冗餘掃描是可能的,因為小束失效 型為低。可在第—次掃描前實行束測量以㈣失效 ,、不σ規格的小束。使用卜 果使用此倉訊,可計算第-次與第二次 知描,將造成其指定由作用小束掃描的晶圓的 冗餘掃描中,較佳而言,當失效或不合規格二 ?測時,切斷其包括該小束的整個通道並且使用另一個 1 乍=(具有符合規格的所有小束)來寫入其將要由失效 通道所寫入的條帶。 可使用種種演算法來計算將用於第一次與第二次掃描 的通道及對於各個掃描所需要的晶圓㈣,造成所有條帶 為由作用通道所寫人。對於二次掃描,演算法是在未使用 ㈣通道的各個掃描間而尋a則G分割的通道。可使用 蠻力(brute force)方式來測試種種通道分配與晶圓偏移 以找到適合的組合’或是可使用更複雜的匹配演算法。 因此,對於晶圓的總曝光電流是分配在二個(或更多個) 掃描間。在多次掃描中,可使用第二次掃描(或第三次掃描201123254 y VI. Description of the Invention: [Technical Field] The present invention relates to maskless charged particle lithography, especially regarding data paths, methods for implementing corrections, and scanning methods for such devices β [Prior Art] The design for an integrated circuit is typically represented by a computer readable file. The GDS-II file format (the gds standard for graphic data signals) is the database broadcast format 'which is the lithography industry standard for integrated circuit or 1 (: layout of the original image data exchange. For masks A lithography machine, typically using a GDS-Π file to create a mask or a set of masks that are then used by the lithography machine. For unshaded lithography machines, the GDS_n standard is electronically processed to make it Suitable for controlling the format of lithography machines. For charged particle micro~machine GDS-II files is converted into a set of control signals for controlling the charged particle beam used in the lithography process. The preprocessing unit can be used to process GDS. - Π file to generate intermediate data for the current lithography system. Depending on the architecture options, this intermediate data is in a bitmap format or a vector 的 region description. The current lithography system is used Intermediate data uses a large number of electron beams to write patterns into the wafer. The architecture of the data path must be broadly defined to implement all the features needed to scale up to the same valley at the lowest cost. The data path characteristics required for a global high volume machine contain modifications of the different types required for tool calibration and process variation 201123254. [Invention] The present invention is described in the accompanying patent application. A method of defining a feature that uses lithography to write to a target. The method includes defining a feature that occupies one or more of .ιν „ > β, etc., and, for each cell, describing its genus; Any angularity of the feature, , belongs to. The sinuous angle can be described by the angular position, the first s1 and the second vector, and the two are from the position. The ridge is described by the rib. The coordinates and/or are described by a right-angled bit code. The square direction in which the direction of the vector can be specified::I is deprecated as being moved from the first vector in a predetermined direction to the second::: The area defined by the vector and the boundary of the grid, the predetermined direction is blocked as in the clockwise direction. The virtual edge and the gentleman. The crescent moon r belongs to a part of it without having any corners in the grid... "Characteristics of hard angles Place Definition. Virtual corners can be described by the first and second vectors whose orientation is 180 degrees to each other. The boundary quantity can be selected to have only parallel to the grid boundary or perpendicular to the grid ===/ or only have parallel to the grid boundary, vertical The grid boundary, or the direction of the grid boundary is μ degrees. The minimum feature spacing can be defined, and 哒笙, ^^, etc. can have a size equal to or less than the minimum feature spacing. The grid can have -fc- -to inches For a + square root equal to or less than 2, multiply the minimum feature spacing by one - » , ^ ^ ^ . The minimum feature spacing is the square root of the size of the size 4 or greater (four). The 201123254 bit is 45 for the grid boundary. Some features of the edge of the degree can be defined, ο 特征 的 的 的 的 或 心 心 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小 最小. The maximum number of corners can be defined for each cell. Each of the features may be one or more features and/or one or more features H 4 (4) to contain pattern data for a portion of the field of the wafer, or pattern data for the field strip of the wafer. ^ Another point of view 'The present invention includes a method for processing a graph of f for a lithography method'. The method includes: providing graph data in a vector format; changing the flanking data; , , , A iw , Generating pattern data in a grid-based format; rasterizing the 4 grid-based pattern data to generate first-order pattern data for the lithography method. The grid-based pattern data may include grid data describing features of the grid or array of grids that occupy the grid, and the grid data describes any of the corners of the features within the grid for each grid. Rasterizing the grid-based pattern data can be performed in an instant process when the lithography method is implemented. Rasterizing the grid-based pattern data can include: translating the grid-based pattern data to generate multi-level pattern data; and, dithering the multi-level pattern data to generate the first-order pattern data. In yet another aspect, the present invention provides a method for exposing a wafer based on pattern data using a charged particle lithography machine that produces a plurality of charged particle beamlets to expose the wafer, the method comprising: Providing pattern data in a vector format; transforming the vector pattern data to generate pattern data in a grid-based format; rasterizing the grid-based pattern data to generate second-order pattern data; and encoding the second-order pattern data string Flowing to the beamlet extinguisher array to turn on and off the beamlets generated by the charged particle lithography machine; and I Si 5 201123254 and, based on the second-order pattern number, turn the beamlets on and off . The lattice-based pattern number may include lattice data describing characteristics of one or more of its occupied grid arrays, the lattice data describing any angularity of the features within the grid for each of the cells. Rasterizing the grid-based pattern data can be performed with immediate processing while the lithography machine is exposing the wafer. Rasterizing the grid-based pattern data may include: translating the grid-based pattern data to generate multi-level pattern data; and, dithering the multi-level pattern data to generate the second-order pattern data . [Embodiment] Hereinafter, a charged particle lithography system of various embodiments of the present invention will be described by way of example only and with reference to the drawings. FIG. 1 is a conceptual diagram showing a charged particle lithography system, and the system 100 is divided into three. High-order subsystems: wafer positioning system 1〇, electron optical column (10), and data path 1G3. Wafer positioning (4) 1 () 1 moves the wafer below the electron optical column 102 in the x direction. The wafer positioning system 101 is supplied. The synchronization signal from data path 103 is used to align the wafer with the electron beamlets produced by electro-optical column 102. Figure 2A shows a simplified illustration of an embodiment of a charged particle lithography system showing the electron optical column 1() 2 detail. For example, such a lithography system is described in U.S. Patent No. 6,897,458, No. 7, Zheng, No. 7, No. 19,908, No. 7, No. 84, 414, and No. 7,129, No. 2, U.S. Patent The application announcement No. 2〇〇7/〇〇64213 and the joint application of the US 201123254 national patent application No. 61/03 1,573, 61/031,594, 61/045,243, 61 U.S. Patent Nos. 5,054, issued toK.S. Pat. In the embodiment illustrated in Figure 2A, the lithography system includes a charged particle source 110, such as an electron source for generating an extended electron beam 13A. The extended electron beam 130 impinges on the aperture array 丨丨丨, which blocks a portion of the beam to produce a plurality of beamlets 13 1 . The system produces a plurality of small bundles, preferably in the range of about 丨〇, 〇〇〇 to 1,000,000 small bundles. The electron beamlet 131 passes through a concentrating lens array 112 that focuses the electron beamlets 131. The beamlet 13 1 is collimated by the collimator lens system 丨13. The collimated electron beamlets pass through the XY deflector array 丨14, the second aperture array U5, and the small beam 132 caused by the second concentrating lens array 116〇, then pass through a beamer array i丨7, which includes a plurality A extinguisher to deflect one or more of the beamlets. The beamlets pass through the mirror 148 and arrive at a beam stop array 118 having a plurality of apertures. The beamlet blanker array 117 operates in conjunction with the beam stop array ι 8 to turn the beamlets on or off by blocking or passing the beamlets. The beamlet blanker array 117 can deflect the beamlets such that they will not pass through the corresponding apertures in the beam stop array 118 <RTIgt; If the beamlet extinguisher array "17 does not deflect the beamlets, it will pass through the corresponding apertures in the beam stop array 118. The undeflected beamlets pass through the beam pupil array and pass through the beam deflector array 119 and the projection lens array 12A. The beam deflector array 19 provides deflection of each beamlet 133 in the 乂 and/or gamma directions (substantially perpendicular to the direction of the undeflected beamlets) to scan the beamlets throughout the surface of the 201123254 target 104. This deflection is different from the deflection used by the beam blanker array to turn the beam off or off. Next, the small pupil 133 passes through the projection lens array 12G and is projected onto the target 1G4. The projection lens configuration preferably provides a reduction of about 100 to 500 times. The beam 133 strikes the surface of the target ι 4, and the target 104 is placed on the movable stage of the wafer positioning system. For lithography applications, the target typically includes wafers that provide a charged particle sensitive layer or a resist layer. The representative diagram shown in Fig. 2A is extremely simplified. In the preferred embodiment, a single electron beam is first split into a plurality of smaller beamlets, which are then divided into more beamlets. Such a system is described in U.S. Patent Application Serial No. 61/045,243, the entire disclosure of which is incorporated herein by reference. In this system, each sub-beam is divided into several small beams, which can be regarded as a pattern bundle. In one embodiment, each beamlet is divided into 49 beamlets arranged in a 7χ7 array. The beamlet blanker array preferably includes a hole having associated extinguisher electrodes for each beamlet to enable on/off switching of individual individual beamlets. Figures 3 and 4 show a portion of the beamlet blanker array, with each of the pattern bundles having nine beamlets, the beamlets of each group being arranged in a 3χ3 array. The beamlet arrangement and the writing strategy in the pattern bundle are described, for example, in U.S. Patent Application Serial No. 61/58,596, the entire disclosure of which is incorporated herein by reference. The beam deflector array and the projection lens array preferably comprise only one aperture and lens for each pattern bundle (e.g., a small aperture or lens for each of the 49 small bundles forming a pattern bundle). A small bundle is typically combined (interleaved/multiplexed) in a group in which it is written into a single stripe. 201123254 Data Path Architecture A simplified block diagram of one embodiment of data path 103 is shown in Figure 2B, and a portion of the data path also appears in Figure 2a. The switching of the small beam extinguisher array Π7 is controlled via the data path. Pre-processing unit 140 receives information describing the layout of the device to be fabricated by the lithography machine. This information is typically provided in the GDS-II file format. The pre-processing unit performs a series of transformations of the GDS-II file to generate an on/off control signal to control the beamletter array 11 7 . The control signal is transmitted to an electro-optical conversion device 143 (such as a laser diode) to convert the electrical control signal into an optical signal. The optical control signal is directed through fiber 145. The beam 146 at the output of the fiber is directed through the lens 147 of the array onto the apertured mirror 148. From the mirror, the beam is reflected onto the bottom side of the beam extinguisher array 117. The individual beams are directed to a plurality of photoelectric conversion devices (e.g., photodiodes) directed to the bottom side of the beam extinguisher array 117. Preferably, the beamlet extinguisher array has a photodiode "photodiode" operation for each fiber 145 to actuate individual beam extinguisher electrodes to control the deflection of the beamlets 13 2 to separate individual beamlets Switch on or off. The control signal for controlling the individual beamlet totalizer electrodes is preferably multiplexed such that each beam 146 carries a control signal for a channel that includes a plurality of small fibers that share a single fiber and photodiode. bundle. The multiplexer diaphragm is received by the photodiode and converted into an electrical signal. The beamlet blanker array 117 includes logic operations for demultiplexing the control signals received by the photodiodes to derive control signals for individually controlling the electrodes of the plurality of beamlet blankers 201123254. In a preferred embodiment, the individual control signals for the 49 beamlets used to control a pattern beam are time multiplexed for use on a single fiber as a single photodiode on the beamlet annihilator array. Receiver In addition to multiplex, the beamlet control signal can also be configured for transmission in a fume configuration and can have synchronization bits and additional coding to improve transmission, for example: using coding techniques to achieve frequent signal transitions, preventing The coupling method uses a laser diode and a photodiode. By forcing the transition, ^ is automatically distributed over the optical signal. Figure 12 shows an example of a beamlet control signal with 49 small bundles of frames (of a pattern bundle), synchronization bits, and multiplexing. Control bit. At a closer proximity to the wafer, a beam deflector array ι 9 is used to deflect the electron beam in the y-direction (and a small amount of deflection in the direction) to achieve electron beam scanning across the surface of the wafer. In the illustrated embodiment, the wafer ι 4 is mechanically tangentially oriented by the sweet circular positioning system 1 〇 1 and the electron beam is scanned throughout the wafer in a y-direction substantially perpendicular to the X-direction. When writing data, the : beam is slowly deflected in the y direction (compared to the return ship = beam is the car quickly moving back to the yr_ start position (this is called = beam deflector array 119 receives timing and synchronization from the digital signal.) The small amount == can be divided into several channels. The channel is from the pre-processing unit to the gas to the optical converter (for example: in the laser diode case, the channel contains the individual money and the optical to the t4# beam control (four) This is the channel to transmit the control signal for a single pattern bundle. The single pattern bundle contains several individual beamlets (for example, 49 small cells that make up a pattern beam). A pattern bundle can be used to write a single strip on the wafer. In this configuration, the channel represents a data path component that is dedicated to control the inclusion of multiple beamlets (eg, 49 beamlets) A pattern bundle and carrying a beamlet control signal to write a strip based on the pattern data. The subchannel represents a data path component that is dedicated to control a single beamlet within the pattern bundle. Data path 101 transforms the layout data into The on/off signal of the control electron beamlet. As described above, this transformation can be performed at the pre-processing unit 140, and the pre-geographic unit 140 performs a series of transformations on the layout data, typically in the form of a GDUJ or similar file. This processing typically includes: flattening/preprocessing, rasterization, and multiplexing steps. The flattening/preprocessing step transforms the layout data format into a dose map. The dose mapping is in vector format. The associated dose rate value is used to describe the area on the wafer. This step may include some pre-processing such as proximity effect correction. Because of the complexity of the pre-processing, this step is preferably performed offline. The thumb-up step transforms the dose mapping It becomes the control (on/off) signal of the stream. The multiplex step is to package the beamlet control signal according to the multiplex scheme. The half-field can be used to write the wafer in the lithography machine..._日日日万万沄 It can be roughly described in the order of the following steps. Wafer 1〇4 is from An Meng*曰ffi” 疋 哀 哀 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在 在, The J is calibrated. The wafer is mechanically aligned and calculated according to the alignment (offset) of the field. The wafer is moved by the station in the +x direction and the column is open and mad. The first % field is written. When the beamlet is extinguished, the hole in the leading column of the 201123254 array is passed through a field boundary, and the offset correction is set for the next field. Therefore, when the first field is In order to still write, the lithography system will start writing to the next field. After the force passes through the last field in a column, 'the station will be moved to locate the field below the wafer-on-row. Below the small beam extinguisher array. When the station moves in the x direction, a new run will begin. The scan deflection direction is preferably unchanged. Correcting the data processing performed by the data path provides a small beam control signal. - Some different adjustments to make corrections and compensations for various types. For example, such corrections and compensations may include proximity corrections and anti-(four) heating corrections to compensate for the effects that occur as a result of using resist properties. Data adjustments may also include modifications designed to compensate for errors or failures that occur in the lithography machine. In the preferred embodiment of the charged particle lithography machine II, the lithography machine does not have any built-in facilities for adjusting individual electron beamlets to correct for errors in beam position, size, current, or other characteristics of the beam. Missing is misalignment or failure such as small beam, low or high beam current, and incorrect deflection of beamlets. Such a deficiency may be the result of defects or tolerance changes in the manufacture of the lithographic machine, the blocking of small bundles or the turning off of dirt or dust that is charged and deflected, the failure or degradation in machine components, and the like. The lithography machine omits the correcting lens or circuit for making individual corrections to the J-beam' to avoid the additional complexity and cost involved in incorporating additional components into the electron beam to make the actual beam correction, and avoiding the inclusion of such Additional components require an increase in the size of the column. However, the modulation of the beamlet control signals and/or the additional scanning of the wafer 12 201123254 can compensate for these types of problems. Failures in the data path can also be corrected by the modulation of the control signal and with rescanning of the wafer. The various methods used to make these modifications are described below. Redundant scanning The charged particles of the work ^ 夕 ..... % 佩 个 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的 的It is highly probable that failures may occur in some of these components or that 2 such components will deteriorate or be affected by contaminants so that they are not implemented within specifications. In order to maximize the time between system maintenance, periodic checks can be made to identify failed or substandard beamlets or data channels. This check is performed before each wafer scan, before each first scan of the wafer, or at a convenient time. The inspection may include one or more examples of 'including the U.S. Patent Application Serial No. 61/122,591, the entire disclosure of which is incorporated herein by reference. The failure of the main target, ,, due to the replacement of the failed part in the column, 'can use redundant scanning to deal with the failure in the data path, the failed fiber or laser in the channel ^ The channel was cut during the scan and corrected using the strip of the person written by the other channel's remaining failed channel. It is written that the strip that is to be exposed by the bundle is unwritten in the small bundle that is invalid or out of specification. ‘================================================================================ In a pattern beamlet system such as that described above, the complete channel including the failed or out-of-spec beamlets can be severed and will not be written to the full strip of the wafer field to be exposed by the beamlets of the channel. band. After the first scan of the entire circle is performed, a redundant scan can then be performed to fill the missing strips (and any other missing strips for other channels with failed beamlets). For redundant scans, the wafer is returned to the starting position after the first scan and is shifted to ensure that the appropriate active channel is where it can be used to write the missing strip. The pattern data for the redundant scan is preferably prepared in the lithography system during the first scan, so that the redundant scan can begin as soon as the first scan is completed. Preferably, there is no significant delay between the end of the first scan and the beginning of the redundant scan, so that the data for the redundant scan is preferably fast available on the appropriate node. The lithography machine is preferably capable of writing a continuous in-line (m-Iine) field in one scan and writing in two directions parallel to the chirp direction of the mechanical scan (ie, -X and +χ directions). . The machine also preferably includes a spare bundle (or pattern bundle)' which is typically located at the edge of the column.曰 In order to write a missing strip during the redundant scan by the appropriate active channel, the off-four column can be shifted (offset) in the y-direction and/or the χ direction by an amount corresponding to the number until the channel with the appropriate effect beamlet To locate the position to write the missing strip. This is preferably achieved by mechanical offset of the wafer on the stage. In order to better handle the failure of one channel after all kinds of errors, it may be necessary to sneak a dream, which requires a shift for the first and second scans. Multiple scans 201123254 In multiple scans, in the examples, m, soil can also use the small beam and the small beam scanning effect in the -a & In many times...! The first scan of the _ Ap Yasuda day circle that still achieves the redundant strip is the write field rt. The second scan is the rest of the write strip, causing the scan or four scans, etc., to extend this principle to The total time and reduction of the three exposures is used to scan the circle to be 1 compared to b 4 . Therefore, the secondary scan or the two rate is = scan and redundant scan is possible because the small beam failure type is low. The beam measurement can be performed before the first scan to (4) the failure, and the small beam without the σ specification. Using this binning, you can calculate the first and second knowledge, which will cause it to be specified in the redundant scan of the wafer scanned by the small beam, preferably, when it fails or does not meet the specifications? At the time of the test, cut off the entire channel including the beamlet and use another 1 乍 = (all beamlets that meet the specifications) to write the strips that will be written by the failed channel. A variety of algorithms can be used to calculate the channels to be used for the first and second scans and the wafers (four) required for each scan, causing all strips to be written by the active channel. For the second scan, the algorithm is to find a G-segmented channel between the various scans of the (four) channel. A brute force can be used to test various channel assignments and wafer offsets to find the right combination' or a more complex matching algorithm can be used. Therefore, the total exposure current for the wafer is distributed between two (or more) scans. In multiple scans, a second scan (or a third scan) can be used

S 15 201123254 =第四次掃描等等)來掃描其在第—次掃 通道的條帶,如同在冗餘掃描,。不存在任何失曰:=效 】束時,亦可制多轉描。將^電流分在二或多個 田』間具有優點在於晶圓的瞬間加熱變 =各個掃描的總小束電流降低,各個掃描=的 是分散在多個掃m 為相同’總熱負載S 15 201123254 = fourth scan, etc.) to scan the strips in the first sweep channel, as in redundant scans. There is no any loss: = effect 】 When the beam is bundled, it can also be made into multiple scans. The advantage of dividing the current into two or more fields is that the instantaneous heating of the wafer = the total beam current of each scan is reduced, and each scan = is dispersed over multiple sweeps for the same 'total heat load'

造成較少的局部或瞬間熱負I ❹多切财降低在㈣途財的需求容量。 ::圓使用二次掃描’數據途徑的數據傳輸容量在理論 為減半,因為各個掃描僅需要小束控制數據量的一半。 是:為數據途徑所需的龐大數據傳輸容量 ”關聯的间成本而為顯著。對於上述的實施例,其 一個通道的每個圖型束為49個 3 的傳輸容量4有二個:期母個通道為约 6人 八有13,000個圖型束(各個圖型束 W 49個小束)的機器將需要其各者為4咖/…容量的 個通道。因此’顯著降低對於數據途徑的需求容量。 寫入策略 目前工業標準是300mm晶圓。晶圓是分為固定尺寸的 ㈣’其具有最大尺度為26mmx33mm。各個場域可作處理 =產生多個1C(即:對於多個晶片的佈局可寫入單個場域中) 而1C並未跨越場域邊界。就26mmx33mm的最大尺 單個標準晶圓上具有可用的63個場域於丨认 ° ^ J J個%域。較小的場域是可能 =將造成每個晶圓為較多個場域。5顯示其分為場域 圓、以及寫入場域的方向。場域是在晶圓上的矩形區 201123254 域,典型為具有26mmx33mtn的最大尺寸。ODS-II檔案是 描述場域的特徵。寫入部分(不完整)場域亦為可能,舉例來 說,藉由將完全場域寫為部分場域且跨越晶圓邊界。 在微影機器的較佳實施例中,機器產生丨3,〇〇〇個子束 且各個子束是分割為49個小束,造成637,〇〇〇個小束(即: 〇χ49)。小束媳滅器陣列是在26x26mm的區域中含有 13’〇〇〇個光電二極體與637,_個孔。在小束熄滅器陣列中 的各個光電二極體接收對於49 (7x7)個熄滅器孔/小束之控 制的多工控制信號。在距離26mm的13,_個子束造成在 y方向(垂直於機械掃描)的寬度2μιη的條帶且如同在χ方向 的場,一樣長。各個子束的49個小束寫入單個條帶。 晶圓較佳為由微影機器在反向與順向的χ方向所寫入 ⑽光^由偏轉II)在y方向的寫人方向是料在__個方向。 田場域的尺寸(高度)是選取為小於電子·光學(E〇 =r〇n_°Ptieal)裂縫的尺寸(即··如投射到晶圓上的小束的 二=的尺寸X例如··小於26mm的最大尺寸),則多個 =二放在晶圓上,但是並非所有電子小束為將使用 將減少。 ⑨將…-人知描晶圓且整體產量 當機器為正在將圖型寫入場 熄滅器陣列進入下個p — ,、瞬間,小束 單歹J進入下個%域且開始將圖型寫入其 應该能夠同時寫入二個場域。若場 故機— 能夠同時寫入三個場域。 …。機@ Μ該 小束媳減器陣列的簡化形式是顯示在圖3與4之卜 £ S] 17 201123254 其中僅有16個光電二極體,各者接收對於9(3χ3)個媳滅号 孔/小束之控制的多工控制信號。具有關聯的熄、滅器電極二 熄滅器孔能夠將小束(或電子束)阻斷或是讓其通過。通過媳 滅器孔的小束將寫入晶圓表面上的抗蝕劑。 ' 在圖3中’熄滅器孔的配置是對於平行投射寫入策略 所顯示;而在圖4 +,此是對於垂直寫入策略所顯示。在 圖4中,對於小束的㈣器孔是分佈在整個條帶寬度即: 各個小束與相鄰小束是在垂直於寫入(掃描)方向的方向為 等距離定位。此為可能,但對於少量的孔,就於束與小束 電流之間的比率而論,此配置的效率將為極低。對於效率 的一個測量是填滿因數,填滿因數是熄滅器孔的總面積與 對於一個圖型束的孔為群組在其的面積之間的比率。填滿 因數疋對於砰估就電流輸入(束電流)與電流輸出(總計小束 電流)而論的特定網格(grid)幾何結構的效率為有用。當小束 孔群組的面積為較小,填滿因數將增大到更佳值。 適用於少量的孔之寫入策略是“平行投射”寫入策略 (參閱:圖3) ’其中(以其最簡單形式)個別小束為交插且寫 入整個條帶寬度(如在圖8B所示)。此類的寫入策略是描述 於美國專利申請案序號第61/058,596號,其整體為以參照 方式而納入於本文》 掃描繞 對於平行的所有小束,束偏轉器陣列11 9將產生三角 形的偏轉信號。偏轉信號包括掃描階段與返馳階段,如在 圖ό的示意圖所示。在掃描階段期間,偏轉信號將小束(當 18 201123254 接通時)在y方向緩慢移動且小束熄滅器陣列根據小束控制 信號而將小束接通及切斷。在掃描階段後,返馳階段開始。 在返馳階段㈣,小束被切斷且偏轉信號將小束快速移動 到下個掃描階段將開始處的位置。 掃描線是在掃描階段期間之晶圓表面上的小束途徑。 在沒有特別措施的情況下,掃描線將不會確實沿著y方向 寫入晶圓上,而將會稍微偏斜為同樣具有小的乂方向分量, 因為在X方向的連續台移動。此誤差可藉由將小的χ方向 分量加到偏轉場域以匹配台移動來作修正。此修正可在ε〇 =處理,使得數據途徑不需要修正此誤差。& χ方向的 分量是小的’因為台移動相較於y方向偏轉掃描速度為慢 (典型的相對速度比可為1:1_。然而,此χ方向分量 的效應是在具有圖型束的系統中而大為提高。首先,偏轉 速度可與每個圖型束的小束數目成比例而降低。其次,歸 因於小束陣列的傾斜度(如在圖3、4與9的實例中所示), 在晶圓上的掃描線的偏斜將造成變更由不同小束所作成的 掃描線之間的距離。足夠大的偏斜可能造成掃描線重疊或 改變相關於彼此的位置。 ^掃描線(參閱:圖6的右側)是分為三段:開始過掃描 奴、圖型段與結束過掃描段。小束是沿著y方向偏轉。小 :為:轉在其中的距離是典型較其條帶所應寫入為寬。過 寫入處的位置的空間。過掃 描:早邊過剩的。倘若條帶寬度為2pm且過掃描為— (或叫,此造成3 Pm的掃描線長度。掃描線位元框的過 201123254 掃描段持有其不是料寫人圖型(圖型段位元)的位元。過掃 描位元值為切斷,但是傳送在光纖上。掃描線位元框的圖 型段持有描述栅格化圖型的位元。在此段中的位元是被主 動接通及切斷以供寫入特徵。 在圖6中(的左側),掃描線是對於僅有一個小束為寫入 條帶的情況而描繪。在偏轉週期期間的小束途徑是a b_C。 AB疋在掃描階段期間的掃描線移動,而是在小束為切 斷期間的祕。條帶邊界是標示為D與E。在圖6的右側, 識別過掃描與圖型段。用於在掃描線上切換小束的小束控 制信號的整組位元被稱為掃描線位元框。 在整個掃描線期間,小束是由微影系統所控制。在過 掃描段中,小束將被切斷。在圖型段中,小束是根據需要 被寫入在晶圓場域中的特徵而切換β對於過掃描段與圖型 段之在掃描線位元框令的&元是代表要轉㈣小束熄滅器 陣列的數據。在過掃描段中的位元/像素似乎為無用且耗用 數據途徑的頻寬。然而,在過掃描段中的位元/像素可提供 修正(諸如:圖型移位與圖型定標)的空間,提供縫綴演算法 的二間,且當寫入策略為用在所有小束寫入整個條帶寬度 (平行投射)而提供對於小束在熄滅器孔y位置差異的空間。 饭定對於其控制小束的小束控制信號的固定位元率與 某個像素尺寸,掃描線可映射成為固定長度位元框,即: 掃描線位元框。 在圖7中’提出對於圖型偏移與圖型定標的實例。掃 描線A疋’又有偏移或定標的垂直掃描線,其中,寫入掃描 20 201123254 線的小束被正確對準且τ过 平且正確偏轉以將在晶圓上的期望 正確曝光。掃描線Β與停 ' ,± ^ . 悚帶並未最佳對準,例如:歸因於 小束的失準。此可藉由嘲敕 、 錯由調整小束切換的時序、藉由將 控制#號中的數據移位一 疋登像素而修正。此可葬 在掃描線位元框内的控制位元移位而達成。 9 掃描線C並未正確定標以相配在條帶邊界d與 内’例如.歸因於其局部較堂能盔找从丨土 較节匕為弱的小束的偏轉。因此, m控制信號的較多個位元,而過掃描段使用較少 疋。寫入條帶的圖型需要對於條帶寬度的多個位元。 從位元框的觀點,僅可為以令德去^ 像素的解析度而進行移位與 疋=》而,柵格化方法能夠處理子像素解析度修正(例如: :。像素)。組合此二者將允許移位,諸如⑴個像素的移 小束窝入y略 在上述的實施例中,各個早击县八& d Λ 谷個子束疋刀為49個小束且通道 將49個小束組合以供穹备 罵入條帶對於寫入條帶存在多個不 同的寫入策略。小束寫入策略是 L7 ^ ^ ^ ^小束為以何種方式排 、供寫入條帶。方案可為堆叠、交插或重疊的組合。小 疋f二個階財所偏轉:掃描與返馳。在掃㈣段中, 二束疋/。者在晶圓上的其掃描線而偏轉(當其為接通)。掃描 =元框的圖型段將用位元圖型所填滿以將期望晶片特徵 曝光。 在圖” ’數個實例是顯示可能的交插方案,盆使用 四個小束以供寫入條帶。此等實例並非即時顯示小束如何 L S. } 21 201123254 寫入,而是顯示當寫入已經完成時,哪個小束為已經寫入 條帶的哪個部分。 實例A顯示將小束堆疊。每個小束寫入其本身的子條 帶。對於此組態’各個小束在其返驰前而僅為寫入少量的 :元。偏轉信號的頻率是高的而且其振幅是小的。此寫入 策略疋適用在成群的小束為排列以使得群組寬度(小束數目 技射間距ppr〇j)為等於條帶寬度(垂直投射)之情形。 ,杈射疋一系列的寫入策略。對於垂直投射的基本 形式’所有小束寫入小的子條帶。子條帶寬度是條帶寬度 的小4刀。媳滅器孔網格的尺寸是典型相關於條帶寬度。 在實例Β中,在整個條帶寬度上交插小束。偏轉信號 =頻率疋低的而且其振幅是大的。相配交插掃描線的寫入 ^略^平行投射寫人策略。特別是對於在—個群組中的相 “里的小束’此策略允許較小的群組尺寸及改良 因為少量的小束,在晶圓上的群組尺寸是因為合理 的填滿因數而顯荖 可計算i為斜於力_ 對於此寫入策略(平行投射), 束門距二;—個群組中的特定數目的小束與某個小 士π 一連串的像素尺寸+像素尺寸不是任 π兮 栺線位疋框中的額外位元以補償在小束 熄滅Μ與條帶中央之間的最差情況的偏移。 小束以六奸射π系列的寫入策略。對於平行投射,所有 條帶宽I。方式寫入整個條帶寬度。媳滅器孔網格無關於 實例C是交插與堆疊的組合。對於實例D,連續交插 22 201123254 層是如同磚牆而 間的較佳平均。 小束。 重疊。相較於實例c,此組態將提供在小束 在條帶料,具有將寫入㉟過條帶邊界的 確定—描線如何將條帶填滿的實例。寫人策略是 仃奴射寫入策略的一個優點是其效 =使用—個電子束以作成該等小束。其效率是取決於孔 μ :面積(小束輸出電流)相較於孔群組面積(束輸入電流)的 ^ “。對於相當少量的孔(49個)’束(小束群組)的面積是為 了可接受的效率而必須為小。對於“平行投射,,,束(群組) 尺寸是小於條帶寬度。 像素尺寸是一個重要的系統參數。在熄滅器(孔的)網格 與像素尺寸之間的關連性是在下文解說。 圖9顯示一種簡化的小束熄滅器陣列。對於各個小束, 具有在小束熄滅器陣列中的一個對應孔以及在各個孔的熄 滅器電極。熄滅器包括電子電路,藉由使得熄滅器電極通 電或斷電以將小束切斷或接通。僅具有四個孔的陣列是顯 示作為簡單實例,且圖型束是由四個小束所組成。 按照網格’五列的掃描線圖型是類似於圖8的圖型而 繪製》五列是對於在1到5之範圍中的特定κ值所繪製。K 是關於(例如:由掃描之間的台移動所引起)在掃描線之間的 距離的一個因數。藉由調整在χ方向的台移動與在丫方向 的偏轉速度(掃描階段與返馳階段)之相對速度可達成不同 的K因數。Causes less local or instantaneous heat negative I ❹ more cuts to reduce the demand capacity in (four) way. The data transmission capacity of the ::circle using the secondary scan' data path is theoretically halved because each scan requires only half of the amount of small beam control data. Yes: it is significant for the inter-connected cost of the huge data transmission capacity required for the data path. For the above embodiment, each pattern bundle of one channel has 49 transmission capacities of 4 and two: Machines with about 6 people and 83,000 pattern bundles (49 bundles for each pattern bundle) will require a channel of 4 coffee/... capacity, thus 'significantly reducing the need for data paths. The current industry standard for writing strategies is 300mm wafers. Wafers are divided into fixed sizes (4) 'The largest scale is 26mm x 33mm. Each field can be processed = multiple 1Cs are generated (ie: layout for multiple wafers) Can be written to a single field) and 1C does not cross the field boundary. The maximum size of 26mmx33mm has a total of 63 fields available on a single standard wafer. The smaller field is Probably = will cause each wafer to be more than one field. 5 shows that it is divided into a field circle and the direction of the write field. The field is in the rectangular area 201123254 on the wafer, typically with a maximum of 26mmx33mtn Size. The ODS-II file describes the field. It is also possible to write a partial (incomplete) field, for example, by writing the full field as a partial field and across the wafer boundary. In a preferred embodiment of the lithography machine, the machine generates 丨3, a sub beam and each sub-beam is divided into 49 small beams, resulting in 637, a small beam (ie: 〇χ 49). The beamlet annihilator array contains 13' in the 26x26mm area光电 a photodiode with 637, _ holes. Each photodiode in the beamlet extinguisher array receives a multiplex control signal for the control of 49 (7x7) extinguisher holes/beamlets. A distance of 26 mm from 26 mm causes a strip of 2 μιη width in the y direction (perpendicular to the mechanical scan) and is as long as the field in the x direction. 49 small bundles of each subbeam are written into a single strip. The circle is preferably written by the lithography machine in the reverse and forward direction of the ( (10) light ^ by the deflection II) in the y direction of the writing direction is expected to be in the __ direction. The field size (height) is Choose to be smaller than the size of the electron · optics (E〇=r〇n_°Ptieal) crack (ie · as small as projected onto the wafer) The size of the beam's two = X is less than the maximum size of 26 mm, for example, multiple = two are placed on the wafer, but not all of the electron beamlets will be used for reduction. 9 will... Output When the machine is writing the pattern to the field extinguisher array into the next p-, and instantaneously, the small bundle J into the next % field and start writing the pattern should be able to write to both fields simultaneously If the machine is on the machine - can write to three fields at the same time. ... machine @ Μ The simplified form of the beam reducer array is shown in Figure 3 and 4 s] 17 201123254 of which only 16 Dipoles, each receiving a multiplex control signal for the control of 9 (3 χ 3) annihilation holes/beamlets. An associated extinguisher and extinguisher electrode can be used to block or pass a small beam (or electron beam). The small beam passing through the quencher hole will be written to the resist on the wafer surface. The configuration of the 'extinguilder hole' in Figure 3 is shown for the parallel projection write strategy; and in Figure 4+, this is for the vertical write strategy. In Fig. 4, the (four) apertures for the beamlets are distributed over the entire strip width: i.e., each beamlet and adjacent beamlets are equidistantly positioned in a direction perpendicular to the write (scan) direction. This is possible, but for a small number of holes, the efficiency of this configuration will be extremely low in terms of the ratio between the beam and the beam current. One measure of efficiency is the fill factor, which is the ratio of the total area of the extinguisher holes to the area of the group of holes for a pattern bundle. The fill factor is useful for estimating the efficiency of a particular grid geometry with respect to current input (beam current) and current output (total beam current). When the area of the small beam group is small, the fill factor will increase to a better value. The write strategy for a small number of holes is the "parallel projection" write strategy (see: Figure 3) 'where (in its simplest form) individual beamlets are interleaved and the entire strip width is written (as in Figure 8B) Shown). A write strategy of this type is described in U.S. Patent Application Serial No. 61/058,596, the entire disclosure of which is incorporated herein by reference. Deflection signal. The deflection signal includes a scanning phase and a kickback phase, as shown in the schematic of Figure 。. During the scan phase, the deflection signal slowly shifts the beamlet (when 18 201123254 is turned on) in the y direction and the beamlet blanker array turns the beamlet on and off according to the beamlet control signal. After the scanning phase, the kickback phase begins. In the flyback phase (4), the beamlet is cut and the deflection signal moves the beamlet quickly to the position where the next scanning phase will begin. The scan line is the beamlet path on the wafer surface during the scan phase. Without special measures, the scan line will not actually be written to the wafer along the y-direction, but will be slightly skewed to have a small chirp direction component as well, because of the continuous table movement in the X direction. This error can be corrected by adding a small chirp direction component to the deflection field to match the station shift. This correction can be handled at ε〇 = so that the data path does not need to correct this error. & The component of the χ direction is small 'because the stage movement is slower than the y-direction deflection scan speed (the typical relative speed ratio can be 1:1_. However, the effect of this χ direction component is in the pattern bundle The system is greatly improved. First, the deflection speed can be reduced in proportion to the number of beamlets per pattern bundle. Secondly, due to the tilt of the beamlet array (as in the examples of Figures 3, 4 and 9) As shown, the skew of the scan lines on the wafer will cause the distance between the scan lines made by the different beamlets to be changed. A sufficiently large skew may cause the scan lines to overlap or change the position relative to each other. The scan line (see: the right side of Figure 6) is divided into three sections: the beginning of the scan slave, the pattern segment and the end overscan segment. The beamlet is deflected along the y direction. Small: the distance in which the turn is typical It should be written wider than its strip. Space over the position of the write. Overscan: Excessive in the early side. If the strip width is 2pm and the overscan is - (or called, this results in a scan line of 3 Pm Length. The scan line bit box has passed the 201123254 scan segment holding it is not expected The bit of the human pattern (pattern segment bit). The overscan bit value is cut off, but transmitted on the fiber. The pattern segment of the scan line bit box holds the bit describing the rasterized pattern. The bits in this segment are actively turned on and off for writing features. In Figure 6 (left side), the scan line is depicted for the case where only one small beam is written to the strip. The beamlet path during the period is a b_C. AB疋 scan line movement during the scanning phase, but the secret during the beamlet is cut off. The strip boundaries are labeled D and E. On the right side of Figure 6, identification Overscan and pattern segments. The entire set of bits used to switch the beamlet beamlet control signals on the scan line is referred to as the scanline bitframe. During the entire scanline, the beamlets are controlled by the lithography system. In the overscan segment, the beamlet will be cut. In the pattern segment, the beamlet is switched according to the features that need to be written in the field of the wafer, and the scan line is for the overscan segment and the pattern segment. The & meta of the meta box is the data representing the array of small beam extinguishers to be rotated (four). The bits in the overscan segment The meta/pixel seems to be useless and consumes the bandwidth of the data path. However, the bits/pixels in the overscan segment provide space for corrections (such as pattern shifting and pattern scaling), providing stitching calculations. Two of the methods, and when the write strategy is to write the entire strip width (parallel projection) for all beamlets, provides a space for the difference in the position of the beamlets at the extinguisher hole y. The fixed bit rate of the beam control signal and a certain pixel size, the scan line can be mapped into a fixed length bit box, ie: the scan line bit box. In Figure 7, an example of pattern offset and pattern scaling is proposed. The scan line A 疋 'has again offset or scaled vertical scan lines, wherein the write beam 20 201123254 line beamlets are properly aligned and τ is flat and properly deflected to properly expose the desired exposure on the wafer. The scan line 停 and stop ' , ± ^ . The 悚 belt is not optimally aligned, for example: due to small beam misalignment. This can be corrected by ridiculing the error, adjusting the timing of the beamlet switching, and shifting the data in the control # number by one pixel. This can be achieved by shifting the control bits within the scan line bit box. 9 Scan line C is not being determined to match the deflection at the strip boundary d and inside, for example, due to its locality, which is a weaker beam that is weaker than the crucible. Therefore, m controls more than one bit of the signal, while the overscan segment uses less 疋. The pattern written to the strip requires multiple bits for the stripe width. From the viewpoint of the bit frame, the shifting and 疋 = " can be performed only by the resolution of the pixel, and the rasterization method can process the sub-pixel resolution correction (for example, : pixels). Combining the two will allow shifting, such as shifting the beamlets of (1) pixels slightly in the above-described embodiment, each of the early hits of the eight & d Λ valley sub-beams is 49 small bundles and the channel will There are 49 different bundle combinations for the write-in stripe. There are multiple different write strategies for writing strips. The small bundle write strategy is how the L7 ^ ^ ^ ^ beamlet is arranged for writing stripes. The solution can be a combination of stacking, interleaving or overlapping. Xiao 疋 f two-stage financial deflection: scanning and returning. In the sweep (four) paragraph, two bundles /. The person is deflected (when it is on) on its scan line on the wafer. The pattern segment of the Scan = Metaframe will be filled with the bit pattern to expose the desired wafer features. In the figure "Several examples show possible interleaving schemes, the basin uses four small bundles for writing strips. These examples are not instant display of how the beamlet L S. } 21 201123254 writes, but instead shows Which bundle is already written to the strip when the write has been completed. Example A shows stacking the bundles. Each bundle is written to its own sub-strip. For this configuration, the individual bundles are in it. Before the flyback, only a small amount of : yuan is written. The frequency of the deflection signal is high and its amplitude is small. This write strategy applies to the clustering of small bundles to make the group width (the number of small bundles) The technical spacing ppr〇j) is equal to the strip width (vertical projection). A series of writing strategies are applied. For the basic form of vertical projection, all small bundles are written into small sub-bands. The strip width is a small 4 knife width. The size of the quench hole grid is typically related to the strip width. In the example, the beam is interleaved over the entire strip width. Deflection signal = frequency is low Moreover, its amplitude is large. The writing of the matching interleaved scanning line ^^^ Line projection writer strategy. Especially for the "small bundles" in the group - this strategy allows for smaller group sizes and improvements because of the small number of small bundles, the group size on the wafer is because A reasonable fill factor can be calculated to calculate i is oblique to force _ for this write strategy (parallel projection), the beam door distance is two; - a specific number of small bundles in a group and a pair of small π The pixel size + pixel size is not an extra bit in any of the π 兮栺 line positions 以 to compensate for the worst case offset between the beam blanking Μ and the center of the strip. The small bundle is based on the writing strategy of the π series. For parallel projection, all strips have a bandwidth of I. The way to write the entire strip width. The annihilator hole mesh is irrelevant. Example C is a combination of interleaving and stacking. For Example D, the continuous interleaving 22 201123254 layer is a better average like a brick wall. Small bunch. overlapping. Compared to example c, this configuration will provide an example of how the beam will be filled in the strip, with the determination that the strip will be written over 35 strips. One of the advantages of the write strategy is that the slave write strategy is effective = use an electron beam to make the beamlets. The efficiency depends on the hole μ: area (small beam output current) compared to the hole group area (beam input current). For a relatively small number of holes (49) 'beam (small beam group) area It must be small for acceptable efficiency. For "parallel projection," the bundle (group) size is smaller than the strip width. Pixel size is an important system parameter. The correlation between the grid of the extinguisher (hole) and the pixel size is explained below. Figure 9 shows a simplified beamlet blanker array. For each beamlet, there is one corresponding hole in the beamlet extinguisher array and the extinguisher electrode in each hole. The extinguisher includes an electronic circuit that turns the beam off or on by causing the extinguisher electrode to be powered or de-energized. An array with only four holes is shown as a simple example, and the pattern bundle is composed of four small bundles. The scan line pattern according to the grid 'five columns is similar to the pattern of Fig. 8'. The five columns are plotted for a particular κ value in the range of 1 to 5. K is a factor about the distance between the scan lines (for example, caused by the movement of the table between scans). A different K factor can be achieved by adjusting the relative speed of the table movement in the χ direction and the yaw rate in the 丫 direction (scan phase and flyback phase).

23 201123254 在圖9之對於κ=1的列中,圖型是顯示當台移動群組 寬度的距離時而將被寫入。在掃描線之間的距離是等於在 對於此投射的熄滅器孔之間的距離,即:投射間距(Ρμ。』)。 實際上,投射間距將為遠大於像素尺寸且為一個常數(微影 機器的設計參數)。在圖9中的其他列是顯示當該台僅移動 群組尺寸的整數分數而在x方向的掃描線距離發生為何。κ 是此分數。 一些Κ值將造成重寫先前的掃描線。不應使用此等κ 值。避免此舉的κ值是由方程式GCD(NK)=1所定義,其 中,GCD指出最大公分母,N是對於一個通道在小束熄滅 器中的孔數目(即:在各個圖型束中的小束數目),且艮是台 移動對群組尺寸的分數。若在網格中的孔數目與κ值的最 大公分母等於1,則該κ值為可接受。當使用一值κ=5,在 掃描線之間的距離亦將隨著相同因數而減小。使用“平行 投射”且選擇適當〖值,可確定像素尺寸(至少在又方向)。 然而,一個限制在於此造成僅為固定組的像素尺寸。因數κ 將偏轉頻率與台速度作連結。 圖65說明具有在頂部實例為因數K=1及在底部實例為 K:3的寫入策略。圖66說明對於具有4個小束的圖型束之 可能Κ值。 對於49個孔的網格(例如:7χ7陣列)的實例是提供在 圖1〇的表格中,其描述對於數個有效κ值在χ方向的像素 尺寸(以奈米為單位),假設束間距為61 nm (給定血型的孔 尺寸而將提供25〇/。填滿比率)。對於此等參數,投射間距p 24 201123254 將為8.6 nm。對於此幾何結構的網格寬度是WprQj=414 nm。 因此,位元框是能夠掌控+/_2〇7 nm的寫入策略移位。 圖11是九個小束的陣列的圖例,顯示一些使用術語的 定義,包括:束間距Pb、投射間距Ppr<)j、網格寬度 與傾斜角〇{array。圖63是另一個實例,顯示四個小束的陣列。 圖57顯示像素尺寸與網格寬度的表格,取決於每個圖 型束的小束數目(Npat_beains)、陣列傾斜角(aarray)、投射間距 (Pproj)、與K因數。為了降低其需要被產生且透過數據途徑 所傳送的控制數據量及提高產量,大像素尺寸是期望的。 然而,像素尺寸是受到期望CD與抗蝕劑性質所限制。在表 格中,假設在X方向的最佳像素尺寸(I— X)為3·5 nm,且 從左側起的第四行顯示其基於投射間距與最佳像素尺寸之 κ的計算值。給定每個圖型束的小束數目,可接受的最接近 K值是顯示在從左侧起的第五行中。第六與第七行顯示對於23 201123254 In the column for κ=1 in Fig. 9, the pattern is displayed when the distance of the group moving group width is displayed. The distance between the scan lines is equal to the distance between the extinguisher holes projected for this, i.e., the projection pitch (Ρμ.). In fact, the pitch of the projection will be much larger than the pixel size and a constant (design parameters of the lithography machine). The other columns in Fig. 9 show what happens when the station only moves the integer fraction of the group size and the scan line distance in the x direction. κ is this score. Some devaluation will cause the previous scan line to be overwritten. These kappa values should not be used. The κ value to avoid this is defined by the equation GCD(NK)=1, where GCD indicates the maximum common denominator and N is the number of holes in a beamlet extinguisher for one channel (ie: in each pattern bundle) The number of small bundles, and 艮 is the score of the mobile to group size. The κ value is acceptable if the number of holes in the grid and the maximum common denominator of the κ value are equal to one. When a value of κ = 5 is used, the distance between the scan lines will also decrease with the same factor. Use Parallel Projection and select the appropriate value to determine the pixel size (at least in the other direction). However, one limitation is that this results in only a fixed set of pixel sizes. The factor κ links the deflection frequency to the table speed. Figure 65 illustrates a write strategy with a factor K = 1 at the top and K: 3 at the bottom. Figure 66 illustrates the possible threshold for a pattern bundle with 4 small bundles. An example of a grid of 49 holes (eg, a 7χ7 array) is provided in the table of Figure 1〇, which describes the pixel size (in nanometers) in the χ direction for several valid κ values, assuming beam spacing It is 61 nm (25 〇 / fill ratio for a given blood type pore size). For these parameters, the projection pitch p 24 201123254 will be 8.6 nm. The grid width for this geometry is WprQj=414 nm. Therefore, the bit box is capable of controlling the write strategy shift of +/_2〇7 nm. Figure 11 is a legend of an array of nine small bundles showing some definitions of terms used, including: beam spacing Pb, projection pitch Ppr <) j, grid width and tilt angle 〇 {array. Figure 63 is another example showing an array of four beamlets. Figure 57 shows a table of pixel size and grid width, depending on the number of beamlets (Npat_beains), array tilt angle (aarray), projected pitch (Pproj), and K factor for each pattern bundle. In order to reduce the amount of control data that needs to be generated and transmitted through the data path and increase throughput, large pixel sizes are desirable. However, the pixel size is limited by the desired CD and resist properties. In the table, it is assumed that the optimum pixel size (I-X) in the X direction is 3.5 nm, and the fourth row from the left side shows the calculated value of κ based on the projection pitch and the optimum pixel size. Given the number of beamlets per bundle, the closest acceptable K value is shown in the fifth row from the left. The sixth and seventh lines show

…定的每個圖型束的小束數目、陣列傾斜角、投射間距與K 因數所將造成的以奈米為單位的像素尺寸與網格寬度。 較高的Κ指出較快的偏轉掃描速度(相對於台移動),且 k成在X方向的較小像素。以固定的數據率,像素將在y 方向成為較大’使得像素形狀從大約方形改變為矩形。 小_束寫入策略铬,τ: 田小束疋方位為對⑨Ε〇 I縫的某個角度而能夠寫入非 重疊知描線。Ε0裂縫相關於偏轉方向的傾斜引起在乂方向 置差距如在圖i i所示。此位置差距可作修正。對於 每個】、束肖於移位的值是投射間距的倍數。在圖η中, S. 25 201123254 在頂部孔與中央孔之間的差距等於Wproj/2。此等值將造成 全像素移位分量與子像素移位分量。全像素移位分量較佳 為悤疋作補償,但是子像素分量僅當使用即時柵格化而可 作補償。 多工 定框(framing)、編碼及同步 為了降低系統成本,可使用一個光纖以控制多個(例 如:7x7 = 49個)熄滅器孔。在一個實施例中,透過各個光纖 所傳送的連續控制位元是用於控制小束熄滅器陣列的連續 熄滅器孔(即:用於控制一連串的小束在—個實施例中, 各個光纖包含對於49個子通道的通道傳送控制資訊,用於 在單圖型束上的49個小束的控制。此控制資訊可在被施加 到對於各個小束的熄滅器電極之前而先作緩衝,或是控制 資訊可在沒有緩衝的情況下而直接被施加。為此目的,可 在小束熄滅器陣列上提供緩衝器。具有交插/多工的子通道 之數據途徑的不意圖是顯示於圖55,且解多工方案的示意 圖是顯示於圖56’其❹列選擇器與行選擇器來將多= 子通道解碼以分開對於各個小束的個別控制位元。 為了同步且指出在控制資訊串流中的哪個位元為屬於 哪個小束’較佳為使用某種的^框,如在目12的實例所示。 在此實例中,使用框起始指示位元(在此實例^ 7個位元) 在小束熄滅器上的定框器將同步到其的循環圖型中。 當DC平衡序列需要用於光電二極體側的AC搞合光學 f射器與自動臨限調整,較佳為使用某種編碼。-個實例 疋例如8b/10b編媽。然而,此將造成較高的位元率,以 26 201123254 編碼將對於位元率增加2 $ %。 信號的定框與編碼亦 字組以標示框的起始。例如.使用特定編碼 :個糊载有對於若干個個別小束(例如:49個小束〕 視在訊將為以串列方式從數據途徑傳送到熄滅器。 視在熄滅器上的解多蛊 ‘‘“ 與同步實施而冑’可能需要補償 熄滅器時序偏移”,甘& m ^ ^ 、起因於媳滅器為歸因於串列數據 傳輸在不同時間接收董 + 據 m *η* 束的控制資訊。存在數個 可月b的小束同步選項。囡 項门步實鈀主要視在熄滅器上的實施 的可能性而定。 刃π跑 可用不同方式實行d、壶η本 仃J束冋步,例如:將所有小束同步 到一個同步信號、將在一 ^仃的所有小束同步、將在一列的 所有小束同步或是不將小& t, 』采冋步。對於具有排列在7x7陣 列之每個圖型束為49個小束的實施例,為了將所有小束同 =:個同步㈣’對於49個小束的控㈣據可作緩衝且 施加到用於小束切換的49個熄滅器電極各者。為了將 :一行的所有小束同步,對於在各行…通道的控制數 Π緩衝且同步施加到用於該行小束的7個媳滅二 :二了將在一列的所有小束同步,對於在各列的7個通 ::控制數據可作緩衝且同步施加到用於該列小束的7個 …t 何同步,所有49個小束的控制 電極。 4减益所接收而直接施加到熄滅器 對於行同步、列同步或無同步,個別小束像素時序將 27 201123254 為不同。當在小束間有時序差 方向移位而作補償。此移位將 移位是視列小束結合而定,補 為可能》 異,差異可藉由將像素在y 恆為在子像素範圍中。因為 償是僅當執行栅格化而 缝級(stitching) 因為場域為由多束所寫入,較佳為在不同束所寫入的 場域部分間使用縫綴。缝綴誤差(由—束所寫人的圖型㈣ 於相鄰束所寫入的圖型之移位)造成二個型式的微影誤差· 臨界尺寸(CD)誤差(在縫綴邊界的線為太厚或太薄读重疊 、差_於重疊誤差,典型為容許5 nm。縫綴方式是免除 CD誤差的方法,CD誤差是起因於縫綴誤差。可使用不= 的縫綴策略。此等策略是例如:無縫綴、不整齊邊緣 邊緣與智慧型邊界。 人 對於無縫辍策略,預期的是,除了束的良好對準以外, 並不需要特定手段。一束結束在其他束開始處^如果發生 失準,線將出現在劑量為太低或太高之處。束光點將在某 程度上使此效應平均。然而,無縫綴並非較佳。 不整齊邊緣缝綴策略是例如描述於美國專利公告第 2008/0073 5 88號,其整體為以參照方式而納入於本文。 對於軟邊緣策略,束寫入範圍將重疊。圖58B顯示其 說明軟邊緣策略的圖例。圖型是在二束寫入處的二端而(在 遞色之前)淡出此策略具有其誤差為散佈在一個區域的效 應,如在圖中的丨μιη軟邊緣所示。此策略的副效應是在於 某些像素可能被加倍寫入(即:用2〇〇%劑量)。因為相當大 28 201123254 的束尺寸’劑量將在數個像素間散佈。 智慧型邊界策略定義重疊寫入範圍,但是僅為讓一束 寫入此區域。58A顯示其說明智慧型邊界策略的圖例。 在圓示的實例中,使用100nm重疊寫入範圍,例如:具有 像素的25冑像素。在二個條帶或場域之間的邊界一^是 靠近此邊界的圖型數據特徵的臨界部分將作識別且置放到 個條帶或另一者中。此播点、太—伽/欠册 有τ此&成在一個條帶之間的實際寫入 邊界為移動以避免跨越特徵的臨界部分,使得臨界特徵將 怪為由單束所寫入。 軟邊緣縫綴策略是在二個邊界平滑淡出到下個條帶的 區域處。對於軟邊緣縫綴策略,可使用0 5 μηι的最大過掃 描長度。若發生5nm的縫綴誤差,此造成在5細兀線寬度 之區域中的1〇〇%劑量誤差。若縫綴重疊為i㈣,此ι二 劑量誤,少到1〇〇%χ 5 — 5%。可設定總劑量 誤差預异為3% ’且〇 5%劑量誤差是對來自此劑量誤差預算 的縫綴誤差所供給的合理預算。 *縫辍方法(軟邊緣或智慧型邊界)與過掃描長度可為每 個知描的選擇。降低過掃描長度將造成機器的較高產量。 使用者較佳為能夠選擇軟邊緣或智慧型邊界縫綴策略及 邊緣的尺寸。 所需數據途徑容量的降低 具有二個掃描的多次掃描使用造成微影機器以盆最大 容量的一主 /、取八 、一平而寫入。此寫入容量降低使得數據途徑所需的 硬體量為能夠顯著減少。...the number of beamlets, the tilt angle of the array, the pitch of the projections, and the K-factor to be the pixel size in nanometers and the width of the grid. A higher Κ indicates a faster deflection scan speed (relative to the table movement) and k is a smaller pixel in the X direction. At a fixed data rate, the pixels will become larger in the y direction, causing the pixel shape to change from approximately square to rectangular. Small _ bundle write strategy chrome, τ: Tian Xiao Shu 疋 orientation can be written to non-overlapping lines at a certain angle of 9 Ε〇 I seam. The Ε0 crack is related to the tilt of the yaw direction causing the gap in the 乂 direction as shown in Fig. i i . This location gap can be corrected. For each, the value of the shift is a multiple of the projected pitch. In Figure η, S. 25 201123254 The difference between the top hole and the center hole is equal to Wproj/2. This value will result in a full pixel shift component and a subpixel shift component. The full pixel shift component is preferably compensated for, but the subpixel component can only be compensated for using instant rasterization. Multiple Framing, Coding, and Synchronization To reduce system cost, you can use one fiber to control multiple (for example, 7x7 = 49) extinguisher holes. In one embodiment, the continuous control bits transmitted through the respective fibers are continuous extinguisher holes for controlling the beam blanker array (ie, for controlling a series of beamlets in an embodiment, each fiber comprises Channel transfer control information for 49 subchannels for 49 beamlet control on a single pattern bundle. This control information can be buffered before being applied to the extinguisher electrodes for each beamlet, or Control information can be applied directly without buffering. For this purpose, a buffer can be provided on the beamlet blanker array. The data path with interleaved/multiplexed subchannels is not intended to be shown in Figure 55. And the schematic diagram of the demultiplexing scheme is shown in Figure 56's array selector and row selector to decode the multiple = subchannels to separate the individual control bits for each beamlet. To synchronize and indicate the control information string Which bit in the stream belongs to which bundle 'is better to use some kind of ^ box, as shown in the example of item 12. In this example, use the box start indicator bit (in this example ^ 7 Bit The framing device on the beamlet extinguisher will be synchronized to its cycle pattern. When the DC balance sequence requires AC for the photodiode side to engage the optical ejector and automatic threshold adjustment, preferably Use some kind of encoding. - An example, such as 8b/10b. Mom, however, this will result in a higher bit rate, and the encoding of 26 201123254 will increase the bit rate by 2 $%. The signal is framed and coded. The group is marked with the beginning of the box. For example, use a specific code: a paste contains a number of individual beamlets (for example: 49 beamlets). The video will be transmitted in tandem from the data path to the extinguisher. The solution on the extinguisher is ''multiple'' with synchronous implementation and 胄 'may need to compensate for the extinguisher timing offset', Gan & m ^ ^, resulting from the annihilator due to serial data transmission at different times Receive the control information of Dong + according to the m * η * bundle. There are several small beam synchronization options for the month b. The threshold palladium is mainly determined by the possibility of implementation on the extinguisher. The way to implement d, pot η 本仃J bundle step, for example: synchronize all the small bundles to A synchronization signal, which will synchronize all the small beams in one column, synchronize all the small beams in one column or not small & t, 。. For each pattern bundle with arrays arranged in 7x7 For the 49 small beam embodiments, in order to synchronize all the small beams with =: one (four)' for 49 small bundles, the four (four) data can be buffered and applied to each of the 49 extinguisher electrodes for beamlet switching. Will: all the small bundles of one line are synchronized, for the control number in each line...the buffer is buffered and applied synchronously to the 7 annihilation two for the line beamlet: two will synchronize all the small beams in one column, for each The 7 passes of the column:: The control data can be buffered and applied synchronously to the 7...t synchronizations for the column beamlets, all 49 small beam control electrodes. 4 Debt received and applied directly to the extinguisher For line sync, column sync, or no sync, the individual beamlet pixel timing will be different for 27 201123254. It is compensated when there is a timing difference shift between the beamlets. This shift shifts the view depending on the beamlet combination, which is possible to make the difference in y constant in the sub-pixel range. Since the compensation is only when performing rasterization and stitching because the field is written by multiple beams, it is preferable to use stitching between the fields of the field written by the different beams. The stitching error (the pattern of the person written by the beam (4) shifting the pattern written by the adjacent beam) causes two types of lithography errors. The critical dimension (CD) error (the line at the seam boundary) Too thick or too thin to read overlap, difference _ overlap error, typically 5 nm allowed. The stitching method is to eliminate the CD error, CD error is caused by the stitching error. You can use the stitching strategy of not =. Other strategies are, for example, seamless splicing, irregular edge edges and intelligent boundaries. For a seamless 辍 strategy, it is expected that no specific means are required other than the good alignment of the bundle. If there is a misalignment, the line will appear at a dose that is too low or too high. The beam spot will average this effect to some extent. However, seamless splicing is not preferred. It is described in U.S. Patent Publication No. 2008/0073, the entire disclosure of which is incorporated herein by reference. At the two ends of the two-beam write (Before dithering) Fade out this strategy with the effect that its error is spread over a region, as shown in the soft edge of 丨μιη in the figure. The side effect of this strategy is that some pixels may be doubled (ie: Use a 2% dose). Because the beam size of the 201123254 is quite large, the dose will spread between several pixels. The smart boundary strategy defines the overlapping write range, but only writes a bundle to this area. 58A shows it A legend illustrating a smart boundary strategy. In the example of a circle, a 100 nm overlap write range is used, for example: 25 pixels with pixels. The boundary between two strips or fields is close to this boundary. The critical portion of the pattern data feature will be identified and placed in a strip or the other. This pod, the _ gamma / owed book has τ this & into the actual write boundary between a strip To move to avoid crossing the critical part of the feature, the critical feature will be weird to be written by a single bundle. The soft edge stitching strategy is to smooth out the two borders to the area of the next strip. For the soft edge stitching strategy Can make The maximum overscan length of 0 5 μηι. If a 5 nm stitching error occurs, this causes a 1% dose error in the area of the 5 fine line width. If the seam overlap is i (four), this ι dose is wrong, less To 1〇〇%χ 5 — 5%. The total dose error can be set to 3%' and the 5% 5% dose error is a reasonable budget for the error of the stitching from this dose error budget. Soft edge or smart border) and overscan length can be chosen for each description. Lowering the scan length will result in higher machine yield. Users are better able to choose soft edge or smart border stitching strategy and edge The size of the required data path is reduced. The use of multiple scans with two scans causes the lithography machine to write a master/, eight, and a flat with the maximum capacity of the basin. This reduction in write capacity allows the amount of hardware required for the data path to be significantly reduced.

29 201123254 -個通道是在數據途徑中的一個工作單元。一個通道 能夠在掃描期間寫人-個條帶。即時處理中所涉及的數據 途徑的元件為:快速記憶體、處理單元、雷射、光纖與媳 滅器。因為僅# 50%通道為現用於叫時描,處理單元的 數目可能為以大約相同的因數所減少。 同時争流較少個通道之處理單元減少具有下述優點: 每個通道所需的較少個邏輯格(,、每個通道節點所需在 快速記憶體頻寬上的硬性限制及所需的快速記憶體儲存尺 寸的可能降低。減少處理單元的數目亦具有缺點:須有一 種方式以對於適s通道連接處理單元與雷Μ,且新限制可 能讓掃描失效,尤其如果發生大量後繼(叢集)通道誤差。 在以下敘述中,運用節點的概念。一個節點具有連接 的Υ個(光學)通道且具有可用的χ個處理單元。圖13顯示 對於此類節點的一個模型。可商購的電氣至光學(Ε/0)轉換 器是典型為含有12個通道(即:γ=12)。Ε/0轉換器(例如: 雷射二極體)將來自處理單元的電氣控制數據轉換為透過光 纖而傳送到微影機器的熄滅器的光學數據。將Ε/〇轉換器 驅動的處理單元(例如:現場可程式閘陣列(Fp(3A,field programmable gate array))含有χ個通道。可使用χ*γ交點 來將任一個處理單元切換到任一個Ε/〇轉換器。χ*γ交點 是單獨的裝置或是整合在處理單元中。用交點,可能將任 —個處理單元輸出(X)路由到任一個數據途徑輸出(γ)。 假使一些光學通道失效,首先對於在第一次與第二次 掃描之間的移位的可能性必須作確定,其中所有條帶位置 30 201123254 為由至少—個適當工作的通道所涵蓋。當可能移位位置為 已知’確定是否可用的處理單元為在掃描間分配且涵蓋 100%的條帶。 在圖14令,以概念圖顯示每個掃描的通道位置。如在 圖14所示的條帶(籃色)是以此特定組合的通道誤差與二個 個別移位值所寫入。區別重疊與非重疊的通道位置是重要 的對於在重疊的通道位置將正確寫入的條帶,在此位置 對於一個掃描的工作通道必須為可用。對於非重疊的通道 位置,在第一次與第二次掃描之間的晶圓移位將造成二個 區域,其中僅可能用一個特定掃描來寫入條帶。在此區域 中的失效通道將中斷良好通道的序列。最左側的通道誤差 (參閱:在圖中指向其的紅色箭頭)強制條帶在其右側開始。 在左側,無法使用通道。典型而言,使用移位以使得重疊 ^域成為無誤差(使用二個掃描),且可使用在重疊區域中的 一些通道以達到將寫入之所需數量的條帶。 相較於不能寫入在重疊區域中的位置的可能性,不能 ::在非重疊區財的位置的可能性是相對較高。因此, =而言’在非重疊區域…良好通道,,序列是短的。 因此,使肖12870個通道在二個掃描令涵蓋⑽ 將是困難的,因為太過於取決在非重 条帶 的相當大序列的可用性。使用13 、、义好通道 涵蓋_個條帶將較容易許多,因:m掃描中 在非重疊區域。實際上,很重不疋太取決 的完整序列。 此疋在重疊區域中找到條帶 Γ: 31 201123254 當將處理單元的鉍 ,尚者銘仿夕从.數目減少時引入新的拘束。除了找到 適一移位之外,必彡g 4Χ» ^ 單元到通道的成功分配二對於二次與第二次掃描之處理 ^ , 在圖15中,顯示此舉的一個實例。29 201123254 - A channel is a unit of work in the data path. One channel is capable of writing a person-strip during the scan. The components of the data path involved in real-time processing are: fast memory, processing unit, laser, fiber optic and annihilator. Since only the #50% channel is now used for call time, the number of processing units may be reduced by approximately the same factor. At the same time, the processing unit that contends for fewer channels has the following advantages: fewer logical cells required per channel (,, each channel node requires a hard limit on the fast memory bandwidth and the required The possibility of fast memory storage size is reduced. Reducing the number of processing units also has the disadvantage that there must be a way to connect the processing unit to the thunder for the appropriate s channel, and new restrictions may invalidate the scan, especially if a large number of successors (cluster) occur. Channel error. In the following description, the concept of a node is used. A node has one (optical) channel connected and has one processing unit available. Figure 13 shows a model for such a node. Commercially available electrical to An optical (Ε/0) converter typically contains 12 channels (ie, γ = 12). A Ε/0 converter (eg, a laser diode) converts electrical control data from the processing unit into an optical fiber. Optical data transmitted to the extinguisher of the lithography machine. The processing unit driven by the Ε/〇 converter (eg, field programmable gate array (Fp (3A, field programmable gate arr) Ay)) contains one channel. You can use χ*γ intersection to switch any processing unit to any Ε/〇 converter. χ*γ intersection is a separate device or integrated in the processing unit. Routing any of the processing unit outputs (X) to any of the data path outputs (γ). In case some optical channels fail, the probability of shifting between the first and second scans must first be determined, Where all strip positions 30 201123254 are covered by at least one properly working channel. When the possible shift position is known 'determine whether the available processing unit is allocated between scans and covers 100% of the strips. Let's show the position of each scan channel in a conceptual diagram. The strip (basket) shown in Figure 14 is written with the channel error of this particular combination and two individual shift values. The difference overlaps and non-overlaps. The channel position is important for strips that will be correctly written at overlapping channel locations where they must be available for a scanned working channel. For non-overlapping channel positions, first and second The wafer shift between scans will result in two regions where only one specific scan can be used to write the strip. The failed channel in this region will interrupt the sequence of good channels. The leftmost channel error (see: The red arrow pointing to it in the figure) forces the strip to start on its right side. On the left side, the channel cannot be used. Typically, the shift is used so that the overlap field becomes error-free (using two scans) and can be used for overlap Some of the channels in the area to reach the required number of stripes to be written. Compared to the possibility of not being able to write to locations in the overlapping area, the possibility of not:: the position of the non-overlapping area is relatively High. Therefore, = in terms of 'non-overlapping areas... good channel, the sequence is short. Therefore, making Shaw 12870 channels covered in two scan orders (10) would be difficult because it depends too much on non-heavy strips The availability of a fairly large sequence. It is much easier to use _ strips to cover _ strips, because: m scans are in non-overlapping areas. In fact, it is very important to rely on the complete sequence. This 找到 finds a strip in the overlap area 31: 31 201123254 When the processing unit is 铋, the singer introduces a new constraint when the number is reduced. In addition to finding a suitable shift, a successful assignment of the unit to the channel must be performed for the processing of the second and second scans. In Figure 15, an example of this is shown.

對於此實例,假設其瞢 貝1 J 白时出诵、蓄成士齡 道與3個處理單元的節點。 白點才日出通道為切斷, 八^ A 而黑點指出其為使用且處理單元為 分配的通道。红A 4*今 & ,s己號指出通道誤差。可驗證的是, 並無郎點运反對於特定檫 處理單元的限制。 具有作用在節點的最大三個 圖16顯不相較於對於非重疊區域的通道而使用較少個 處理單元的結果。此 〜 _ π貝不艮好通道的最大序列是以對於 一節點的每五個通道反—如点 η 、 遣為二個處理早元的限制所得到。最大 長度等於每個節點的處理單元數目之二倍。對於其他移位 值(圖16的移位是理想者),在非重墨區域中的有用序列將 實質為較小(查看切位為增加丨時所發生者)。因此,在非 重疊區域中的通道甚至是不如先前(未考慮減少處理單元量) 為有用。 除了在非重疊區域中的通道之較差利用外,基於相同 限制的另一個弱點是在重疊區域中發生。在重疊區域中, 減少母個節點的處理單元數目是轉變為對於誤差序列(誤差 叢集)的靈敏度。對於每個12個通道的節點為7個處理單元 之’處理單元數目的二倍加上一的叢集將造成失敗的 乃配。假使叢集是映射在單一個節點上,分配將對於處理 單7L尺寸加上一的叢集為失敗。每當掌控叢集為實際瓶頸 ^仍存在將郎點尺寸按比例放大(例如:24個通道與14 32 201123254 個處理單元)的可能性。此舉將降低對於大叢集的靈敏度。 重要的是,系、統是對於高達某個階層#通道誤差為強健。 此外,如果發生減少處理單元,對於通道誤差的強健度是 維持在合理的階層。 對於冗餘掃描概念的關鍵參數是條帶的數目、通道的 數目、誤差通道的預期數目、誤差叢集的預期尺寸、每個 節點的通道數目以及每個節點的處理單元數目。在識別通 道誤差之後,系統將找出其造成長度為等於或大於需要條 帶數目的“良好”序列之可能移位組合。“良好”序列是 由在非重疊區域中的“良好,,通道位置或在其至少一個通 道為“良好,,處的重疊區域中的位置所植成。此過程將造 成移位的清單與“良好,,區域的起始與尺寸。 如果發生在通道與處理單元之間的一對一關係(即:無 在數據途徑谷置的減小),成功的晶圓移位 單元為少於通道,成功的分配是附力:。 一者所® Λ d 通道而由二個掃描的 赖可田I.寺,分配是成功的。按照掃描,節點不能分配 較可用者為多的處理單元。 通言可配f略是先分配其必須寫入某些條帶位置的 掃描的重疊區域中的位置,的t置與在-個 位置。假K壬^τ… ' 他掃描中的誤差 假使任何郎點需要較可 企圖將失敗。 勹夕妁處理早π,分配 從一側開始,分县褅彳1她 刀配疋通過條帶位置而反覆進行。處理 33 201123254 !:=將最早離開範圍的節點而作分配。假使此類的 凡王刀配’來自其他掃描的節點應分配處理單元以 :i假使任何即點需要較可用者為多的處理單元, 分配企圖將失敗。可使用苴 ^ , 使用其他滚略,其給予較佳的結 找到在假使A前被拒絕時的分配可能性。 刀配方案失敗的典型理由是在非重疊區域中的失敗的 限制、沒有備用的虛揮留_ $備用的處理早疋、以及大的誤差叢集。結合在 置的誤差通道之特定移位值經常造成失敗的分配。 對於兩次掃描’備用的處理單元是超過節點應供應的通道 數目半數的處理單%,例如:每個節點為} 2個通道與6個 處理單元的組態不具有任何備用的處理單元。 大誤差叢集最後將耗盡在特定節點的處理單元數目。 叢集的影響是重度取決於其位置,因為確定是否一或二個 節點應分配處理單元以寫入誤差位置。對於每個12個通道 的節點為7個處理單元,一個節點最多可吸收7個誤差, 而二個節點最多可吸收14個誤差。 …圖17至23是曲線圓’其說明為了確定關於微影機器 令里而改變數據途徑容量的效應之模擬實驗結果。曲線圖 是顯示從5G個實驗當中的成功數目。成功是意指已經找到 一個成功的移位與分配。因為許多模擬是關於改變單一個 參數,除非是另作指日月,定義其為使用的—個預設參數集: :帶的數目=13000;通道的數目=1313〇;每個節點的處理 單元數目=7’及,每個節點的通道數目=12。 12個通道使用7個處理單元的節點是稱為組態。 34 201123254 在圖17中’顯示每個節點為不同數目個處理單元的效應, 假设無大的誤差叢集(僅有小的自然叢集情形八丨2/6組態是 視為降低的下限’因為每12個通道為5個處理單元的組態 將總疋失敗。12/12組態是實際為沒有減少任何處理節點的 組態;其成功僅是取決於找到一個成功移位(無分配限制)。 模擬結果顯示,相較於12/12組態,對於12/6與12/7組態 的強健度將稍微減小^ 圖18是針對於如同在圖17中的相同組態的誤差叢集 的效應。12/6組態是對於尺寸5的誤差叢集為特別靈敏, 尺寸5的誤差叢集是由於缺乏在節點中的備用處理單元所 引起。在關鍵位置的一個誤差將引起執行的失敗。12/7與 1 2/12組態並未顯示對於尺寸5的叢集之特別靈敏度。 改變通道數目的效應是顯示在圖19中。如果發生減少 處理單元數目,非重疊的區域是幾乎為無用。此解釋對於 使用13000個通道的不佳結果。具有較多個通道的組態將 給予較多個具有良好”序列的移位機會,主要因為較寬 的重疊區域。模擬實驗顯示,具有2〇〇個誤差的1313〇個 通道將造成平均26個成功的移位,而13260個通道將對於 相同數目個誤差而造成平均41個成功的移位。使用13〇〇〇 個通道僅供平均1 4個成功的移位。對於典型1 2 / 7組態, 增加通道數目乃增大強健度。 圖20顯示當先前模擬是以誤差叢集為5的效應所擴大 的結果。並未觀察出結合改變通道數目的任何顯著效應。 如稍早所述,強健度在當將處理單元數目從12個減少For this example, assume that the mussels 1 J are white, and the nodes of the Steward Road and the three processing units are stored. The white point is the sunrise channel, and the black point indicates that it is the channel used for processing and the processing unit is assigned. Red A 4* Today & s has a channel error. It is verifiable that there is no limit to the specific processing unit. Having a maximum of three acting on a node Figure 16 does not appear to be the result of using fewer processing units for channels in non-overlapping regions. This ~ _ π 艮 艮 通道 通道 通道 通道 通道 通道 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大 最大The maximum length is equal to twice the number of processing units per node. For other shift values (the shift in Figure 16 is ideal), the useful sequence in the non-re-ink region will be substantially smaller (the one that occurs when the cut-off is increased). Therefore, the channels in the non-overlapping regions are even less useful than previously (without considering reducing the amount of processing units). In addition to the poor utilization of channels in non-overlapping regions, another weakness based on the same constraints occurs in overlapping regions. In the overlap region, reducing the number of processing units of the parent node is a change to sensitivity to the error sequence (error cluster). For each of the 12 channel nodes, the number of processing units of the 7 processing units plus twice the number of processing units will result in a failure. If the cluster is mapped on a single node, the allocation will fail for processing the cluster with a single 7L size plus one. Whenever the cluster is controlled as the actual bottleneck ^ There is still the possibility of scaling up the lang point size (for example: 24 channels with 14 32 201123254 processing units). This will reduce the sensitivity to large clusters. What's important is that the system is strong for a certain level of channel error. In addition, if a reduction processing unit occurs, the robustness to the channel error is maintained at a reasonable level. The key parameters for the redundant scanning concept are the number of stripes, the number of channels, the expected number of error channels, the expected size of the error cluster, the number of channels per node, and the number of processing units per node. After identifying the channel error, the system will find a possible shift combination that results in a "good" sequence of length equal to or greater than the number of required strips. A "good" sequence is formed by "good," in a non-overlapping region, or at a location in which at least one of the channels is "good," in the overlapping region. This process will result in a list of shifts with "good, start and size of the region. If a one-to-one relationship between the channel and the processing unit occurs (ie: no reduction in the data path valley), successful The wafer shifting unit is less than the channel, and the successful assignment is the attached force: one of the ® Λ d channels and the two scans of the Lai Ketian I. Temple, the assignment is successful. According to the scan, the nodes cannot be assigned There are many processing units available. Generally speaking, f can be assigned to the position in the overlapping area of the scan which must be written to some strip positions, and t is set at - position. False K壬^τ... The error in his scan is that if any of the points need to be tried, it will fail. The 勹 妁 妁 早 , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , !:= will be assigned as the earliest node leaving the range. If such a king knife is equipped with 'other nodes from other scans, the processing unit should be assigned to: i. If any point needs more processing units than the available ones, the allocation attempt Will fail. You can use 苴^ to make With other scrolling, it gives a better knot to find the probability of allocation when it is rejected before A. The typical reason for the failure of the knife-matching scheme is the limitation of failure in the non-overlapping region, and there is no standby virtual singer _ $ Alternate processing early, and large error clusters. The specific shift value combined with the set error channel often results in a failed allocation. For two scans, the 'alternative processing unit is a processing order that exceeds half the number of channels that the node should supply. %, for example: each node is configured with 2 channels and 6 processing units without any spare processing units. The large error cluster will eventually exhaust the number of processing units at a particular node. The impact of the cluster is heavily dependent Its position, because it is determined whether one or two nodes should allocate processing units to write the error position. For each 12 channel nodes, there are 7 processing units, one node can absorb up to 7 errors, and the two nodes can Absorb 14 errors. ... Figures 17 to 23 are curve circles' which illustrate the simulation of the effect of changing the data path capacity in order to determine the lithography machine order. The result is a graph showing the number of successes from 5G experiments. Success means that a successful shift and assignment has been found. Because many simulations are about changing a single parameter, unless otherwise referred to as the sun and the moon, define it as The preset parameter set used: : the number of bands = 13000; the number of channels = 1331〇; the number of processing units per node = 7' and the number of channels per node = 12. 12 channels use 7 The node of the processing unit is called the configuration. 34 201123254 In Figure 17 'shows the effect of each node as a different number of processing units, assuming no large error clusters (only small natural clustering situations, gossip 2/6 groups) The state is considered to be the lower limit of the reduction' because the configuration of 5 processing units per 12 channels will fail. The 12/12 configuration is actually a configuration without any processing nodes being reduced; its success is only dependent on finding A successful shift (no allocation limit). The simulation results show that the robustness for the 12/6 and 12/7 configurations will be slightly reduced compared to the 12/12 configuration. Figure 18 is the effect of the error cluster for the same configuration as in Figure 17. . The 12/6 configuration is particularly sensitive to error clusters of size 5, and the error cluster of size 5 is due to the lack of spare processing units in the nodes. An error at a critical location will cause an execution failure. The 12/7 and 1 2/12 configurations do not show the special sensitivity for clusters of size 5. The effect of changing the number of channels is shown in Figure 19. Non-overlapping areas are almost useless if there is a reduction in the number of processing units. This explanation is a poor result for using 13,000 channels. Configurations with more channels will give more opportunities for shifts with a good "sequence", mainly because of the wider overlap area. Simulation experiments show that 1313 channels with 2 errors will result in an average of 26 A successful shift, while 13260 channels will result in an average of 41 successful shifts for the same number of errors. Using 13 channels is only for an average of 14 successful shifts. For a typical 1 2 / 7 group State, increasing the number of channels increases the robustness. Figure 20 shows the results of the previous simulation with the effect of an error cluster of 5. No significant effects of combining the number of changed channels were observed. As mentioned earlier, robust Degree when reducing the number of processing units from 12

35 201123254 到7個而減小,且增加通道數目將改良強健度。圖η呈現 當藉由使用較多個通道以嘗試補償其歸因於處理單元減少 的強健度損失之結果。如可看出,當將組態從i2/i2改變到 12:7的強健度損失可藉由將通道數目增加僅約1% (例如: 將通道數目從1313〇個增加到1328〇個)所補償。 注意,在模擬中使用的叢集均為其似乎為最差條件之 特疋尺寸@ |叢集。其他叢集策略傾向提供較正面的 結果。ffl 22顯示三種策略的比較:僅為注入單一個叢集; 在固定距離注入儘可能多個類似的叢集(從起始位置到起始 位置為65個);及,在隨機位置注入儘可能多個類似的叢集 (然而在其間維持2(M固良好通道的最小距離)。注意,誤差 叢集之間的固定距離產生許多相關性且將造成大量的成功 移位。 當減少處理單元數目,大於尺寸為5的叢集將具有在 強健度上的嚴重影響。此可在圖23所看出,其中,在具有 叢集尺寸5的12/07 (12/07@5)與具有叢集尺寸8的^則 (12/07@8)之間的強健度差異是顯而易見的。 若大於5的誤差叢集更頻繁發生,可使用替代方式來 結合減少處理單元數目以減小叢集靈敏度。增大節點尺寸 且使用可相比的比率(諸如:24/14組態)是一個此類的替代 者。此舉的效應可在圖23所看出,其顯示相較於12/〇7@8 組態之使用24/14@8組態的較大強健度。 其他替代者是將通道隨機排列遍及於節點、或將通道 系統式在節點間廣泛分佈。此等者將造成—個誤差叢集, 36 201123254 其對應於多個不同節點而非為集中在一或二個節點。在此 組態中,寫入叢集誤差的所有鏡射位置將不會是丨或2個 節點而是其多者的任務。然而,將通道隨機排列或是散佈 可能具有其他負面的副作用,因為相鄰者(且潛在共用資訊) 的概念消失。 分配策略最佳化:除了檢查分配限制外,分配功能性 的重要任務可能是使掃描間的縫綴數目為最小化。 可從以上模擬所得到的結論是如下。減少每個節點的 ^理單元數目可顯著減少硬體的量。減少每個節點的處理 单兀數目將稍微減小強健度。對於兩次掃描,5q% (例如: 12/6組態)是用於減少每個節點的處理單元數目的下限。接 近50%的組態是對於小叢集的誤差(尺寸=5)為特別靈敏。 12/6組態因此不如12/7組態為佳,12/7組態未顯示此 度。12/7組態似乎為對於每12個通道的處理單元數目的合 理下限。為了良好強健度,通道數目較佳為大於條帶數目 (+1 %)。增加通道數目顯荖古 的虑拽H 度。因為減少每個節點 的處理单7G數目之強储_招 失可藉由使用附加1%通道而易 、只。大的誤差叢集(>5)將戲劇性地減小強健产。 數據途徑要纟 建& 在圖24令的流程圖是 的概觀與其相依性。瞭Γ 所涉及的處理 瞭解相依性允許(就持續時間而认 且揭示對於平行執行以提高產量。:要) 掃描時而作處理且/或裁人到讀。 執…個35 201123254 Reduces to 7 and increases the number of channels to improve robustness. Figure η presents the result of a reduction in robustness due to reduced processing units due to the use of more channels. As can be seen, the loss of robustness when changing the configuration from i2/i2 to 12:7 can be achieved by increasing the number of channels by only about 1% (for example: increasing the number of channels from 1313 to 1328). make up. Note that the clusters used in the simulation are the feature size @ | clusters that appear to be the worst condition. Other clustering strategies tend to provide more positive results. Ffl 22 shows a comparison of the three strategies: only injecting a single cluster; injecting as many similar clusters as possible at a fixed distance (65 from the starting position to the starting position); and, injecting as many as possible at random locations A similar cluster (however maintains 2 (M is the minimum distance for a good channel). Note that the fixed distance between the error clusters produces a lot of correlation and will cause a large number of successful shifts. When reducing the number of processing units, the size is greater than The cluster of 5 will have a severe impact on robustness. This can be seen in Figure 23, where 12/07 (12/07@5) with cluster size 5 and ^ with cluster size 8 (12) The difference in robustness between /07@8) is obvious. If error clusters greater than 5 occur more frequently, alternative methods can be used in combination to reduce the number of processing units to reduce cluster sensitivity. Increase node size and use comparable The ratio (such as 24/14 configuration) is one such alternative. The effect of this can be seen in Figure 23, which shows the use of 24/14@ compared to the 12/〇7@8 configuration. 8 The greater robustness of the configuration. Other alternatives The channel is randomly arranged throughout the node, or the channel system is widely distributed among the nodes. These will result in an error cluster, 36 201123254 which corresponds to multiple different nodes instead of being concentrated in one or two nodes. In this configuration, all mirrored positions written to the cluster error will not be 丨 or 2 nodes but the task of many of them. However, randomly arranging or spreading the channels may have other negative side effects because The concept of neighbors (and potentially shared information) disappears. Allocation strategy optimization: In addition to checking allocation constraints, an important task of assigning functionality may be to minimize the number of stitches between scans. The conclusion is as follows. Reducing the number of processing units per node can significantly reduce the amount of hardware. Decreasing the number of processing units per node will slightly reduce the robustness. For two scans, 5q% (for example: 12/ 6 configuration) is the lower limit for reducing the number of processing units per node. Nearly 50% of the configuration is particularly sensitive to small cluster errors (size = 5). 12/6 configuration is therefore not 12/7 configuration is preferred, 12/7 configuration does not show this. The 12/7 configuration seems to be a reasonable lower limit for the number of processing units per 12 channels. For good robustness, the number of channels is preferably greater than The number of bands (+1%). Increasing the number of channels is an obvious measure of H. Because the number of 7Gs for processing each node is reduced, the loss can be easily achieved by using an additional 1% channel. The error cluster (>5) will dramatically reduce the robust production. The data path is to be built & the flow chart in Figure 24 is an overview of its dependence. Γ The processing involved understands the dependence (allows) It is time to recognize and reveal for parallel execution to increase production.: To be processed while scanning and/or to cut to read. Hold one

37 201123254 不同的相依性與其不同的機率或限制可能發生於不同 ♦、構。舉例來說’在處理E1 (晶圓測量與定位)與ci (線内 =理及/或載入數據以供初次掃描到_)之間的相依性。 广於架構選項A (離線處理),此相依性不存在。對於選項 ,此相依性可能存纟’然而對於即時栅格化,此相依性將 存在(小東與掃描線的即時結合^ 關於處理的典型性能要求:從飼服器下載新的圖型到 2器節點的局部儲存器<6G分鐘;要儲存在串流器節點的 :二存器中的圖型數目〉⑴;機器是歸因於載入新影像 :離線的時間<60秒;若每個晶圓將實行栅格化—次,在將 二正參數更新與其為備妥以寫人之間的最大時間是36秒(6 刀鐘的10%);及,掃描曝光期間為<3分鐘。 時序與同步 時脈與同步信號可透過光纖而分佈到其他子系統(諸 離也與晶圓台)。此具有優點為在子系統間的電流隔 ,與對於電磁影響的不靈敏度。可使用時脈變化以改變劑 =然而’由於可藉由改變像素尺寸來補償劑量變化,較 :避免時脈變化以簡化其負責將數據傳送到媳滅器之數 ,的實際部分的實施且排除在時脈頻率改變之後要重 新同步所需要的時間。 :用固定時脈率的優點是在於時脈不再需要被分佈到 =途徑的不同構件之間。利用(在FPG“側的)標準相位 2迴路叫),在局部時脈頻率的變化可作補償。當需要 乂的變化(諸如:纖),需要特別準備為能夠使得數據 38 201123254 途控子系統同步。 數據途徑較佳操作為對於整個微影系統的時脈主控器 且將時序與同步信號提供到其他子系統,諸如:電子光學 柱(偏轉器)與晶圓定位系統。 修正 在上述的帶電粒子微影機器的實施例中,不具有内建 到微影機器的任何設施來調整個別電子小束以修正在小束 位置、尺寸、電流、或其他束特徵的誤差。微影機器省略 修正的透鏡或電路來作成對於小束的個別修正,以避免在 納入附加構件到電子光學柱來作成實際的束修正所涉及的 附加複雜度與成本,且避免由於納入此類附加構件所需要 在柱尺寸的増大。 =因此,為了修正在小束位置、尺寸、電流等等變化的 調整是#由作成對於由㈣途徑所提供的控制信號的修正 調整而作《。對於種種理由所需要,作成數個型<的修正。 此等修正包括修正以補償: ^ •在小束位置的變化。歸因於在柱之製造中的變化, 諸如·在孔隙陣列或小束熄滅器陣列中的孔的確實定位與 的變化、或由聚光透鏡或投射透鏡或偏轉電極 ::場:度的差異,小束可能失準。此類失準可用“圖型 移位來修正。 M 土 ㈣此等者可能造成整個晶圓場域為在 及/或y方向移位。此型式的 來修正。 域移位亦可用“®型移位,’37 201123254 Different dependencies and their different probabilities or limitations may occur in different configurations. For example, the dependency between processing E1 (wafer measurement and positioning) and ci (in-line = rational and / or loading data for initial scanning to _). Broader than architecture option A (offline processing), this dependency does not exist. For options, this dependency may be a problem. 'However, for immediate rasterization, this dependency will exist (the instant combination of Xiaodong and the scan line ^. Typical performance requirements for processing: downloading a new pattern from the feeder to 2 The local storage of the node is <6G minutes; the number of patterns to be stored in the streamer node: 2); (1); the machine is attributed to loading a new image: offline time < 60 seconds; Each wafer will be rasterized—the maximum time between the update of the two positive parameters and the ready to write is 36 seconds (10% of 6 knives); and, during the scan exposure, < 3 minutes. Timing and Synchronous Clock and Synchronization Signals can be distributed through fiber to other subsystems (also from the wafer table). This has the advantage of current isolation between subsystems and insensitivity to electromagnetic effects. Clock changes can be used to change the agent = however 'because the dose variation can be compensated by changing the pixel size, rather than: avoiding clock changes to simplify the implementation of the actual part of the number responsible for transferring data to the quencher and eliminating Be heavy after the clock frequency changes The time required for synchronization: The advantage of using a fixed clock rate is that the clock no longer needs to be distributed between different components of the = path. Use (in the FPG "side" standard phase 2 loop), in local time Variations in pulse frequency can be compensated for. When a change in ( (such as fiber) is required, special preparation is required to enable synchronization of data 38 201123254. The data path is preferably operated as a clock master for the entire lithography system. And providing timing and synchronization signals to other subsystems, such as: an electronic optical column (deflector) and a wafer positioning system. Corrected in the embodiment of the charged particle lithography machine described above, there is no built-in to lithography machine Any facility to adjust individual electron beamlets to correct errors in beam position, size, current, or other beam characteristics. The lithography machine omits modified lenses or circuits to make individual corrections for beamlets to avoid Component to the electro-optical column to create additional complexity and cost involved in actual beam correction, and avoid the need to incorporate such additional components The size of the column is large. = Therefore, in order to correct the change in the beam position, size, current, etc., the adjustment is made by the correction of the control signal provided by the (4) route. For various reasons, it is necessary to create Several types of corrections. These corrections include corrections to compensate for: ^ • changes in beamlet position. Due to variations in the manufacture of the column, such as holes in the array of pores or beamlet extinguishers The exact positioning and variation, or by the condenser lens or projection lens or deflection electrode:: field: degree difference, small beam may be misaligned. Such misalignment can be corrected by "pattern shift. M soil (4) These may cause the entire wafer field to shift in the / and / or y directions. This type is fixed. Domain shifts can also be used with "® shifts,"

39 201123254 •在數據途徑的延遲誤差(例如:由數據途徑中的光纖 長度的差異所引起)。此誤差可藉由在y方向移位來修正。 •熄、滅器時序偏移。由於將小束控制信號多工傳輸, 多個小束是透過一個通道所控制且小束控制信號是串列式 接收,即:對於不同小束的控制信號是由小束熄滅器陣列 在不同時間所接收。視熄滅器設計而定,對於將小束接通 及切斷(例如:小束可為以列或行的單位所切換)或個別小束 將會遭受不同的偏移。視實現控制位元(小束被切換)的策略 而定,此可能造成特定小束為相較於另一個小束而在稍後 ㈣間所切換。此誤差的效應是在子像素範圍+。結果是 每個小束的偏移。 •在小束媳滅器陣列孔位置的變化。各個小束通過在 小束媳滅器陣列的孔且為由在該孔的熄滅器電極所切換。 在小束熄滅器陣列之製造中的變化可能造成在孔且因此為 對應小束的位置於乂與y方向二者的機械偏移,當相較於 參考位置。此誤差的效應是典型為多個像素,且結果是每 個小束的偏移。此誤差的全像素(整數)部分將典型在執行時 作補償。其餘的子料(分數)部分可由即時栅格化作補償。 •在偏轉強度的變化。此等者可能是歸因於小束偏轉 ㈣電氣偏轉場強度的空間差異,此必須對於“圖型定炉 (SCaling)”、“劑量修正,,作修正。亦可能具有在偏轉差; 的小束偏移分量,其可由“圖型移位,,作修正。 為用於將小束熄減 ,有效的劑量率將 器 •在控制信號脈衝期間的變化。因 陣列電極接通及切斷的不同時序行為 40 201123254 在J束間為不同。當未將控制信號多工傳輪時,此效應是 顯著的(例如:1G%)。就對於在—個通道中的49個小束的 控制信號多丄傳輸而論,其顯著性是因為轉變效應相同而 降低,但最小脈衝寬度是相較於未多工傳輸情形而為較大 49倍(假設10%/49=0.2%)。甚者,此誤差是取決於劑量率。 誤差將對於100%劑量率寫入為小,而誤差是在以5〇%劑量 率寫入為最、大。 總體圖型銘付 當在晶圓上寫入圖型時,寫入圖型的小束不太可能是 句為凡王對準。為了修正此失準且致使小束能夠寫入對準 條帶’調整圖型數據以補償失準誤差。此調整可使用軟體 或硬體來作&,且可在圖型㈣的處理期間的不同階段所 進行。舉例來說,可㈣向量格式、#以多階灰階格式、 或以一階B/W位το映射(bitmap)的圖型數據而作出修正。 偏移可能發生在X方向(台移動方向)或y方向(小束掃 描偏轉方向)或是二者。偏移可能以全像素移位及/或子像素 移位而發生。全像素移位可藉由在栅格化後移位若干個像 素所達成。子像素移位可作為部分的栅格化過程而達成。 總體圖型移位(即:在一個通道中的所有小束的移位) 可用於(在X與y方向的)條帶位置修正及(在\與y方向的) 場域位置修正。對於條帶位置修正的χ與y圖型移位的實 例疋顯不於圖25。在圖左,條帶是顯示具有其重疊在意圖 位置的期望圖型。在圖右,條帶是顯示具有重疊為其若未 作出修正而將寫入的圖型。如可看出,需要總體圖型移位 41 201123254 以使通道的所有小束來寫入在移位向上且到左側的位置。 典型為在校準之後而頻繁進行(每個晶圓或場域—次 束移位。可假設小束為完全對準關於在同個通道中的其= 小束,使得在通道中的所有小束得到相同的圖型偏移。他 對於圖型移位的典型要求是對於總體移位的每個通首 的個別X肖Y移位設定,纟參數是每個場_為更新—次= 典型的最大移位範圍可為+2〇〇 nm到_2〇〇 ,且 八,υ· 1 nm 的移位準確度。此修正是對於總體移位的每個通道,預期 的是’在圖型束中的所有小束使用相同偏移值。對於總體 圖型移位,通道圖型是無關於束交插策略而整體作移位。 .煻、滅器時庠低_後.Τ 對於多個子通道的小束控制信號較佳為在單個通道上 作多工傳輸。視熄滅器設計而定,此將造成個別的小束在 不同時間切換到下個像素。總滅器時序偏移修正需要按昭 =通道在Υ的修正,典型為具有小於一個像素的最大餘 範圍、以及(M nm的移位準確度。移位參數是靜態的,由 於熄滅器時序偏移是視熄滅器設計而定。 炮滅器孔偏孩你$ 因為熄滅器幾何結構,不同的孔具有從某個參考點的 不同偏移使用在孔之X的偏移以產生交插的圖型(參閱: 圖9)。將即時考量其可預測的時序延遲且不視為此修正的 部分’a相對於參考(例如:中間條帶)之γ的偏移被補償α 誤差疋劃刀為全像素與子像素分量β全像素移位應為總是 作補償,而僅有即時柵格化能夠處理子像素分量。媳滅器 42 201123254 孔偏移修正需要按照子通道對於子像素分量在γ的修正, 典型為具有+/-Wproj/2或+/_210叫即:叫)^的最大 移位範圍、以及0_1 nm的移位準確度。修正參數是靜態的, 因為熄滅器孔偏移是依據熄滅器幾何結構。 劑量修正 因為在微影機器中的製造容許度變化,有效的劑量是 按照小束而變化。小束掃描偏轉強度的變化亦可造成劑量 強度的變化。劑量率亦可使用一個劑量因數作修正:造成 劑量率=劑量率圖*劑量因數。此公式以數學式描述修正, 但是劑量修正較佳為藉由調整像素白值及/或臨限值在遞色 過程中實現。舉例來# ’當小束用9G%的劑量因數來校準, 其強度為100%/90% = Ul.1%e因此,若1〇〇為預設值用 於遞色的白值將為Ul.1;且若預設值為5〇,遞色臨限值將 為 55,6 〇 劑量修正是按照小束而實行,且修正參數是每個晶圓 更新一次。對於劑量修正的典型要求/值是5〇%_1〇〇%的圖型 劑里映射0.2/〇步進尺寸的圖型劑量準確度、-⑽%的 束劑量因數、及0.2%步進尺寸的束劑量準確度。造成的劑 量率應捨入到最接近的值。 圖项定標(scaling) 束疋於各個掃描期間在y方向偏轉且將圖型從條帶的 一側寫入到另一側。偏轉距離是較佳為涵蓋條帶寬度與過 掃描距離的二倍。假使偏轉為非完全一致,一束是相較於 其他者而偏轉較強且因此偏轉距離將為不同。歸因於跨於 43 201123254 差異是發生在掃描 列遠端處之較弱的 偏轉場之小束而將 陣列所發生的電麼降,掃描偏轉強度的 偏轉陣列表面上”巧電”造成在陣 偏轉場,且偏轉距離是對於經歷較弱的 為較短。 此是使用圖型定;I® & I e g 261 M ®以標的實例是在圖 26所不。在圖左,條帶 _ 杖料从立 員為具有以在虛線之間的圖型 特徵的思圖標定所覆蓋 A且右—去从山 在圖右,條帶是顯示 公、有右未作出定標修正時而將寫入所覆蓋的圖型。如可 ^:需要圖型定標修正來降低通道的所有小束的 正確標定將特徵寫入。 定標可藉由調整傳送到熄滅器的數據信號的位元率、 ^光圖型散佈在不同數目的像素上所達成。歸因於同步 考里,改變位元率並非為較佳。為了避免此,定標 將圖型散佈在不同數目的位元/像素上所實行。假設的二 =同群組的小束具有相同的偏轉強度。此是因為其為由確 實相同的偏轉器所偏轉。因此對於某個群組的所 , 圖型定標因數是相同。 ^圖型定標需要每個通道的修正’修正參數較佳是每個 冗餘掃描重組而更新一次。最大範圍典型為i至"」(例如: 2 _變為2.2㈣,且準確度〇」nm/1 Mm=1/1〇,_。假設 偏轉強度疋對於通道中的所有小朿為相同,因為小束共用 相同的偏轉陣列,且大約為在此偏轉器中的相同位置。 圖27是總結種種型式的修正以及典型的參數與範圍的 表格。注意,當使用第一次掃描與第二次(或冗餘)掃描時、, 44 201123254 劑量修正是較佳為在 動態圖袒蒋付 一個掃描前所實行 動態圖型移位亦可 J破鐽供來補償晶圓發熱。此可使用 母個通道的X與γ低 — 表’其具有依據時間變化的值。可 使用每1 ms為〇. 1 nm 田 的攻大斜率(等於在X的-10 μηι),及 每3 00 mm (晶圓尺寸α 马具有30,000個項目的偏移表。 逼型定尺寸(Si7ir>Mf 為在遍及掃描偏轉陣列表面的小束掃描偏轉強度的 差異’小束的偏轉距離將^變。此可使用(上述)圖型定標或 圖3L定尺寸修正來作補償。對於圖型定尺寸修正的要求是 概括為同於圖型定標。 數據途徑架構 數據途控接收指定格式的佈局數據且將此數據處理以 使得其可使用電子束來寫入在晶圓上。數據途徑亦實行對 圖型數據的調整以補償在微影機器中的誤差,並且將同步 信號提供到其他的子系統。 圖28顯示數據途徑的功能方塊圖,其顯示從GDs_n 圖型數據檔案到其透過光纖所傳送的位元串流之流程。此 圖亦顯示其發生在適當功能方塊的修正。視架構選項而 定’可在數據途徑處理内的不同點處而作出修正。 輸入數蜱始 $ 對於數據途徑子系統的輸入將是預處理的格式(通常為 得自諸如GDS-II或MEBES的工業標準檔案格式),其含有. 將被“寫入”到晶圓上的佈局資訊。憑藉此工業標準樓案39 201123254 • Delay error in the data path (eg due to differences in fiber length in the data path). This error can be corrected by shifting in the y direction. • Off and off timing offset. Due to the multiplexed control of the beamlet control signal, multiple beamlets are controlled through one channel and the beamlet control signals are in-line reception, ie: the control signals for different beamlets are by the beamlet extinguisher array at different times Received. Depending on the design of the extinguisher, it will be subject to different offsets for turning the beam on and off (for example, the beamlets can be switched in units of columns or rows) or individual beamlets. Depending on the strategy for implementing the control bits (the beamlets are switched), this may cause the particular beamlet to switch between later (four) than the other beamlet. The effect of this error is in the subpixel range +. The result is the offset of each beamlet. • Changes in the hole position of the small beam quencher array. Each beamlet passes through the aperture in the beam annihilator array and is switched by the extinguisher electrode at the aperture. Variations in the manufacture of the beamlet blanker array may result in mechanical offsets in both the 乂 and y directions of the hole and thus the position of the corresponding beamlets, as compared to the reference position. The effect of this error is typically multiple pixels, and the result is the offset of each beamlet. The full pixel (integer) portion of this error will typically be compensated for during execution. The remaining fraction (fractional) portion can be compensated by instant rasterization. • Changes in deflection strength. These may be due to the spatial variation of the beam deflection (4) electrical deflection field strength, which must be corrected for "SCaling", "dose correction," or it may have a small difference in deflection; The beam offset component, which can be corrected by "pattern shifting." For use in extinguishing the beamlets, the effective dose rate will vary during the control signal pulse. Different timing behaviors due to the turn-on and turn-off of the array electrodes 40 201123254 is different between J bundles. This effect is significant when the control signal is not multiplexed (for example: 1G%). As far as the control signals of 49 small beams in one channel are transmitted, the significance is reduced because the transition effects are the same, but the minimum pulse width is larger than that of the unmultiplexed transmission case. Times (assuming 10%/49=0.2%). Moreover, this error is dependent on the dose rate. The error will be written to be small for the 100% dose rate, and the error is written at the highest dose rate at 5 〇%. The overall pattern is paid. When writing a pattern on a wafer, the beamlet of the written pattern is unlikely to be a sentence for Van Wang. In order to correct this misalignment and cause the beamlet to be able to write to the alignment strip' adjustment pattern data to compensate for the misalignment error. This adjustment can be done using software or hardware and can be done at different stages during the processing of pattern (4). For example, the correction may be made in (4) vector format, # in a multi-level gray scale format, or in a pattern data of a first-order B/W bit το bitmap. The offset may occur in the X direction (stage movement direction) or the y direction (small beam scan deflection direction) or both. The offset may occur with full pixel shifting and/or sub-pixel shifting. Full pixel shifting can be achieved by shifting several pixels after rasterization. Subpixel shifting can be achieved as part of the rasterization process. The overall pattern shift (ie, the shift of all beamlets in a channel) can be used for strip position correction (in the X and y directions) and field position correction (in the \ and y directions). The example of the χ and y pattern shifts for strip position correction is not as shown in Fig. 25. On the left side of the figure, the strip is displayed with the desired pattern with its overlap at the intended position. On the right side of the figure, the strip is a pattern that has overlaps that will be written if no corrections have been made. As can be seen, the overall pattern shift 41 201123254 is required to cause all of the beamlets of the channel to be written to the position shifted up and to the left. Typically done frequently after calibration (per wafer or field-to-beam shift. It can be assumed that the beamlets are fully aligned with respect to their = beamlets in the same channel, so that all beamlets in the channel are made The same pattern offset is obtained. His typical requirement for pattern shift is the individual X-Shaft shift setting for each head of the overall shift, and the 纟 parameter is for each field _ is updated - times = typical The maximum shift range can be from +2〇〇nm to _2〇〇, and the displacement accuracy of 八·υ·1 nm. This correction is for each channel of the overall shift, and is expected to be 'in the pattern bundle All beamlets in the same offset value are used. For the overall pattern shift, the channel pattern is shifted regardless of the beam interleaving strategy. 煻, 灭 灭 庠 _ _ Τ Τ For multiple subchannels The beamlet control signal is preferably multiplexed on a single channel. Depending on the design of the extinguisher, this will cause individual beamlets to switch to the next pixel at different times. The totalizer timing offset correction needs to be corrected. = channel correction in Υ, typically with a maximum margin of less than one pixel, and (M nm shift accuracy. The shift parameter is static, since the extinguisher timing offset is dependent on the extinguisher design. The gunner hole is a child. Because of the extinguisher geometry, different holes have a certain The different offsets of the reference points use the offset of the X of the hole to produce the interpolated pattern (see: Figure 9). The predictable timing delay will be considered immediately and is not considered part of this correction. The offset of γ of the reference (eg, the middle strip) is compensated by the alpha error. The knives are full-pixel and sub-pixel components. The full-pixel shift should always be compensated, while only the instant rasterization can process the sub-pixels. Component. Quencher 42 201123254 Hole offset correction requires correction of γ for sub-pixel components according to sub-channels, typically with a maximum shift range of +/- Wproj/2 or +/_210 called: )) 0_1 nm displacement accuracy. The correction parameters are static, because the extinction hole offset is based on the extinction geometry. Dose Correction Because the manufacturing tolerances in the lithography machine vary, the effective dose varies according to the beamlet Small beam scanning deflection intensity change It can also cause a change in dose intensity. The dose rate can also be corrected using a dose factor: causing a dose rate = dose rate map * dose factor. This formula is modified mathematically, but the dose correction is preferably by adjusting the pixel white value. And/or the threshold is implemented during the dithering process. For example, when the small beam is calibrated with a 9G% dose factor, the intensity is 100%/90% = Ul.1%e. Therefore, if 1小 is The white value of the preset value for dithering will be Ul.1; and if the preset value is 5〇, the dithering threshold will be 55,6 〇 The dose correction is performed according to the small bundle, and the correction parameter is The wafer is updated once. For the typical requirement of dose correction, the pattern dosage accuracy of 0.2/〇 step size, -(10)% beam dose factor, and the map agent in the model of 5〇%_1〇〇% are mapped. Beam dose accuracy of 0.2% step size. The resulting dose rate should be rounded to the nearest value. The item is scaled and deflected in the y direction during each scan and the pattern is written from one side of the strip to the other. The deflection distance is preferably two times the width of the strip and the overscan distance. If the deflection is not exactly the same, one beam is deflected stronger than the others and therefore the deflection distance will be different. Due to the difference across 43 201123254, the difference is the beam that occurs in the weaker deflection field at the far end of the scan column, and the electrons that occur in the array are dropped. The surface of the deflection array that scans the deflection intensity is “intelligible”. The deflection field is deflected and the deflection distance is shorter for a weaker experience. This is the use of the model; I® & I e g 261 M ® to the standard example is shown in Figure 26. In the left part of the figure, the strip _ stick material is covered by the icon of the figure with the pattern feature between the dotted lines, and the right side is taken from the mountain to the right of the figure, the strip is displayed public, and the right is not made. The calibration pattern will be written to the covered pattern. For example, ^: Graphic calibration correction is required to reduce the correct calibration of all beamlets of the channel to write features. The calibration can be achieved by adjusting the bit rate of the data signal transmitted to the extinguisher, and the pattern of light is spread over a different number of pixels. Due to the synchronization of the test, changing the bit rate is not preferable. To avoid this, scaling is performed by spreading the pattern over a different number of bits/pixel. The assumed two = the same group of beamlets have the same deflection strength. This is because it is deflected by a deflector that is identical. Therefore, the pattern scaling factor is the same for a group. ^Graphic calibration requires correction for each channel. The correction parameter is preferably updated once for each redundant scan recombination. The maximum range is typically i to "" (for example: 2 _ becomes 2.2 (four), and the accuracy 〇" nm / 1 Mm = 1 / 1 〇, _. Assume that the deflection strength 疋 is the same for all the small 朿 in the channel, Since the beamlets share the same deflection array and are approximately the same position in the deflector. Figure 27 is a table summarizing the various types of corrections and typical parameters and ranges. Note that when using the first scan and the second time (or redundant) scanning, 44 201123254 Dose correction is better to perform the dynamic pattern shift before the dynamic map 袒 付 paid a scan can also be used to compensate for wafer heating. This can be used The channel's X and γ are low - the table 'has a time-dependent value. It can be used every 1 ms for the 1 nm field's attack slope (equal to X -10 μηι at X), and every 3 00 mm (wafer The size α horse has an offset table of 30,000 items. The forced size (Si7ir>Mf is the difference in the deflection strength of the beamlet scanning across the surface of the scanning deflection array'. The deflection distance of the beamlet will change. This can be used (above) ) Figure calibration or Figure 3L sizing correction for compensation. The requirement for pattern sizing correction is summarized as being the same as the pattern sizing. The data path architecture data control receives the layout data of the specified format and processes this data so that it can be written on the wafer using the electron beam. The data path also implements adjustments to the pattern data to compensate for errors in the lithography machine and provides synchronization signals to other subsystems. Figure 28 shows a functional block diagram of the data path showing the GDs_n pattern data file to The flow of the bit stream transmitted through the fiber. This figure also shows the corrections that occur in the appropriate function block. Depending on the architecture option, it can be modified at different points within the data path processing. The input to the datapath subsystem will be in a preprocessed format (usually from an industry standard file format such as GDS-II or MEBES) containing layout information that will be "written" onto the wafer. This industrial standard building case

45 201123254 格式,預先定義的季 處理後,數據將對於=:以離線處理所施加。在離線 用便於後續處理的二:=的下個階段作儲存。數料 為一個檔案。田’、式作儲存,例如:每個個別通道 劑量映射數蜱 劑量映射是典型為使用向 的區域。劑量率是每以… '乂疋義早一個劑$率 率來將圖铟宜 b面積的輻射強度。用適當的劑量 手來將圖型寫入是必要 要的否則寫入的圖型將不會在抗蝕 出現。舉例來說,劑量率的範圍可能以〇.2%的步 進為50-1GG%,且劑量映射的空間解析度可能為⑺.η· 該等區域為非重疊’故其描述該等區域的多邊形的線並未 父又。料區域可使用在角度〇、45。、或90。的線以向量 弋斤定義#果發生即時轉列,離線過程可將複雜的多 邊形分解為較簡單者,例如:多邊形可作簡化以使得掃描 線僅相交邊界最多為二次。在匕簡化在硬體中的轉列。 預處理 預處理作用是典型為每個設計實行一次。此步驟需要 完成大量的計算能力。通常在預處理中納入以下功能性:(句 讀出GDS.II晶片設計且取出對於在晶片製程中的特定步驟 所需的資訊《此典型造成對於在此步驟中所需的特徵的多 邊形圖。(b)對劑量映射應用抗蝕劑加熱修正。此修正典型 迈成對於特徵位置的調整。(c)在多邊形上應用鄰近修正。 此修正將造成具有附屬不同劑量率的更多個多邊形的劑量 映射。(d)將對於以向量格式的各個場域的劑量映射輸出。 46 201123254 通道劃分(splitting) 逋迢走較隹使用為用於進一步虚& οβ '處理的單元。為了使此 為可能,場域劑量映射被劃分為每個 & ’個通道的劑量映射。多 邊形是縮減為由一個通道所寫入的 ^條帶區域。條帶區域是 較佳為延伸超過條帶的邊界,以考量縫紐楚& t 1里縫綴菜略與遞色起始 人為因素。若使用“智慧邊界,,縫綴策略,盆中,臨界特 徵被分配到單一通道/條帶,則在條帶邊界上的臨界特徵多 邊形被分配到當將劑量映射劃分時的特定通道。 通道轉列(rendering) 轉列是栅格化過程的第—…。形狀資訊盘劑量資 訊是轉列在像素中。圖29顯示其重疊在條帶上的佈局圖型 特徵來說明轉列過程。㈣資訊與劑#資訊是在㈣㈣ 中以向量格式所描述,且通常為基於場域。在χ的像素邊 界值是由機器的起點所固定(亦假設第—列將為由小束所 寫入)。此將確定所有像素X座標(在圖29的德本ν · 1豕 I X idx)與 其將掃描線寫入的對應小束(在圖29的 ^ ,^ 4术ldx)之間的關 係。掃描線是在γ方向的一列像素。 從在晶圓上的場域的典型X位置及從例行計量過程所 確定的X偏移,可確定特定場域的第—次掃描線(第一場域 像素列)。在此實例中的像素與場域原點是未對準。因此 “sub pix offs X”定義從場域原點開始處的左像素X邊界 (作為用於向量格式的參考)的偏移。 ' 1 在Y的像素尺寸、條帶寬度、過掃描與圖型定標將造 成其為需要的整數個像素。一個額外像素可能被添加以允 201123254 = 位。對於所有小束的圖型定標因數將為相同且 因此所有像素將為相同γ尺寸。 』且 移位可總是分為整數部分(全 像素移位)。可藓由蔣移位)與分數部分(子 位。益法η 框中移位來實現全像素移 …、法以此方式來實現子像素移位,但可藉由轉列/ ::來進行。在Υ方向的移位為總體(即:在¥方向的總體 :修:Γ):按照小束為專用(例如:束位置或熄滅器時序偏 ,)。轉列過程應知道哪個小束寫入掃描線且(將子像 :適::掃播線像h像素是在轉列前被移位,故其為 f準條帶VecrefY”(參閱:在圖中的放大顯示A)線,盆 對於特徵與劑量的向量格式描述而言為在y方向的基線Γ 因為在小束與像素X索引之間的關係是僅當開始掃描 而為,定’僅可用即時轉列來處理子像素移位。離線轉列 將總疋假設子像素移位為零。 虚_道遗色Mitherinp;) 遞色是栅格化過程的第二個步驟。藉著遞色,特定的 劑量率是由對於子通道的切換序列所實現。遞色是實質將 多T灰階像素量化為二階的白/黑像素,並將在各個像素中 的量化誤差傳播到相鄰的像素且局部強制特定平均劑量 率。圖30說明此過程。遞色技術是當印刷時典型用於實現 灰階或彩色變化。-些眾所週知的演算法是誤差擴散(2χ2 矩陣)與弗洛依德斯坦貝格(F1〇yd steinberg)(2x3矩陣卜 遞色是在一或二個(螺旋形)方向實行。遞色演算法典型 需要為了作準備的-些像素。因&,條帶£度是為了較佳 48 201123254 結果而延伸小的邊限。 為了微影目的,可作出一些改良。一個改良在於誤差 傳播較佳為不傳播到零值的像素。誤差值應在另一個方向 傳播或拋棄。將量化誤差傳播到在需要零劑量處的像素是 無用的。此鑒於對於CD與間距的合理值而亦應理解。如果 發生從灰值到零值的轉變,此保證將接著多個零像素。 遞色處理將灰階像素轉變為黑/白像素。因為遞色處理 必須將ϊ化誤差傳播到其相鄰像素,,亦處理$條掃描線的 子像素移位。圖30說明此處理。為了以準確方式傳播量化 誤差,對另一條掃描線的誤差傳播不是不重要,因為掃描 線並未對準。量化誤差可基於在相鄰像素之間的重疊量而 傳播’使得具有較大重疊的像素接收傳播量化誤差的較大 部分。替代且較簡單的策略是僅將誤差傳播到其具有最大 重疊的相鄰者》 v…旦八权狂钓匙因於來自轉列處 量率、每個小束的劑量因數及對於通道的定標因數 因齡县赫社蛊.丨 土《 μ . 劑 旦 、哎 < 的疋知;因數。劑 :因數是較佳為每個小束所設m,冑色模組亦應知 道掃描線對小束結合(在圖3〇的“子束Μχ”)。 遞色處縣造成料條帶的所有像素之通/斷狀態。在 =步處理之前,移除選用的邊限像素。如果發生軟邊緣, ^要邊限像素,因為在條帶邊界已經具有平滑的 間為已知或是 且像素將在γ 視架構選項^,修正是在遞色過程期 未知。對於離線遞色,無法進行子像素移位, 49 201123254 方向對準。 對於遞色過程,臨限是較佳總是為“白值,,的—半 因為白值將因為小束劑量修正而從預設值偏離。 通道·定框及多工 此處理實行在遞色之後的種種任務。遞色像素位元是 投射到掃描線位元框中。在此作業中,可實行小束特定的 全像素移位。對於單一個偏轉掃描,接著組合適當位元。 如在稍早對於轉列處理所述,在γ方向的全像素移位 可在稍後階段所進行❶b/w位元映射的像素是置放在其掃描 ,位元框中。此位元框典型較位元映射寬度為寬,因為= 容許移位空間。圖31說明此處理。垂直箭頭指出相對於零 移位線的全像素移位。若像素在此線(如在圖31中的掃描線 =元框的最左掃描線)開始’其全像素移位為零且像素為完 全置中在掃描線位元框令。 凡 ,組合偏轉掃描框位元的下個步驟是在圖32所顯示。出 :::必要以適應正柄寫入策略且在對的時刻提出心 對於參數Ν=4且Κ-3μπ Λ 圖的底側左部顯奸 不门 2 Κ-3的不同小束位置。位置是顯示為對妒 束幸tr 並不夠充分。對於此步驟,應知道小 有::偏轉掃描索引二者。對於特定偏轉掃描索引的所 部列=料為單—個偏轉掃描位元框。在圖32,二個底 疋填滿符號以查出在偏轉掃# ' 在偏轉“位-框的像素位置。 50 201123254 作為最後一個(選用)步 改良數據傳輸。. 數攄浠 偏轉掃描位元框將作編碼以 圖33疋顯不數據途徑的主 立士%回^入 要數據處理與儲存元件的示 思方塊圖,包含:離線處理& m _ 圖型串流器節點及熄滅器晶片早·器)、數個 月(小束熄滅器陣列)。 離線處理&中央儲存里;θ + 以rn,…… 處理輸入佈局數據(例如: 以GDS-II格式)且產生對於條锴 ' 的輸入槽案。根據對於各個 知描之通道對條帶的分配冬 ΛΑ 條帶數據必須最後終止在正確 的圖型串流器節點。 圖型串流器節點含有磁碟與RAM儲存。磁碟儲存是用 以儲存對於計晝圖型的輸人數據,且ram儲存正在將目前 圖型串流處理的處理單元所需要的數據。 視:構選項而定’來自伺服器的輸入數據是相同於對 於處理單7C的輸人數據。此為適用於離線與即時柵格化。 對:離線柵格化,位元映射是從伺服器所接收且轉送到處 理。對於即時柵格化,以向量格式的輸人數據是從词 服益所接收且轉送到處理單元1理單元將向量格式轉換 為^元映射。對於線内架構的選項,以向量格式的輸入數 據是為了處理單元而轉換為位元映射。 塞構撰Ji 數據途徑的功能單元是顯示於圖28 : (1)預處理;(2) 通道劃分;(3)通道轉列;(4)通道遞色;(5)子通道映射;及 (6)通道多工與編碼。 51 201123254 預處理及通道劃分較佳為離線執行,且子通道映射及 通道夕工與烏碼較佳為即時執行。然而,才冊格化(包含通道 轉列及通道遞色)可為離線' 線内或即時執行。下述架構選 項為.(A)離線柵格化;⑻線内栅格化與按照場域偏移;(C) 線内柵格化與對準場域;(D)即時柵格化。 在微影系統的一個實施例中,定義微影系統的以下要 求(其影響數據途徑架構):最大場域尺寸為26mm χ 33匪 (y,X)且每個場域的寫入時間為2.5秒,加上對於第二次的另 個2.5秒;13,000個光纖/通道/條帶與㈣侧個電子小束 03’OOOx每個通道為49個小束);條帶寬度為—且過掃 描寬度(單側)為1.1 5 _ (包含〇 2偏移範圍卜/鳩_ + 〇 2 定標範圍(條帶寬度的⑽)+()·5軟邊緣(()5 _單側)+ 〇·25 寫入策略(假設Wpr。尸42〇 nm :單側Wpr〇j/2=2 ι 〇;最大 偏轉寬度為4.3 _ (偏轉頻率為視寫入策略與驅動速度而 定);典型像素尺寸為3.5nm,且像素尺寸範圍為2,6詣 (1/3到3x(典型像素尺寸)2);劑量網格解析度為⑺^㈣; 最小間距為64對於線的最小CD為22 nm,且對於孔 的最小CD $ 32 nm ;輸入解析度為〇 25 _且拇格化解析 度為0.1 nm。 在圖型串流器上的數據圖型儲存尺寸>10個圖型;更新 新修正參數且為備妥以開始寫入新晶圓的時間是36秒;從 伺服器到圖型串流器的上載時間<6〇分鐘;從局部儲存写到 快遠讀體的成像<60秒(單獨處理步驟)及<6分鐘(在寫入 期間)’·且以7個處理單元之12個通道的處理節點。‘·’、 52 201123254 微影系統較佳為均能夠處理正與負抗姓劑。抗钱劑的 特徵較佳為在數據途徑的離線處理中所處理且數據途徑的 其餘部分應該對於此不須知道1 了寫人單一個晶圓,可 使用二次,即:首次與第二次或冗餘次。此二者的組合將 寫入在晶圓上的所有13,〇〇〇個條帶。 嚴廣A :離線柵格化 圖59顯示使用離線柵格化的實施例。GDs n格式圖型 數據接受離線處理,包括:鄰近效應修正與抗㈣加熱修 正。若使用智慧型邊界,邊界是在此階段作計算。柵格化(轉 列與遞色)被實行來將肖*圖型數據轉換為二階的黑/白位 元映射,其為用於此實施例(即:用於傳輸到微影系統的數 據格式)的工具輸人數據格式。此離線處理是對於既定的圖 型設計、對於一或多個批次的晶圓為實行一次。 其次’實行工具輸入數據的線内處理以產生圖型系統 串流(PSS,pattern system streaming)格式,其亦為 B/w位 元映射格式。線内處理是典型以軟體實行。圖型串流器接 著處理PSS格式數據以產生熄滅器格式數據,備妥以供傳 輸到小束熄滅器陣列《此處理是典型以硬體實行,且可包 括其涉及對於束位置校準、場域尺寸調整及/或場域位置調 整之在X及/或Y方向的全像素移位的修正。可按照場域來 實行此處理。熄滅器格式圖型數據接著被傳送到用於晶圓 曝光的微影系統。 在此架構選項中’許多工作异雜綠;佳45 201123254 Format, pre-defined season After processing, the data will be applied to =: for offline processing. It is stored offline in the next stage of the second:= for subsequent processing. The number is a file. Fields, storage, for example: number of dose mappings per individual channel 剂量 The dose mapping is typically the area of use. The dose rate is the radiant intensity of the indium b area of each of the graphs. It is necessary to write the pattern with the appropriate dose hand, otherwise the written pattern will not appear on the resist. For example, the range of dose rates may be 50-1 GG% in steps of 〇. 2%, and the spatial resolution of the dose mapping may be (7). η · The regions are non-overlapping, so the description of the regions The line of the polygon is not the parent. The material area can be used at angles 〇, 45. , or 90. The line is defined by the vector. The immediate process is broken down. The offline process can decompose the complex polygon into a simpler one. For example, the polygon can be simplified so that the scan line only intersects the boundary at most twice. In 匕 simplify the transfer in the hardware. Pretreatment Pretreatment is typically performed once for each design. This step requires a lot of computing power. The following functionality is usually included in the pre-processing: (Sentence reads the GDS.II wafer design and takes the information needed for the specific steps in the wafer process. This typically results in a polygonal map of the features required in this step. b) applying a resist heating correction to the dose mapping. This correction typically proceeds to adjust the feature position. (c) Apply a proximity correction to the polygon. This correction will result in a dose mapping for more polygons with different dose rates attached. (d) The dose mapping for each field in the vector format will be output. 46 201123254 Channel division (splitting) is used as a unit for further virtual & οβ 'processing. To make this possible, The field dose mapping is divided into dose maps for each & 'channel. The polygon is reduced to the strip region written by one channel. The strip region is preferably extended beyond the boundary of the strip to consider The stitching of the New Chu & t 1 is slightly related to the initial color of the dithering. If the "smart border, the stitching strategy, the pot, the critical features are assigned to a single For the track/strip, the critical feature polygon on the strip boundary is assigned to the specific channel when the dose map is divided. Channel rendering The transition is the first... of the rasterization process. The information is circulated in the pixels. Figure 29 shows the layout pattern features superimposed on the strips to illustrate the transition process. (4) Information and Agent # Information is described in vector format in (4) (iv), and is usually based on the field The pixel boundary value at χ is fixed by the starting point of the machine (also assuming that the first column will be written by the beamlet). This will determine all pixel X coordinates (deben ν · 1豕 IX idx in Figure 29) The relationship between the corresponding beamlets (in Figure 29, ^4 ldx) of the scan line. The scan line is a column of pixels in the gamma direction. Typical X position from the field on the wafer And the X-offset determined by the routine metrology process, the first-order scan line (the first field-pixel column) of the specific field can be determined. The pixel in this example is misaligned with the field origin. "sub pix offs X" defines the left pixel X boundary from the origin of the field (as a reference for the vector format) offset. ' 1 pixel size, strip width, overscan and pattern scaling in Y will cause it to be an integer number of pixels needed. An extra pixel may be added to allow 201123254 = Bit. The scaling factor for all beamlets will be the same and therefore all pixels will be the same gamma size. And the shift can always be divided into integer parts (full pixel shift). Bit) and the fractional part (sub-bit. The shift in the η method η box to achieve full-pixel shift..., the method achieves sub-pixel shifting in this way, but can be performed by arranging /: :: in the Υ direction Shift to the whole (ie: the total in the direction of the ¥: repair: Γ): dedicated to the small bundle (for example: beam position or extinguisher timing offset). The transfer process should know which small beam is written to the scan line and (sub-image: suitable:: the sweep line is like the h pixel is shifted before the transition, so it is the f-strip strip VecrefY) (see: in the figure) The magnified display shows the A) line, which is the baseline in the y direction for the vector format description of the feature and dose. Because the relationship between the beamlet and the pixel X index is only when the scan is started, the set is only available. Instant sub-column to handle sub-pixel shifts. Offline sub-columns will always assume that the sub-pixels are shifted to zero. Virtual_message Mitherinp;) Dithering is the second step of the rasterization process. By dithering, The specific dose rate is achieved by a switching sequence for the sub-channels. The dithering is to substantially quantize the multi-T gray-scale pixels into second-order white/black pixels, and propagate the quantization error in each pixel to the adjacent pixels and Locally forcing a specific average dose rate. This process is illustrated in Figure 30. The dithering technique is typically used to achieve grayscale or color variations when printing. Some well known algorithms are error diffusion (2χ2 matrix) with Freudstein Grid (F1〇yd steinberg) (2x3 matrix It is implemented in one or two (spiral) directions. The dithering algorithm typically requires some pixels to be prepared. Because &, the strip is extended to a small margin for better results of 2011 201123254. For lithography purposes, some improvements can be made. One improvement is that the error propagation is preferably a pixel that does not propagate to zero. The error value should propagate or discard in the other direction. Propagating the quantization error to a pixel that requires a zero dose is useless. This is also understood in view of the reasonable value for CD and spacing. This guarantee will follow multiple zero pixels if a transition from gray to zero occurs. Dithering transforms grayscale pixels into black/white pixels. Since the dithering process must propagate the decimation error to its neighboring pixels, the sub-pixel shift of the $ scan line is also processed. Figure 30 illustrates this process. In order to propagate the quantization error in an accurate manner, the error to the other scan line Propagation is not unimportant because the scan lines are not aligned. The quantization error can be propagated based on the amount of overlap between adjacent pixels' such that the pixels with larger overlap receive propagation quantization error The larger part. The alternative and simpler strategy is to propagate the error only to its neighbors with the greatest overlap. v... Once the eight-powered squid, due to the rate of the transfer, the dose factor of each beamlet And the calibration factor for the channel, the age of the county, Heshe, the soil, the μ. The agent, the 哎, the 疋 know; the factor: the factor is better for each beam, m, 胄 color module It should also be known that the scan line pair is bundled (in the “Subbeam Μχ” in Figure 3). The dithering county causes the on/off state of all pixels of the strip. Before the step is processed, the selected edge is removed. Limit pixels. If a soft edge occurs, ^ be bound to the pixel, because the strip boundary already has a smooth interval is known or the pixel will be in the γ view architecture option ^, the correction is unknown during the dithering process. For offline dithering, subpixel shifting is not possible, 49 201123254 Alignment. For the dithering process, the threshold is always always "white value," - half because the white value will deviate from the preset value due to the small beam dose correction. Channel, frame and multiplex this process is performed in the dither Subsequent tasks. The dithered pixel bit is projected into the scan line bit box. In this operation, a small beam-specific full pixel shift can be performed. For a single deflection scan, then the appropriate bit is combined. Earlier for the column processing, the full pixel shift in the gamma direction can be performed at a later stage by ❶b/w bit mapping of the pixels placed in its scan, in the bit box. This bit box is typically compared The bit map width is wide because = allows for shifting space. Figure 31 illustrates this process. The vertical arrow indicates the full pixel shift relative to the zero shift line. If the pixel is on this line (as in scan line in Figure 31 = The leftmost scan line of the meta box begins with 'the full pixel shift is zero and the pixel is fully centered in the scan line bit box. The next step in combining the deflection scan frame bits is shown in FIG. Out::: necessary to adapt to the positive handle write strategy and present at the right moment The heart is different for the small beam position of the bottom side of the bottom side of the parameter Ν=4 and Κ-3μπ Λ. The position is displayed as a good fortune and is not enough. For this step, you should know There are:: deflection scan index. For the particular deflection scan index, the column = material is a single deflection scan bit box. In Figure 32, the two bottoms are filled with symbols to find the deflection sweep # ' The pixel position of the bit-box is deflected. 50 201123254 As the last (optional) step to improve data transmission. The digital deflection scan bit box will be coded as shown in Figure 33. The main data from the non-data path is returned to the block diagram of the data processing and storage component, including: offline processing & m _ pattern The streamer node and the extinguisher wafer are early), and several months (the beamlet extinguisher array). Offline processing & central storage; θ + to rn, ... process input layout data (for example: in GDS-II format) and generate input slots for strips '. According to the channel for each known channel, the winter band data must be terminated at the correct pattern stream node. The graphics streamer node contains disk and RAM storage. The disk storage is used to store the input data for the metering pattern, and the ram stores the data required by the processing unit that is processing the current pattern stream. Depending on the configuration option, the input data from the server is the same as the input data for the processing list 7C. This is for offline and instant rasterization. Pair: Offline rasterization, the bit map is received from the server and forwarded to the process. For instant rasterization, the input data in vector format is received from the word benefit and forwarded to the processing unit to convert the vector format to a ^-mapping. For the option of inline architecture, input data in vector format is converted to a bit map for processing units. The functional units of the Ji data path are shown in Figure 28: (1) pre-processing; (2) channel partitioning; (3) channel transposition; (4) channel dithering; (5) sub-channel mapping; 6) Channel multiplexing and coding. 51 201123254 Pre-processing and channel division are preferably performed offline, and sub-channel mapping and channel Xigong and U-code are better for immediate execution. However, bookkeeping (including channel routing and channel dithering) can be performed offline or in-line. The following architectural options are: (A) offline rasterization; (8) inline rasterization and field offset; (C) inline rasterization and alignment field; (D) instant rasterization. In one embodiment of the lithography system, the following requirements of the lithography system are defined (which affect the data path architecture): the maximum field size is 26 mm χ 33 匪 (y, X) and the write time per field is 2.5. Seconds, plus another 2.5 seconds for the second time; 13,000 fibers/channels/strips and (4) side electron beamlets 03'OOOx 49 small beams per channel); strip width is - and overscan Width (one side) is 1.1 5 _ (including 〇2 offset range 卜 / 鸠 _ + 〇 2 scaling range ((10) of strip width) + () · 5 soft edges (() 5 _ one side) + 〇 ·25 write strategy (assuming Wpr. corpse 42〇nm: one-sided Wpr〇j/2=2 ι 〇; maximum deflection width is 4.3 _ (deflection frequency depends on the write strategy and drive speed); typical pixel size Is 3.5nm, and the pixel size range is 2,6诣 (1/3 to 3x (typical pixel size) 2); the dose grid resolution is (7)^(4); the minimum spacing is 64 for the line with a minimum CD of 22 nm, And for the smallest CD of CD $ 32 nm; the input resolution is 〇25 _ and the resolution of the thumb is 0.1 nm. The data pattern storage size on the pattern streamer > 10 patterns; The new and new correction parameters are ready to start writing new wafers for 36 seconds; the upload time from the server to the graphics stream is <6〇 minutes; from local storage to fast-distance reading <60 seconds (single processing step) and <6 minutes (in writing period)'· and processing nodes of 12 channels of 7 processing units. '·', 52 201123254 The lithography system is preferably capable of Processing positive and negative anti-surname agents. The characteristics of the anti-money agent are preferably processed in the off-line processing of the data path and the rest of the data path should not be known for this. , ie: first and second or redundant times. The combination of the two will be written on all 13 strips on the wafer. Yan Guang A: Offline Rasterization Figure 59 shows the use of offline grid The embodiment of the grid. The GDs n format data is processed offline, including: proximity effect correction and anti-(four) heating correction. If a smart boundary is used, the boundary is calculated at this stage. Rasterization (transition and dithering) ) is implemented to convert the Shaw pattern data into second-order black/white bits Shot, which is the tool input data format used in this embodiment (ie, the data format used for transmission to the lithography system). This offline processing is for a given pattern design, for one or more batches of crystal The circle is implemented once. Secondly, the in-line processing of the tool input data is implemented to generate a pattern system streaming (PSS) format, which is also a B/w bit mapping format. Inline processing is typically implemented in software. The pattern streamer then processes the PSS format data to produce extinction format data, ready for transmission to the beamlet blanker array. "This process is typically implemented in hardware and may include it involving beam position calibration, field Correction of full-pixel shift in the X and/or Y direction for domain size adjustment and/or field position adjustment. This process can be performed by field. The extinguisher format pattern data is then transferred to a lithography system for wafer exposure. In this architecture option, 'many jobs are heterogeneous; good

下夕忭疋離綵進仃。栅格化將;IOn the eve of the evening, I will leave the color. Rasterization will; I

離線執彳于且母個設什為執行一次β饼於丨卜、强TS J人對於此選項,用於微影 53 201123254 輸人數據是以黑/白(卿,w μ映射 =圖型描述。位元映射是即時處理。因此,僅有由階 段5 (通道定框及容 的。階段5的修…:二圖34)所提供的修正是可用 疋王像素移位修正’其可包括:每個通 斑與Υ方向的總體圖型移位、熄滅器時序偏移(γ方向) 與熄滅器孔偏移(Υ方向)。 '、有在]、束對於列映射(媳滅器孔偏移與熄滅器 的人像的影響。適當的¥偏移將被附加且捨入到最接近 由於僅為全像素修正,相當小的像素尺寸㈠㈣是合 滿足準確度規格。使用小像素的缺點是在於:需要豆 1 目較於可用於通道的較大頻寬,此可能造成較低的產量; 母個通道需要使用多個光纖。 圖5中顯不對於此架構選項的處理流程。重點是 在改變批次的瞬間。虛 4 ]處理机程可作分析以在微影系統整個 過程找出其可用於將圖型數據載入的區間,使得此等處理 可千灯處理以使產量最大化。在中央條塊,批次 ^變到圖@ B。對於此圖,假設的是:沒有理由(因為失效 束)將束與條帶重新配置。載入新圖型的主要部分(對於圖型 B的主要掃描所寫入的條帶)可在最後的主要掃描為完成後 而立即開始。此圖亦顯示:载入新圖型的第二次掃描/冗餘 掃描部分可相當晚開始且應在當對於新圖型的第二次掃描/ 冗餘掃描應開始時而完成0 田 掃描0與F的持續時間都是典型為以分鐘。對於平 54 201123254 行的處理Η與D的總持續時間可為約i分鐘。因此,可用 於載入總㈣㈣間等利於二個掃描與晶圓交換的時間 (約6分鐘),假設不需要在節點間的條帶數據重組。當新的 失效通道是以處理D所找到’條帶數據重組可能是必要的。 圖36是對於離線栅格化架構(選項Α)的圖型串流器節 點的主要兀件的方塊圖。在圖36中,各個節點包含數個元 件。節,點CPU協調在節點上的處理且將數據到處移動。網 路裝置疋與伺服器(離線處理&中央處理單元)通訊且接收 局數據到串流。 磁碟儲存單元儲存用於處理單元的位元映射。可能具 有數種版本的位元映射為可用在磁碟上。可藉由使用以某 些RAID模式的磁碟陣列而改良可靠度與讀取性能。磁碟機 的讀取速度是藉由條串化(RAID 0,將數據分佈在磁碟陣列) 而提高。可藉由將數據以冗餘方式儲存(RAID 5, N個磁碟: 储存尺寸=N-lx磁碟尺寸)而改良可靠度。 處理單元記憶體(PU-RAM)儲存圖型數據。當掃描時, 處理單7L是從此RAM讀取其圖型數據。CPU是在掃描之前 將圖型數據載入到RAM。處理單元將圖型數據串流且產生 光學信號以供傳輸到熄滅器。 對於此組態的典型數據流是在圖37所顯示。圖型數據 是由節點cpu從網路裝置所接收(1)且儲存在磁碟(2)。每當 圖型數據是對於掃描所需要時,節點CPU從磁碟讀出數據 (3)且將其儲存在PU-RAM (4)。當掃描時,處理單元從 PU-RAM讀出其圖型數據(5)。 55 201123254 此架構的重要特徵是PU_RAM的大小、PU RAM載入 %間、磁碟載入時間與磁碟大小。pu_RAM載入時間(將所 有條帶數據載入PU-RAM的時間)將主要取決於磁碟儲存單 元的性能。關於磁碟載入時間,對於新掃描的位元映射必 須從飼服ϋ下載’幻g服器可能是對於通訊的航頸。磁碟 載入時間可為藉由將從伺服器到節點的頻寬增大或是將在 伺服器上的位元映射數據壓縮而改良。對於磁碟大小,假 設為了克服分佈瓶頸(伺服器頻寬),可在磁碟儲存單元中儲 存多個(例如:10個)圖型。視關於可用性或讀取速度的要 求而定,磁碟可針對於特定RAID階層而構成。 以離線與線内的概念,預處理像素的重新排序及映射 可由其包含現場可程式閘陣列(FpGA)的處理單元所實行。 此處理單元將允許全像素移#並且可將來自記憶體的數據 重新排序以朝向熄滅器多工傳輸。 壓縮亦可用於架構選項A。可能的組態包括:無壓縮、 壓縮遞色影像或壓縮灰階影像。 對於無壓縮,圖型串流器節點將(未壓縮)遞色影像儲存 在磁碟上。亦可能在分佈前而在伺服器將此影像壓縮。在 此It況,圖型串流器應在影像接收後不論以何種方式將其 解壓縮,但此似乎不會是瓶頸’因為存在對此處理的合理 時間量。 對於壓縮遞色影像,壓縮降低分佈工作量(通訊時間) 且降低RAM大小要求。對此解決辦法,離線處理應將遞色 影像壓縮’ @ FPGA應將影像在内部解墨縮且將其處理。因 56 201123254 此’在RAM的影像是較小許多。就圖34的功能單元而論, 壓縮與解壓縮功能是插入在遞色之後,如在圖39所示。 壓縮可能對於遞色影像為較沒效,因為遞色影像含有 許多零值,且非零區域是歸因於劑量值的變化而可能為難 以壓縮。圖40顯示遞色的測試影像,使用單色(每個像素為 1位元)影像。影像(圖40)是當每次重複時改變劑量階層之 圖42的遞色版本的8倍。由於每次重複時改變劑量,壓縮 工具無法利用重覆且為較沒效率。GZIp與〇ptipng是可能 的壓縮方法。遞色影像的壓縮不容易且將大約傳遞丨:4規模 的壓縮比率(主要是壓縮零的序列)^使用i :4的壓縮比率, 使用2 nm像素的典型條帶影像的尺寸將造成每個條帶為 4352 MB未壓縮與1088 MB壓縮,且每個串流器為6i gb 未壓縮與15.2 GB壓縮(即:14x)。在此方案中,壓縮遞色 影像將降低RAM大小為16 G位元組,提供對於載入時間(對 單磁碟而言,磁碟_>RAM為約2分鐘)與分佈時間(伺服器◊ 磁碟為約1,5小時)的優點。2分鐘的載入時間適合用於處理 流程中之載入的時窗。料處在於,FPGA A加強具有每個 通道的解壓縮,其跟上約5 Gbit/s的即時數據速率。此外, 伺服器較佳是在初始將所有數據壓縮。 對於壓縮的灰階影像,就圖34的功能單元而論,壓縮 與解壓縮功能應是在轉列之後而插入,如在圖41所示。在 轉列之後,離線過程應將灰階影像壓縮,且FPga將影像解 壓縮、遞色及處理。 〜 圖42顯禾一格(64χ1〇〇〇 nm@2 nm像素)的轉列位元映Offline and the mother is set to perform a beta cake in the 丨, strong TS J for this option, for lithography 53 201123254 input data is black / white (Qing, w μ mapping = graphic description The bit map is instant processing. Therefore, only the correction provided by stage 5 (channel framing and accommodating. Phase 5 mod...: Fig. 34) is available for 疋王 pixel shift correction 'which may include: Overall pattern shift in each pass and Υ direction, extinator timing offset (γ direction) and extinction hole offset (Υ direction). ', there is in, bundle versus column mapping (quenchor hole deviation) The effect of shifting and extinguishing the portrait. The appropriate ¥ offset will be appended and rounded to the nearest due to only the full pixel correction, the relatively small pixel size (a) (four) is to meet the accuracy specification. The disadvantage of using small pixels is that : Requires a larger bandwidth than the available channels, which may result in lower yields; the parent channel requires multiple fibers. Figure 5 shows the processing flow for this architectural option. The focus is on changing The instant of the batch. The virtual 4] processing machine can be analyzed to The entire process of the shadow system finds the interval it can use to load the pattern data, so that these processes can be processed to maximize the yield. In the central block, the batch ^ changes to the graph @ B. For this graph, Assume that there is no reason (because of the failure bundle) to reconfigure the bundle with the strip. Loading the main part of the new pattern (for the strip written by the main scan of pattern B) can be done at the end of the main scan Start immediately. This figure also shows that the second scan/redundant scan portion of the new pattern can be started quite late and should be when the second scan/redundant scan for the new pattern should start. The 0 field scan 0 and F durations are typically in minutes. For Ping 54 201123254 lines, the total duration of processing Η and D can be about i minutes. Therefore, it can be used to load total (four) (four) equal to two The time between scanning and wafer exchange (about 6 minutes), assuming that there is no need to reorganize the strip data between nodes. It may be necessary to reorganize the stripe data when the new failed channel is found by processing D. Figure 36 is for Offline rasterization architecture (option Α) A block diagram of the main components of a graphics streamer node. In Figure 36, each node contains several components. The node, the point CPU coordinates the processing on the node and moves the data around. The network device and the server ( Offline processing & central processing unit communication and receiving office data to the stream. The disk storage unit stores the bit map for the processing unit. It is possible to have several versions of the bit map mapped to be available on the disk. Improves reliability and read performance with disk arrays in some RAID modes. The drive's read speed is improved by striping (RAID 0, distributing data across the disk array). The data is stored redundantly (RAID 5, N disks: storage size = N-lx disk size) to improve reliability. The processing unit memory (PU-RAM) stores pattern data. When scanning, the processing unit 7L reads its pattern data from this RAM. The CPU loads the pattern data into the RAM before scanning. The processing unit streams the pattern data and produces an optical signal for transmission to the extinguisher. A typical data flow for this configuration is shown in Figure 37. The pattern data is received by the node cpu from the network device (1) and stored on the disk (2). Whenever the pattern data is required for scanning, the node CPU reads data from the disk (3) and stores it in the PU-RAM (4). When scanning, the processing unit reads its pattern data (5) from the PU-RAM. 55 201123254 An important feature of this architecture is the size of PU_RAM, PU RAM load %, disk load time and disk size. The pu_RAM load time (the time it takes to load all stripe data into the PU-RAM) will depend primarily on the performance of the disk storage unit. Regarding the disk load time, the bitmap for the new scan must be downloaded from the feed service. The illusion device may be the neck for communication. The disk load time can be improved by increasing the bandwidth from the server to the node or compressing the bit map data on the server. For the disk size, it is assumed to overcome the distribution bottleneck (server bandwidth), and multiple (for example, 10) patterns can be stored in the disk storage unit. Depending on the availability or read speed requirements, the disk can be constructed for a particular RAID hierarchy. With the concept of off-line and in-line, the reordering and mapping of pre-processed pixels can be performed by a processing unit that includes a field programmable gate array (FpGA). This processing unit will allow full pixel shift # and the data from the memory can be reordered for multiplexer transmission towards the extinguisher. Compression can also be used for architectural option A. Possible configurations include: uncompressed, compressed dithered images, or compressed grayscale images. For no compression, the graphics streamer node stores the (uncompressed) dithered image on the disk. It is also possible to compress this image on the server before distribution. In this case, the graphics streamer should decompress the image after it has been received, but this does not seem to be a bottleneck because there is a reasonable amount of time for this processing. For compressed dithered images, compression reduces the amount of distributed work (communication time) and reduces RAM size requirements. For this solution, off-line processing should compress the dithered image. @FPGA should image the image internally and process it. Because 56 201123254 this 'image in RAM is much smaller. As far as the functional unit of Fig. 34 is concerned, the compression and decompression functions are inserted after the dithering, as shown in Fig. 39. Compression may be less effective for dithered images because the dithered image contains many zero values and the non-zero region may be difficult to compress due to changes in dose values. Figure 40 shows a dithered test image using a single color (one bit per pixel) image. The image (Fig. 40) is 8 times the dithered version of Fig. 42 that changes the dose level each time it is repeated. Since the dose is changed each time it is repeated, the compression tool cannot be reused and is less efficient. GZIp and 〇ptipng are possible compression methods. The compression of the dithered image is not easy and will pass approximately: a compression ratio of 4 scale (mainly a sequence of compressed zeros) ^ using a compression ratio of i: 4, the size of a typical strip image using 2 nm pixels will result in each The strip is 4352 MB uncompressed and 1088 MB compressed, and each streamer is 6i gb uncompressed and 15.2 GB compressed (ie: 14x). In this scenario, compressing the dithered image will reduce the RAM size to 16 Gbytes, providing the load time (for a single disk, the disk_>RAM is about 2 minutes) and the distribution time (server)优点 The advantage of the disk is about 1,5 hours). The 2 minute load time is suitable for processing the time window of loading in the process. The expectation is that FPGA A enhances the decompression of each channel, which keeps up with the instantaneous data rate of approximately 5 Gbit/s. In addition, the server preferably compresses all data initially. For compressed grayscale images, as far as the functional units of Figure 34 are concerned, the compression and decompression functions should be inserted after the transition, as shown in Figure 41. After the transition, the offline process should compress the grayscale image and FPga decompresses, dithers, and processes the image. ~ Figure 42 shows the transition bit of the grid (64χ1〇〇〇 nm@2 nm pixels)

57 201123254 射的實例。為了壓縮’使用GZIP與〇ptipng(均為開放原始 碼的壓縮工具)。此二箱古m , 、 方法均為無損失。GZIP是通用的壓 縮工具,而〇Ptipng是專用於壓縮2D影像。pNG壓縮是由 -個階所組成,即.2D預測濾波器與壓縮器,使 得0PUPng提供優異㈣縮比。視在實際設計中找到的圖型 而定,在較大影像中可能存在較多的重複。 使用1:40 (PNG)的壓縮比與2nm像素,壓縮率將影像 收縮到可相比於向量格式的尺寸n使用此方式需要 將png解壓縮整合在處理單元FpGA。當位元映射尺寸以 一因數4而成長,壓縮影像是對於σζιρ 且對於刚為一因數2。壓縮結合小像素為相當有效數丨.3 對於使用灰階像素的此方式之有趣觀察是在於:潛在 允許移位及構成較大的像素以供串流到熄滅器。較大像素 的值可從較小像素所計算,#由使用較小像素的值的線性 組合。輸入影像可視為過取樣。圖43顯示小網格輸入像素 與大輸出像素的此概念。提出實例,其中像素尺寸的比率 是1:2,然而,其他比率亦為可能。FpGA將未壓縮位元映 射且將數個小像素組合形成大像素以供串流到熄滅器。優 點在於:此方式將限制於光纖(大輸出像素)的頻寬,即使當 使用小的輸入像素。於光纖的頻寬是視為瓶頸,且每個通 道可旎需要使用二個光纖以將2 nm像素串流到熄滅器。 關於此架構的備註: •劑量映射較佳為仍被附加到輸入位元映射且由 FPGA所使用。 58 201123254 •因為在FPGA發生遞色,劑量修正是可能的。 •當從輸入像素來構成熄滅器陣列而在X與γ移位, 準確度是取決於實際的像素尺寸。 •在FPGA的解壓縮與遞色是必要的。 •壓縮被附加到離線過程。預期的是,壓縮將會顯著 增加處理工作量。 RAM大小是隨著1:4〇的壓縮比而減小。對於此方案, FPGA是備有即時的解壓縮邏輯,其能夠跟上灰階所擴展的 速率(》5 Gbit/s)。 選項B與C :鋏内Μ始 圖60顯禾使用線内柵格化的實施例。gds-II格式圖型 數據接受如同對於圖5 9的離線實施例的離線處理,包括: 鄰近效應修正、抗蝕劑加熱修正、及(若使用的)智慧型邊 界。修正的向量圖型數據與劑量映射是用於此實施例的工 具輸入數據格式。此離線處理是對於既定的圖型設計、對 於一或多個批次的晶圓為實行一次。 其次’實行向量工具輸入數據的線内處理來將向量數 據柵格化以產生B/W位元映射數據,其在此實施例為圖型 系統串流(PSS)格式。此處理是典型以軟體實行,且可當設 定新劑量設定時而實行。如同在圖59的實施例中,圖型串 流器接著處理PSS格式數據以產生熄滅器格式數據,包括 其涉及如同先前在位元映射數據之對於束位置校準、場域 尺寸調整及/或場域位置調整之在X及/或γ方向的全像素 移位的修正。可按照場域來實行此處理。熄滅器格式圖型 59 201123254 數據接著被傳送到用於晶圓曝光的微影系統。 圖61顯示使用線内栅格化的第二個實施例。此為類似 於圖60的實施例,除了對於束位置校準、場域尺寸調整、 及/或場域位置調整的修正是在向量工具輸入數據上所作成 之外。因為此等修正是在向量數據所作成,在又與γ方向 的全像素移位與子像素移位均可作成。此等修正是典型為 以軟體實彳τ,且可為按照晶圓所實行。在修正已經作成後, 實行柵格化來產生pss格式數據以供輸人到圖型串流器。 圖44顯示其分配到處理步驟的線内栅格化功能單元。 對於此架構,功能單元3與4 (栅格化)是線内執行。對於此 選項’用於微影系統的輸入數據將是以向量格式的條帶圖 ,描述。將依需求(按照晶圓、按照數個晶圓、按照系㈣ )而進行柵格化。在總體偏移或是在總體劑量的變化可 觸發線内柵格化。 適田的劑直是藉由改變像素面積所設定。可藉由改變X 與Y像素尺寸二者而改變像素面積。然而,僅可將X尺寸 =變到某些值(如關於圖1G所論述)。為了總體劑量的微 調’可使用對於γ尺卄玷々嫩 尺寸的改變。假設固定位元率,γ像素 尺寸是藉由改變偏轉頻率 、 貝丰以及使用不同的圖型定標因數所 呂又疋ο 因為柵格化結果將用你 用於所有%域,無法考量特定場域 的子像素偏移。每個揚说Μ 每域的偏移疋較佳為最後捨入到全像 素、、為由階段5(通道定框及多工)所即時考量。 修正可包括: 201123254 •在χ與γ的場域圖型移位(僅為全像素移位)。參數 是每個場域為更新一次。 •在X與Υ的總體圖型移位(以子像素解析度)。參數 是每個晶圓掃描為更新一次或多次。 •透過圖型疋標的總體劑量改變。參數是每個晶圓掃 描為更新一次或多次。 每個小束的劑量修正以及子像素移位是均無法處理。 根本原因是在X方向移位的能力,其㈣㈣小束的映射。 為了限制誤差’此選項將典型導致使用相當小的像素尺寸 (約2 nm)。以小束將寫八每個場域的相同線之意義而言, 此選項相較於架構選項B是特例。換言之,對於小束映射 的列是固定的且對於每個場域為相同。因&,可補償小束 特定修正。S為子像素修正是適當實彳于,小束將以較大的 準確度來寫入圖型。因此,像辛 像京尺寸疋較大的(〜3 5 nm), 其不會造成朝向媳滅器之較高的光學通道計數。 所有修正是被支援,然而,場域是位在理想的位置, 且因此在場域間不具有在X盥 /、Y的偏移。處理流程可能是 不同於架構選項A。對於架構選項 7 L,新的位映射必 須在每個晶圓或數個晶圓而從 攸问里輸入檔案所頻繁產生。 (F)主要掃描。假使新的 映射再生,可能且有 在晶圓測量(E1)的相依性。圖45 /、 棰$ & ^ ”肩不如果發生相依性的處 理流程。當不具有此相依性,處 處理流程將類似於圖35的處 理&程。當有效估計對於再生 理夂鉍彳+ ^ 玍所需要的貢訊(緩慢變化的處 >數)’亦不具有相依性。故, 竹王』提早開始,但是必 61 201123254 須在實際測量之後而確認。如果發生非預期的不匹配,重 斤開始再生且將損失一些產量。最後,考量在於:假使足 夠ram是可用的,可在主要掃描後而儘早開始處理。此對 於處理的時框將再增加25分鐘。支援線内處理的解決辦法 ,要極度強大的處理羊元以滿足合理時序要求。對於最 差的條件(2.00 nm像素,最大縫綴),要轉列的像素數目將 疋每個條帶為35 G個像素。向量數據的尺寸將是每個條帶 為606 Μ位元組。在圖46,顯示對於線内處理的架構。圖 示架構顯示方塊栅格化器(rasterizer),,。此方塊將是負責 將向量格式轉列為條帶B/w影像的線内處理任務。用於實 施線内柵格化器的選項是: •離線,處理及控制。 •使用FPGA邏輯《對於即時柵格化,FpGA邏輯是為 了相同目的所使用。對於即時柵格化,在FpGA的許多資源 必須被使用以滿足性能要求。對於線内栅格化,使用Fp(5A 技術,可用相較於即時形式為較少的資源來實施解決辦法。 •使用GPU技術。圖形處理單元(Gpu,Graphical Processing Unit)是典型為用於視訊處理的處理器。此等處理 盗是在用於轉列3D圖像(遊戲,Vista)的消費者系統(桌上型 與膝上型)所見到。GPU是利用大量平行性。G8〇架構利用 128個分緒(thread)處理器,而技術現狀的卡GTX28〇利用 240個分緒處理器。分緒處理器的性能是概略為核心 CPU的五分之一。GPU的性能是明顯取決於在其任務中的 平行性程度。轉列是相當容易平行化的任務。(在一個方向 62 201123254 的)遞色任務是某個程度(對角線)平行。 •使用技術現狀的多核心CPU。現今的多核心cpu是 極為強大的。實例是Intel的新架構:〜i 7技術。fpga 解決辦法是_為相對便宜的解決辦法。相較於架構選項d (在fpga即時柵格化),對於此解決辦法的性能要求是較為 放寬許多(對於7個條帶為2.5秒、,相較於對於…固條帶為 6分鐘)。因此’ FPGA是較小(且較便宜)許多。儘管如此, 可行性S取決於VHDL的轉列演算法實施的可行性。 當對軟體解決辦法評估,GPU技術將會是最佳,因為 在GPU可用的高度平行性將使得轉列任務獲益。不利之處 在於:GPU技術正快速發展中。藉由提供穩定的計算統合 ^£^^(CUDA> Compute Unified Dev1Ce Architecture) API,此快速發展硬體的問題(至少由NVImA)已經解決。 此API適用大範圍的圖形卡型號與版本。現今,甚至存在 為了高性能計算(Tesla)的產品線。此產品線針對於科學計算 而非為遊戲繒圖。 對於此架構,處理是於以下步驟所描述。 向里格式輸入檔案是從伺服器轉移到硬碟。在開始初 始掃描之前或在參數變化之後,柵格化模組應將輸入檔案 處理以產生新的位元映射。位元映射是儲存在處理單元的 Ram記憶體1掃描時,處理單元從其ram讀出位元映射 ,據。此處理對於架構選項A、B與C是類似的。柵格化器 疋使用FPGA #術所實施。邏輯將類似#同用於即時拇格化 選項。相較於即時解決辦法,線内解決辦法是較為輕型許57 201123254 Example of shooting. In order to compress 'use GZIP and 〇ptipng (both are open source compression tools). The two boxes of ancient m, and methods are no loss. GZIP is a universal compression tool, and 〇Ptipng is dedicated to compressing 2D images. pNG compression is composed of - orders, ie. 2D predictive filters and compressors, so that 0PUPng provides excellent (four) scaling. Depending on the pattern found in the actual design, there may be more repetitions in larger images. Using a compression ratio of 1:40 (PNG) and 2 nm pixels, the compression ratio shrinks the image to a size comparable to the vector format. In this way, png decompression is integrated into the processing unit FpGA. When the bit map size grows by a factor of four, the compressed image is for σ ζ ιρ and for a factor of two. Compression combined with small pixels is quite effective. 有趣3 An interesting observation of this approach to using grayscale pixels is that it potentially allows shifting and constituting larger pixels for streaming to the extinguisher. The value of a larger pixel can be calculated from a smaller pixel, #by a linear combination of values using smaller pixels. The input image can be viewed as oversampling. Figure 43 shows this concept of small grid input pixels and large output pixels. An example is proposed in which the ratio of pixel sizes is 1:2, however, other ratios are also possible. The FpGA maps the uncompressed bits and combines several small pixels to form a large pixel for streaming to the extinguisher. The advantage is that this approach will be limited to the bandwidth of the fiber (large output pixel), even when using small input pixels. The bandwidth of the fiber is considered a bottleneck, and each channel can use two fibers to stream 2 nm pixels to the extinguisher. Remarks about this architecture: • The dose mapping is preferably still attached to the input bit map and used by the FPGA. 58 201123254 • Dose correction is possible because of the dithering in the FPGA. • When X and γ are shifted from the input pixel to form the extinguisher array, the accuracy depends on the actual pixel size. • Decompression and dithering in the FPGA is necessary. • Compression is attached to the offline process. It is expected that compression will significantly increase the processing effort. The RAM size is reduced with a compression ratio of 1:4〇. For this scenario, the FPGA is equipped with instant decompression logic that keeps up with the rate at which the grayscale is extended ("5 Gbit/s). Options B and C: Initiation Figure 60 shows an embodiment using inline rasterization. The gds-II format pattern data is accepted as offline processing for the offline embodiment of Figure 59, including: proximity effect correction, resist heating correction, and (if used) smart boundaries. The corrected vector pattern data and dose map are the tool input data formats used in this embodiment. This off-line processing is performed once for a given pattern design for one or more batches of wafers. Next, the in-line processing of the vector tool input data is performed to rasterize the vector data to produce B/W bit map data, which in this embodiment is a graphics system stream (PSS) format. This process is typically implemented in software and can be performed when new dose settings are set. As in the embodiment of FIG. 59, the pattern streamer then processes the PSS format data to produce extinction format data, including that it relates to beam position calibration, field size adjustment, and/or field as previously mapped in the bit map data. The correction of the full pixel shift in the X and/or γ directions of the domain position adjustment. This process can be performed in accordance with the field. Extinguisher format pattern 59 201123254 Data is then transferred to the lithography system for wafer exposure. Figure 61 shows a second embodiment using in-line rasterization. This is an embodiment similar to that of Figure 60 except that the correction for beam position calibration, field size adjustment, and/or field position adjustment is made on the vector tool input data. Since these corrections are made in vector data, both full pixel shift and sub-pixel shift in the gamma direction can be made. These corrections are typically implemented in software, and can be performed on a wafer basis. After the correction has been made, rasterization is performed to generate pss format data for input to the graphics streamer. Figure 44 shows the inline rasterization functional unit that it is assigned to the processing step. For this architecture, functional units 3 and 4 (rasterized) are performed inline. For this option, the input data for the lithography system will be described in a strip chart in vector format. Rasterization will be performed according to requirements (by wafer, by several wafers, by system (4)). In-line rasterization can be triggered by an overall offset or a change in overall dose. The agent of the field is set by changing the pixel area. The pixel area can be changed by changing both the X and Y pixel sizes. However, only X size = can be changed to some value (as discussed with respect to Figure 1G). For the fine tuning of the overall dose, a change in the size of the gamma size can be used. Assuming a fixed bit rate, the gamma pixel size is determined by changing the deflection frequency, Befeng, and using different pattern scaling factors. Because the rasterization results will be used for all % fields, you cannot consider a particular field. The sub-pixel offset of the field. Each Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ Μ The corrections may include: 201123254 • Field mode shifts in χ and γ (only full pixel shifts). The parameter is that each field is updated once. • The overall pattern shift in X and ( (in sub-pixel resolution). The parameter is that each wafer scan is updated one or more times. • The overall dose change through the graphic target. The parameter is that each wafer scan is updated one or more times. The dose correction and sub-pixel shift of each beamlet are unprocessable. The root cause is the ability to shift in the X direction, its (4) (4) small beam mapping. To limit the error' this option will typically result in the use of a fairly small pixel size (approximately 2 nm). This option is a special case compared to architecture option B in the sense that a small bundle will write the same line for each field. In other words, the columns for the beamlet map are fixed and the same for each field. Due to &, the small beam specific correction can be compensated. S is a sub-pixel correction that is appropriate, and the beamlet will be written to the pattern with greater accuracy. Therefore, like the larger size (~3 5 nm), it does not cause a higher optical channel count towards the quencher. All corrections are supported, however, the field is in the ideal position, and therefore there is no offset between X 盥 /, Y between the fields. The processing flow may be different from architecture option A. For architecture option 7 L, the new bitmap must be generated frequently on each wafer or on several wafers from the input file. (F) Main scan. If the new map is regenerated, there may be a dependency on the wafer measurement (E1). Figure 45 /, 棰$ & ^ ” Should not be dependent on the processing flow. When there is no such dependency, the processing flow will be similar to the processing & process of Figure 35. When the effective estimate is for re-physiological + ^ The required tribute (slowly changing place > number) 'is not dependent. Therefore, the bamboo king starts early, but must be 61 201123254 must be confirmed after the actual measurement. If unexpected unexpected occurs Matching, heavy weight starts to regenerate and some production will be lost. Finally, the consideration is: If enough ram is available, the processing can be started as soon as possible after the main scan. This will increase the processing time frame by another 25 minutes. Support inline processing The solution is to treat the sheep element extremely powerfully to meet reasonable timing requirements. For the worst case condition (2.00 nm pixels, maximum stitching), the number of pixels to be rotated will be 35 G pixels per strip. The size of the vector data will be 606 每个 bytes per strip. In Figure 46, the architecture for inline processing is shown. The illustrated architecture shows a rasterizer, which will be Responsible for translating vector formats into inline processing tasks for stripe B/w images. The options for implementing inline rasterizers are: • Offline, processing, and control • Using FPGA logic “For instant rasterization, FpGA logic is used for the same purpose. For instant rasterization, many resources in FpGA must be used to meet performance requirements. For inline rasterization, use Fp (5A technology, available less than instant form) Resources to implement solutions. • Use GPU technology. The Graphic Processing Unit (Gpu) is a typical processor for video processing. These processing hacks are used to steer 3D images (games, Vista). The consumer system (desktop and laptop) is seen. The GPU is using a lot of parallelism. The G8 architecture utilizes 128 thread processors, while the current state of the art card GTX28 uses 240 threads. The performance of the processor is roughly one-fifth that of the core CPU. The performance of the GPU is obviously dependent on the degree of parallelism in its tasks. The rotation is a task that is fairly easy to parallelize. The dithering task of direction 62 201123254 is a certain degree (diagonal) parallel. • Multi-core CPU using the current state of the art. Today's multi-core cpu is extremely powerful. The example is Intel's new architecture: ~i 7 technology. The fpga solution is _ a relatively inexpensive solution. Compared to the architectural option d (in fpga instant rasterization), the performance requirements for this solution are much more relaxed (2.5 seconds for 7 stripes) This is 6 minutes longer than for a solid strip. So the 'FPGA is much smaller (and cheaper). However, the feasibility S depends on the feasibility of the VHDL's implementation of the algorithm. When evaluating software solutions, GPU technology will be optimal because the high parallelism available on the GPU will benefit the task of migrating. The downside is that GPU technology is rapidly evolving. This rapidly evolving hardware problem (at least by NVImA) has been addressed by providing a stable Compute Unified Dev1Ce Architecture (CUDA) API. This API is suitable for a wide range of graphics card models and versions. Today, there is even a product line for high performance computing (Tesla). This product line is for scientific computing and not for game mapping. For this architecture, the processing is described in the following steps. Entering the file in the format is transferred from the server to the hard drive. The rasterization module should process the input file to generate a new bit map before starting the initial scan or after the parameter changes. When the bit map is stored in the RAM memory 1 of the processing unit, the processing unit reads the bit map from its ram. This process is similar for architectural options A, B, and C. Rasterizer 疋 is implemented using FPGA #术. The logic will be similar to #for the instant thumbtization option. Inline solutions are lighter than instant solutions.

63 201123254 多。因此,將需要較少的邏輯格。對於FPGA解決辦法,存 在對於數據流的二個選項。在圖47中,顯示數據流,其中, FPGA將其輸出直接儲存在PU-RAM。假使柵格化的邏輯是 結合在如同處理單元的同個FPGA,此解決辦法是適當的。 在此例,構件共用相同的記憶體控制器。根據圖45的處理 圖,處理是能夠平行執行。然而,潛在的干擾是將FPGA分 離的理由。另一個可能性是在圖48所顯示,其中,節點CPU 將負貴從FPGA提取結果且將其儲存在PU-RAM。在圖49 中,顯示在主機與GPU之間的通訊。主機將程式(核心)與 數據儲存在GPU的DRAM十且將程式觸發。多處理器從 DRAM提取其需要的數據且將結果寫回到DRAM。在總作 業完成時,主機將從GPU的DRAM提取數據。在主機與 GPU之間的介面是典型為PCIe xl6匯流排且在數據轉移中 涉及DMA。當使用標準GPU硬體,在CPU節點與GPU卡 之間的介面是PCI-Express/16。GPU的内部架構(參閱:圖 51)顯示其完全集中在平行性。此特定GPU含有30個多處 理器及每個多處理器為8個分緒處理器。此相加起來為240 個分緒處理器。多處理器採用單指令多數據(SIMD,Single Instruction Multiple Data)型態且對其8個分緒處理器使用 晶載的(快速)共用記憶體。為了利用GPU架構的性能,其 任務是分成多個平行任務。桃格化任務是由二個子任務所 組成:轉列與遞色。 轉列任務的性質是在於相當容易平行化。將掃描線或 甚至像素轉列可視為獨立的處理。遞色任務的性質是較為 64 201123254 串行’因為量化誤差是以二個方向傳播(在遞色移動方向的 相同線上且到下條線)。然而,當僅在一個方向遞色,遞色 是沿著對角線而平行化。將下條線遞色應延遲一或二格以 正確處理前條線的量化誤差。 使用GPU的缺點包括:Gpu不便宜;在執行時的可觀 的功率消耗(例如:TDP=2()() w);以及,產生對於制衡運用 其功率的GPU的平行碼不是普通的任務。 卜多核心CPU解決辦法:當使用強大的多核心CPU作為 節點CPU,節點CPU將能夠執行柵格化任務。圖52顯示對 此組態的典型數據流。CPU從硬碟讀出向量輸人數據⑺。 CUP將實行柵格化任務且將位元映射儲存# pu_RAM⑷。 當掃描時,處理單元從PU_RAM讀出位元映射(5卜 缺點包括.處理器的費用;可觀的功率消耗(Intel Core2 Ex^eme四核心處理器:TDp=13() w);以及,相當低度的 平行吐(用於lntel c〇re2四核心處理器的4個核心)。 對於線内栅格化,不同的解決辦法是可行的。然而, 線内柵格化顯露一些共同的特徵:pu_RAM尺寸。如同對於 離線拇格化,線内柵格化要求位元映射儲存在PU-RAM。架 構選項B需要小的像素尺寸(例如:2〇〇nm,參閱附錄ai) 吊要儲存大約61 G位元組的(未壓縮)位元映射數 、十於架構選項c,使用較大的像素(例如:3 5〇 nm)。對 於3.5〇 rnn像素,20 G位元組將是適當的。RAM載入時間。 1 ;此解决辦法’假設僅有向量輸入數據是儲存在磁碟上 寸8.5 GB)。每當需要新的位元映射,向量輸入數據是 65 201123254 從磁碟所讀出且經柵格化以及被儲存在pu_ram。在此情 形:磁碟數據速率似乎不是瓶頸。對於此解決辦法的瓶頸 將是柵格化器。其性能是取決於多個因素且無法容易作預 測。替代方式將是在稍早階段來實行柵格化。位元映射可 錯存在PU-RAM或是於磁碟。將t間的位元映射儲存於磁 碟具有缺點在於,將具有對於載入時間的明顯瓶頸(參閱: 架構選項A)。 磁碟載入時間.對於新掃描的向量輸入數據必須從词 服器所下載。祠服器將顯然是對於通訊的瓶頸。對於改良 磁碟載入時間的選項是將從飼服器到節點的頻寬增大或是 將在伺服器上的位元映射數據壓縮。在磁碟儲存單元上儲 存10個版本的位元映射將必需85 GB的儲存容量。改良可 靠度(及讀取性能)提示使用鏡射組態(RAID 1}且使用二個 1 00 GB的磁碟。 假設主要演算法是大程度地平行化,在cup與之 間的概略.性能比較是基於以下特徵而作出:intel cpu核心 是以因數5而勝過分緒(thread)處理器;“化丨cpu含有4個 核心;及,GPU含有20個分緒處理器。 再假設平行性的全利用,性能比率(Intel:Gpu)的要點是 四核心:GPU=(4*5):240=1:12。實際上,數個因素將降低此 ‘‘理想”比率》因素是:執行成本的差異(對於此品牌的 GPU,整數除法的代價相當高);平行性程度;可將平行碼 寫入的程度;在限量的局部記憶體中可執行多少個分緒; 因為單指令多數據(SIMD)處理器的使用。在SIMD群組中 66 201123254 典型具有8個分緒處理器。此意指執行途徑因為總是(串行) 執行分支的二侧而擴展。 另一方面,像是Intel處理器之多核心的解決辦法使用 共用的快取記憶體。視數個因素而定,當多個核心為現用 時,每個核心的性能將降級。在此章節,作出其使用— CPU之柵格化(轉列與遞色)性能的估計。 為了估計性能,已經以C++來實施轉列與遞色模組。 僅使用C + +的00特徵,而不是任何性能關鍵指令,像是: 新、刪除或像是串列或佇列的任何先進數據結構。64*1000 nm格是使用為用於轉列與遞色的單元。藉由比較向量格式 輸入與位元映射輸出而用視覺確認的是:轉列與遞色是如 所預期。Visual C++ 2008編譯器已經使用為致能對於速度 的最佳化。 用於轉列的演算法是掃描線方式。使用現用遶緣表格 以維持其相交至少一條掃描線(像素線)的邊緣組。使用像素 尺寸是3.5nm (架構選項C)。儘管指定最多為64個邊緣, 每格使用52個(81%)為合理的平均值。 為了測量,已經選擇具有現代CPU的機器。cpu是以 2.14 GHz執行的Core 2 Du〇 (64〇〇)且具有執行視窗作業 系統的2GB的RAM。 使用的輸入向量格式是在格中的一組閉合多邊形的規 格》劑量格是被省去’但是處理納入γ相依的劑量因數。 對於轉列在y方向的移位總是G,但是演算法納人對 線相依移位值的作業。 '和田 67 201123254 碼的最佳化是藉由測量瑪改良所進行。正規的剖析器 是因為其有限的時間解析度而不適用。&而,已經使用在 Wni 32 API 中的 “QueryPerf〇rmanceC〇unter” 。此計數器 使用以ns解析度的CPU時間戳記計數器。碼是已經基於63 201123254 More. Therefore, fewer logic cells will be needed. For the FPGA solution, there are two options for the data stream. In Figure 47, a data stream is shown in which the FPGA stores its output directly in the PU-RAM. This solution is appropriate if the rasterized logic is combined with the same FPGA as the processing unit. In this case, the components share the same memory controller. According to the processing diagram of Fig. 45, the processing can be performed in parallel. However, the potential interference is the reason for separating the FPGA. Another possibility is shown in Figure 48, where the node CPU will take the negative results from the FPGA and store it in the PU-RAM. In Figure 49, the communication between the host and the GPU is shown. The host stores the program (core) and data in the GPU's DRAM and triggers the program. The multiprocessor extracts the data it needs from the DRAM and writes the result back to the DRAM. When the total job is completed, the host will extract data from the GPU's DRAM. The interface between the host and the GPU is typically a PCIe xl6 bus and involves DMA in the data transfer. When using standard GPU hardware, the interface between the CPU node and the GPU card is PCI-Express/16. The internal architecture of the GPU (see: Figure 51) shows that it is completely focused on parallelism. This particular GPU contains 30 multiprocessors and 8 multiprocessors per multiprocessor. This adds up to 240 processor pairs. Multiprocessors use the Single Instruction Multiple Data (SIMD) type and use crystal-loaded (fast) shared memory for their eight processor. To take advantage of the performance of the GPU architecture, its task is to split into multiple parallel tasks. The peachization task consists of two subtasks: transition and dithering. The nature of the task of transitioning is that it is fairly easy to parallelize. Scanning lines or even pixel-forwarding can be treated as separate processing. The nature of the dithering task is more than 64 201123254 serial 'because the quantization error propagates in two directions (on the same line in the direction of dithering and to the next line). However, when dithering in only one direction, the dithering is parallelized along the diagonal. The next line should be decremented by one or two squares to correctly handle the quantization error of the previous line. Disadvantages of using GPUs include: Gpu is not cheap; considerable power consumption at execution time (e.g., TDP = 2()() w); and it is not a common task to generate parallel codes for GPUs that use their power. Budu core CPU solution: When using a powerful multi-core CPU as the node CPU, the node CPU will be able to perform rasterization tasks. Figure 52 shows a typical data flow for this configuration. The CPU reads the vector input data (7) from the hard disk. The CUP will perform the rasterization task and store the bit map # pu_RAM(4). When scanning, the processing unit reads the bit map from the PU_RAM (5 disadvantages include: processor cost; considerable power consumption (Intel Core2 Ex^eme quad core processor: TDp=13() w); and, quite Low parallel spit (for 4 cores of the lntel c〇re2 quad core processor.) For inline rasterization, different solutions are possible. However, inline rasterization reveals some common features: pu_RAM size. As for offline flickering, inline rasterization requires bit maps to be stored in PU-RAM. Architecture option B requires a small pixel size (eg 2 〇〇 nm, see appendix ai). The number of (uncompressed) bit maps of the g-bytes, ten is less than the architectural option c, using larger pixels (eg, 3 5 〇 nm). For 3.5 〇rnn pixels, 20 G bytes will be appropriate. RAM load time. 1 ; This solution 'assumes that only vector input data is stored on the disk 8.5 GB. Whenever a new bit map is required, the vector input data is 65 201123254 read from the disk and rasterized and stored in pu_ram. In this case: the disk data rate does not seem to be a bottleneck. The bottleneck for this solution will be the rasterizer. Its performance depends on many factors and cannot be easily predicted. An alternative would be to implement rasterization at an earlier stage. The bit map can be misplaced in PU-RAM or on a disk. Storing the bit map between t on a disk has the disadvantage of having a significant bottleneck for load time (see: Architecture Option A). Disk load time. The vector input data for the new scan must be downloaded from the vocabulary. The server will obviously be the bottleneck for communication. The option for improved disk load time is to increase the bandwidth from the feeder to the node or to compress the bit map data on the server. Storing 10 versions of the bit map on the disk storage unit will require 85 GB of storage capacity. Improved reliability (and read performance) prompted the use of a mirrored configuration (RAID 1} and the use of two 100 GB disks. Suppose the main algorithm is largely parallelized, between the cup and the outline. Performance The comparison is based on the following characteristics: the intel cpu core outperforms the thread processor by a factor of 5; "the cpu cpu contains 4 cores; and the GPU contains 20 distinct processors. Again assume parallelism. The full use, performance ratio (Intel: Gpu) is the four core: GPU = (4 * 5): 240 = 1: 12. In fact, several factors will reduce this ''ideal' ratio" factor: execution cost The difference (for this brand of GPU, the cost of integer division is quite high); the degree of parallelism; the degree to which parallel codes can be written; how many threads can be executed in a limited amount of local memory; SIMD) processor usage. In the SIMD group 66 201123254 typically has 8 processor. This means that the execution path is extended because it always (serial) executes the two sides of the branch. On the other hand, like Intel Multi-core solution for processors Shared cache memory. Depending on several factors, when multiple cores are active, the performance of each core will be degraded. In this section, make use of it - CPU rasterization (transfer and dither) Estimation of performance. In order to estimate performance, the transfer and dither module has been implemented in C++. Only use the C++ 00 feature instead of any performance critical instructions like: new, delete or like serial or 伫Any advanced data structure of the column. The 64*1000 nm grid is used as a unit for indexing and dithering. By comparing the vector format input with the bit map output, it is visually confirmed that the transition and dithering are as As expected, the Visual C++ 2008 compiler has been used to enable speed optimization. The algorithm used for the conversion is the scan line method. Use the current edge table to maintain its intersection with at least one scan line (pixel line). Edge group. The pixel size is 3.5nm (Architecture Option C). Although up to 64 edges are specified, 52 (81%) per cell is a reasonable average. For measurement, a machine with a modern CPU has been selected. cpu At 2.14 GHz The Core 2 Du〇 (64〇〇) line has 2GB of RAM for executing the Windows operating system. The input vector format used is the specification of a set of closed polygons in the grid. The dose grid is omitted 'but the processing is included in γ Dependent dose factor. For the shift in the y direction is always G, but the algorithm is based on the operation of the line dependent shift value. 'Hetian 67 201123254 code optimization is carried out by measuring the horse improvement. The regular parser is not applicable because of its limited temporal resolution. & However, it has been used in the Wni 32 API "QueryPerf〇rmanceC〇unter". This counter uses a CPU timestamp counter in ns resolution. The code is already based

QueryPerformanceCounter的結果而由人工所最佳化。在最 佳化之後,負載是以下列的分數部分而分佈在應用程式: 轉列55%、遞色27%與輸入處理18%。 所述機器的單核心可在8.7秒内執行職_個格轉列 循環。此轉化為每秒執行U,494個循環n用雙核心 的執行是幾m線性方式所定標(87單核心、⑽咖個格 ->8.8雙核心200,_個格)。全條帶是由22〇〇,_個格所 構成。 因此,一個核心將在i個條帶耗費194秒。假設線性 定標,此意指的是:當使用7.5個核心,14個條帶是在6 分鐘内轉歹丨J。Core2 Duo (6400)不再是Intel cpu的最頂級 型號。因此,將應當以某個因數(例如:3〇%)提高核心性能。 另-方面’知道的是:使用多個核心絕不可能以線性方式 定標。假設此二個因素將彼此抵消。 性能結果是下列者的總計:使用的演算法;標度(尺寸 轉列格);演算法的完整性;使用的特定最佳化;在最佳化 所耗用的總時間;相較於原型而在實際组態所使用的快取/ 。己憶體,及,將在最終組態所使用的cpu的相對性能。 如對於選項A所論述,將其被保持在pu_RAM的影像 壓縮是可能的。柵格化器應將其經遞色或灰階影像壓縮, 68 201123254 而處理單元FPGA應將其解壓縮且選用式遞色。架構B將 由壓縮與過取樣技術而實際獲益。不再有必要為每個通道 使用2個光纖。架構C已經使用相當大的像素尺寸且將僅 由壓縮而獲益。此意指較小的PU_RAM與較少的載入時間。 然而,應該將解壓縮邏輯附加到處理單元FPGA。然而,解 壓縮將具有在線内處理工作量的顯著影響。 選項D :即時爐格仆. 圖62顯示其使用即時栅格化的實施例。此是類似於圖 61的實施例,除了柵格化是在典型為硬體實行的即時處理 期間而更進-步在過程中實行。對於束位置校準、場域尺 寸凋整及/或場域位置調整的修正是在向量格式pss格式數 據上作出,且接著柵格化將此轉換為B/w位元映射。因為 是在向量數據上作出修正,纟乂與γ方向的全像素移位與 子像素移位均可被作成。 …圖53顯精於此架構的功能方塊。對於此選項,功能 單7L 3與4 (柵格化)是在執行期間為即時執行。 修正包括: θ •在χ與γ的像素移位(全像素與子像素)修正。參數 疋每個場域更新一次。 •每個子通道的劑量修正。參數是每個場域更新一次。 每個通道對於¥的定標修正。參數是每個場域更新 次〇 •媳滅器時序偏移修正。灸齡θ >加 〜。 I正參數疋母個晶圓掃描更新一 69 201123254 型串流器將使用此數據作為輸入。飞圖 圖型串流器蓋生Β/W位元映射。在t即時轉列與遞色, 姐疋映射。在轉列與遞色期間, 所有種類的修正。從BAA/位元映射,圖型串 位元框,將對於通道的所有其小束的數據多工傳輸且= 光纖將數據傳送到熄滅器晶片。 ° 兩要用於將數據串流到雷射的資源。 ::由二個步驟所構成:從記憶體得到數據且以邏輯 :八轉列到像素,歸因於子束排序而 重㈣序到框。第—個步驟可由向量數據的實際轉= 疋從憶_取轉列的像素數據所構成。 為了將向量數據轉列到像素’各個條帶是分為以向量 001的子條帶。對於(最大)500 nm的軟邊緣, 王子條帶數目是2000 + 5〇〇+5〇〇/62.5=42.8個子條 帶。各個子体恶曰士 ^ /、 — Hz操作,条帶^道中轉列。各個管道將以約 8個管道將因此產生近乎需要的5The result of QueryPerformanceCounter is optimized by humans. After optimization, the load is distributed across the application in the following fractional parts: 55% for reversal, 27% for dithering, and 18% for input processing. The single core of the machine can perform a job_frame transition cycle in 8.7 seconds. This translates to U, 494 cycles per second. The execution with dual cores is scaled by a few m linear methods (87 single core, (10) coffee grid - > 8.8 dual core 200, _ grid). The whole strip is composed of 22 〇〇, _ 格. Therefore, one core will take 194 seconds in i strips. Assuming linear scaling, this means that when 7.5 cores are used, 14 strips are converted to J within 6 minutes. The Core2 Duo (6400) is no longer the top model of the Intel cpu. Therefore, core performance should be improved by a factor (for example: 3〇%). Another aspect] knows that using multiple cores is never possible to scale in a linear manner. Assume that these two factors will cancel each other out. The performance results are the sum of: the algorithm used; the scale (size to column); the integrity of the algorithm; the specific optimization used; the total time spent optimizing; compared to the prototype And in the actual configuration used by the cache /. The memory, and the relative performance of the CPU that will be used in the final configuration. As discussed for option A, it is possible to compress the image that is held in pu_RAM. The rasterizer should compress its dithered or grayscale image, 68 201123254 and the processing unit FPGA should decompress it and select the dither. Architecture B will actually benefit from compression and oversampling techniques. It is no longer necessary to use 2 fibers for each channel. Architecture C has used a fairly large pixel size and will benefit only from compression. This means a smaller PU_RAM with less load time. However, the decompression logic should be attached to the processing unit FPGA. However, decompression will have a significant impact on the in-line processing workload. Option D: Instant grid servant. Figure 62 shows an embodiment of its use of instant rasterization. This is an embodiment similar to that of Figure 61, except that rasterization is performed further in the process during immediate processing, typically performed by hardware. Corrections for beam position calibration, field size grading, and/or field position adjustment are made in vector format pss format data, and then rasterized to convert this to a B/w bit map. Since the correction is made on the vector data, the 像素 and γ-direction full-pixel shift and sub-pixel shift can be made. ... Figure 53 shows the functional blocks of this architecture. For this option, function sheets 7L 3 and 4 (rasterized) are executed for immediate execution during execution. The corrections include: θ • Pixel shifting (full pixel and subpixel) corrections in χ and γ. Parameters 更新 Each field is updated once. • Dose correction for each subchannel. The parameters are updated once per field. Each channel is corrected for the calibration of ¥. The parameter is the update of each field. • The annihilator timing offset correction. Moxibustion age θ > plus ~. I positive parameter 疋 mother wafer scan update one 69 201123254 type streamer will use this data as input. Fly chart The graphic streamer covers the Β/W bit map. In t instant transfer and dithering, sister mapping. All kinds of corrections during the transition and dithering. From the BAA/bit mapping, the pattern string box will multiplex the data for all of its small bundles of channels and = the fiber will transfer the data to the extinguisher wafer. ° Two resources to be used to stream data to the laser. :: consists of two steps: getting the data from the memory and logically: eight-column to the pixel, due to the sub-beam ordering (4) to the box. The first step can be composed of the actual data of the vector data = 疋 from the memory data of the recalled column. In order to transfer vector data to pixels 'each strip is divided into sub strips with vector 001. For a soft edge of (maximum) 500 nm, the number of prince strips is 2000 + 5 〇〇 + 5 〇〇 / 62.5 = 42.8 sub-strips. Each child is a bad sorcerer ^ /, - Hz operation, strips ^ road transfer column. Each pipe will have about 8 pipes and will therefore produce nearly 5

Cjbit/s 〇 在管道的7§ @ .. 、 只麵,使用FIFO以越過從記憶體時脈域到處 理時脈域的時脈敁 。。 叮脈埠邊界。此FIFO亦作為中間的儲存緩衝 器,由於記传& ^ '員寬必須在多個條帶共用。FIFO含有藉角 數據與劑量映 、町數據二者。轉列應用可隨機定址在FIFO底 部内。FIFO必笮人士 — 4含有至少三個區塊的數據以允許對於記慎 體仲裁器的一此穸從 * 〜 ‘ %弛。各個區塊的數據含有272個位元組。 3個區塊的數擔 a像〜816個位元組。標準的區塊隨機存取記憶 70 201123254 體3有1 8 K位元的數據=2 κ位元組的數據。此意指的是: 從數據尺寸的觀點,各個區塊隨機存取記憶體(bI〇ckram)可 適用於3個子條帶管道 '然而’從數據可用性的觀點,各 個管道應在頂端使用纟本㈣區塊隨機存取記憶體。 各個子條帶管道需要一些内部FF與Ιυτ以供處理。假 又可用的FF與LUT的數目及blOCKRAMS的需要數目 是超過必要的數目。 重新排列像素以供多束曝光。 在子條帶管道的底端,或若為在記憶體的位元映射數 據而直接在記憶料之下,數據是儲存在另—個fif〇。此 FIFO必須3有至少為245線的數據,其是用〖Μ以個 小束將像素寫入所需要。各線將(最多)含有删nm/2 ⑽:15000個像素。15000個像素*245線=367,5〇〇位元。此 等於20個區塊隨機存取記憶體,其化整為32個區塊隨機 存取記憶體以利於處理。 定框器/多工器從此等32個區塊隨機存取記憶體讀取 且形成適用於傳送到雷射的框。此等框是儲存在另一個 FIFO區塊隨機存取記憶體,其為必要以作為在黯時脈域 間的非同步邊界並且作為彈性的儲存單元。 基於格的輸入格式 向量U法是典型為用於產生圖型數據,諸如m 或〇ASIS格式。如上所述,不同的操作模式可能用於帶電 拉子微影系統。上述的—個模式是即時栅格化模式,豆中 以基於向量的輸人格式之圖型數據是被㈣且由處理單元Cjbit/s 〇 In the pipeline's 7§ @.., only the FIFO is used to cross the clock from the memory clock domain to the processing clock domain. . The boundary of the pulse. This FIFO also acts as an intermediate storage buffer, since the record & ^ 'member width must be shared across multiple stripes. The FIFO contains both the borrowing angle data and the dose mapping and town data. The transition application can be randomly addressed within the FIFO base. The FIFO must have data containing at least three blocks to allow for a 穸 对于 ‘ 对于 对于 。 。 。 。. The data for each block contains 272 bytes. The number of three blocks is like ~816 bytes. Standard block random access memory 70 201123254 Volume 3 has 1 8 K bits of data = 2 κ bytes of data. This means: From the data size point of view, each block of random access memory (bI〇ckram) can be applied to three sub-strip pipes 'however' from the point of view of data availability, each pipe should be used at the top (4) Block random access memory. Each sub-strip pipe requires some internal FF and Ιυτ for processing. The number of available FF and LUTs and the number of blOCKRAMS required are more than necessary. Rearrange the pixels for multiple exposures. At the bottom end of the sub-strip pipe, or if the data is mapped directly in the memory of the bit in the memory, the data is stored in another fif. This FIFO must have at least 245 lines of data, which is required to write pixels in small bundles. Each line will (maximally) contain nm/2 (10): 15000 pixels. 15,000 pixels * 245 lines = 367, 5 〇〇 bits. This is equal to 20 block random access memories, which are rounded into 32 block random access memories for processing. The framing/multiplexer reads from these 32 block random access memories and forms a frame suitable for transmission to the laser. These blocks are stored in another FIFO block random access memory, which is necessary as a non-synchronous boundary between the clock regions and as a flexible storage unit. Lattice-based input format The vector U method is typically used to generate graphical data, such as the m or 〇ASIS format. As mentioned above, different modes of operation may be used for the charged pull lithography system. The above-mentioned mode is an instant rasterization mode, in which the pattern data of the vector-based input format is (4) and is processed by the processing unit.

71 201123254 (諸如:FPGA)所即時處理(即:對於晶圓的—組場域的圖型 數據疋當该組場域的掃描發生時而至少部分被處理) 基於格的輸入格式可被用於此即時栅格化模式。輸入 格式的一個實施例描述了二個方面:特徵佈局與劑量率。 特徵佈局是使用基於格的方式所描述,對於即時fpga轉列 與遞色為適合且為最佳化。劑量率是由涵蓋所有特徵區域 (例如:場域)的固定尺寸網格所描述。 用於圖型數據之基於格的格式可產生其具有較可預測 尺寸的數據組,有利於將圖型數據串流到微影系統以供即 時及/或硬體處理。以向量格式的圖型數據提供每格為較不 可預測的尺寸。可使用以位元映射格式的圖型數據,但將 須作壓縮以供從預處理系統轉移到微影系統。位元映射數 ,的壓縮量可能每格為可觀地變化,視存在格中的特徵而 定。將此類壓縮數據串流到微影機器且然後將數據解壓縮 造成未壓縮數據之不可預測的傳輸率。 預先知道每格最多含有多少數據(位元)及若圖型數 被壓縮所達成的何等壓縮因數(例如:當相較於若以位元映 射格式所編碼的總尺寸)是有利的。基於格的格式是設計具71 201123254 (such as: FPGA) is processed on the fly (ie: the pattern data for the wafer-group field) is at least partially processed when the scan of the group field occurs. The grid-based input format can be used This instant rasterization mode. One embodiment of the input format describes two aspects: feature layout and dose rate. The feature layout is described in a grid-based manner and is suitable and optimized for instant fpga transitions and dithering. The dose rate is described by a fixed size grid covering all feature areas (eg, field). A grid-based format for pattern data can produce a data set having a more predictable size, facilitating the streaming of pattern data to the lithography system for immediate and/or hardware processing. Pattern data in vector format provides a relatively unpredictable size per cell. Pattern data in a bitmap mapping format can be used, but will be compressed for transfer from the preprocessing system to the lithography system. The amount of compression of the number of bit maps may vary appreciably per cell, depending on the features in the cell. Streaming such compressed data to a lithography machine and then decompressing the data results in an unpredictable transmission rate of uncompressed data. It is advantageous to know in advance how much data (bits) are contained in each cell and what compression factor is achieved if the pattern number is compressed (e.g., when compared to the total size encoded in the bit map format). The grid-based format is a design tool

Sit:。此為期望’因為提供保證基於格的圖型數據 〜D某個尺寸的記憶體(在設計時所選取的記憶體大 小)’其為實質小於未壓縮位元映射數據的大小。對於 通:壓縮演算法(諸如:ZIP)所壓縮的位元映射無法提供此 U °亦為期望’因為提供保證基於格的圖型數據可 個最大時間奮内Μ , 系 里内轉換為位元映射,此在如果發生即時柵格 72 201123254 化時是重要的。 甚者’若涵蓋位元映射場域的某個 從以基於格的格式編碼的‘1 缩權案=特定格必須 此格是編碼在槽案中的何處(無須如同將即知道 GDSII格式(其中特 检案疋以例如 此區域)。特徵疋隨機存在於槽案中)的情形而尋找 基於格的格式是因為其按照格所排列而亦 、到微影系統,且將圖型數據以待掃描的格順、用於 相較於向量格式而相當直接簡單。^的格順序排列是 =格的袼式亦由於僅將在各格中的特 、·爲碼而存在附加量的“㈣”。此 < 置 提供在場域中的特徵的絕對位置。㈣特徵^ = ^ 受限於格的尺寸)且因此相較於關於場域^ 置而祐要定義較少個位元。 :於描述特徵佈局之基於格的輸入格式之此的Sit:. This is desirable because it provides a guaranteed size-based pattern data ~D memory of a certain size (the size of the memory selected at the time of design)' which is substantially smaller than the size of the uncompressed bit map data. For the pass: compression bitmap algorithm (such as: ZIP) compressed bit map can not provide this U ° is also expected 'because the guarantee of grid-based graph data can provide maximum time, the system converts to bit Mapping, which is important if instant grid 72 201123254 occurs. Even if 'something from the bit map field is encoded from the cell-based format, the '1 rights-restriction=specific cell must be coded in the slot case (no need to know the GDSII format as if The special inspection case, for example, in this area), the feature 疋 randomly exists in the slot case, finds the grid-based format because it is arranged according to the grid, to the lithography system, and the pattern data is waiting The smoothness of the scan is quite straightforward compared to the vector format. The order of the lattices of ^ is the formula of the = lattice, and there is an additional amount of "(four)" because only the special code in each cell is a code. This < sets the absolute position of the feature provided in the field. (4) The feature ^ = ^ is limited to the size of the cell) and therefore defines fewer bits than the field. : in the grid-based input format describing the feature layout

0.2%步進尺寸 LSI. 73 201123254 對於特徵佈局格式,最小特徵間 特徵間距是本質上卩P & & 里资夢数斌小 間距的距離内,二:徵密度。意指的是:在最小特徵 僅可發生二次。㈣變(例如:抓>咖或〇ff->ON) 在圖67中,gg - , ..肩不一個實例的圖型佈局,其具有 小特徵間距〇>)的特徵(較淺色㈣域^ Μ㈣從最 特徵描述的重要結果是在於,64χ6 應最多描述4個稜6木, 调褥夕i格 格索引提供其基準位置…行傲轉列 位置來描述。 在轉列格之内的特徵可使用相對 在_格之内的(部分w徵可由其棱角或是由直線所 *述。可將線角度限制為45度的倍數,將向量 僅有Μ固可能方向,如在圖69所說明。八個方位碼是對: 如圖69所示的各個可能方位所作分配。 圖68說明稜角(c〇rner)概念。格(ceU)是顯示為含有(在 右側的)特徵的稜角與(在左側的)特徵邊緣的直線。稜角與 直線均視為“稜角”。棱角A是由a的位置(例如:^、 γΑ)與(例如:使用方位碼Edge卜2、Edge2=4所定義的)二八個 向量所定義。根據定義,以順時針方向從Edgel到Ehd 的移動方向中的區域是現用區域。以相同方式,直線是由 ‘虛擬稜角”點B (例如:χΒ ' γΒ)與二個邊緣(例如: Edgel=4、Edge2=0)所描述。此虛擬稜角的位置是在其定義 的線上的任意點。再者,從Edgel到Edge2的順時針移動 方向中的區域是現用區域。 201123254 在格之内,相同特徵的稜角應為匹 方形特徵,1編踽盔^。 ,貝丁間早 夫 …4碼為在64nmx64nm的格之中的4個匹配 衩角曰在圖70的左側的表格顯示其完全描述特徵的參數。 稜角是由其稜角座標(X,γ)所描述,且邊緣描述根據在圖69 所疋義的方向之稜角方位。從稜角座標與方位碼,可確定 在圖70的所有稜角是描述單一個特徵。 …對於在FPGA (或其他型式的硬體處理器)的處理,具有 固疋尺寸的數據結構是有利的。此使得較容易將格描述在 記憶體中定址且有助於使FPGA邏輯保持較簡單。 圖71顯示其由格中的稜角所描述之較複雜特徵形狀的 實例。亦使用沿著45度與_45度方位的直線以定義所顯示 的特徵❶ 位的特微邊綾 最小特徵間距確保在格中的稜角的最大數目。當考慮 具有在45度方位邊緣的特徵,格的最大尺度是其對角線: 其長度是等於格尺寸乘以對於方形格$ 2的+方根(例如: 對於64 nm*形格為64 χ ^)。當最小特徵間距是小於此 $角線長度,風險在於每個格可能存在超過4個稜角。此 情況是顯示在圖72中。在左側,圖例顯示方形特徵的規則 網格,方形特徵具有64 nm的間距、定位在64nm的格中、 且每格為具有4個棱角(棱角是由小圓圈所指出)。在右側, =形特徵的網格是被旋轉45度。強調的稜角顯示六個棱角 是存在中間的格上。 為了解決此問題,可應用數個解決辦法:0.2% step size LSI. 73 201123254 For the feature layout format, the minimum feature-to-feature spacing is essentially within the distance of the distance between the P && It means that only a minimum of two can occur in the minimum feature. (4) Change (for example: grab > coffee or 〇 ff-> ON) In Fig. 67, gg - , .. shoulders are not an example of a pattern layout, which has a small feature spacing 〇 >) (lighter) The color (four) domain ^ Μ (four) from the most characteristic description of the important result is that 64 χ 6 should describe up to 4 ribs 6 wood, 褥 褥 i i grid index provides its reference position ... row proud position to describe. Within the transition grid The features can be used relative to the _ cell (the partial w sign can be described by its angular or by a straight line. The line angle can be limited to a multiple of 45 degrees, and the vector can only be tamped in the possible direction, as in Figure 69. The eight orientation codes are pairs: the assignment of each possible orientation as shown in Figure 69. Figure 68 illustrates the concept of the angle (c〇rner). The grid (ceU) is shown as containing the edges of the feature (on the right) A straight line at the edge of the feature (on the left). Both the corner and the line are treated as "edges." The corner A is defined by the position of a (for example: ^, γΑ) and (for example, using the orientation code Edge 2 and Edge 2 = 4). Defined by two or eight vectors. By definition, in the direction of movement from Edgel to Ehd in a clockwise direction The area is the active area. In the same way, the line is described by the 'virtual edge' point B (for example: χΒ ' γΒ) and the two edges (for example: Edgel=4, Edge2=0). The position of this virtual corner is Any point on the line defined by it. Furthermore, the area in the clockwise direction of movement from Edgel to Edge2 is the active area. 201123254 Within the grid, the corners of the same feature should be square, 1 踽 helmet ^. , Bedin's morning husband... 4 yards for the four matching corners in the 64nm x 64nm grid. The table on the left side of Figure 70 shows the parameters that fully describe the features. The corners are defined by their angular coordinates (X, γ). Description, and the edge description is based on the angular orientation of the direction defined in Figure 69. From the angular coordinates and the azimuth code, it can be determined that all the corners in Figure 70 are described as a single feature. ... for FPGAs (or other types of hardware) Processing of the processor, it is advantageous to have a fixed-size data structure. This makes it easier to address the cell description in memory and to help keep the FPGA logic simpler. Figure 71 shows the corners in the cell. Description of the complex An example of a miscellaneous feature shape. A straight line along the 45-degree and _45-degree azimuth is also used to define the characteristic feature of the displayed feature. The minimum feature spacing ensures the maximum number of corners in the cell. The characteristic of the azimuthal edge, the largest dimension of the lattice is its diagonal: its length is equal to the grid size multiplied by the square root of the square lattice $ 2 (for example: 64 * ^ for 64 nm * lattice). The feature spacing is less than this $angle length, the risk is that there may be more than 4 corners per cell. This is shown in Figure 72. On the left, the legend shows a regular grid of square features with a square feature with a 64 nm pitch Positioned in a 64nm grid, and each grid has 4 corners (the corners are indicated by small circles). On the right side, the grid of = shaped features is rotated 45 degrees. The emphasized corners show that the six corners are in the middle of the grid. To solve this problem, several solutions can be applied:

L S 75 201123254 •對於仏45度線’指定較大的最小特徵間距,至少是 等於格對角線的長度(例如:對於64 nm方形格為Μ X V^nm)。 •減小格尺寸,使得格對角線是等於(或小於)最小特徵 間距(例如:對於64nm的最小特徵間距為^⑽⑽)。 •允許每個格的較大數目個(例如:六個)稜角。 •允許每個格的可變數目個稜角。 在以下說明中’假設為上述的第一個選項。 鄰近效鹿铬 需要鄰近效應修正以改良在將晶圓處理後的圖型(尤其 是稜角)。鄰近效應修正可藉由將幾何形狀或劑量局部微調 而對付。假設鄰近效應修正是藉由幾何形狀變化所進行, 使用環繞稜角的小襯線(serif),其典型為具有㈣的長度。 在圖73中,顯示具有襯線為附加到其一些稜角的二個 特徵的實例。每個稜角是較佳為具有選項以將襯線納入在 特定稜角。如在圖73所示,此類技術的一個重要結果是在 於:在-格的稜角所定義的襯線(例如:圖中的格2的特徵 B襯線)可部分在相鄰格中轉列(例如:特徵B襯線為延伸到 格3)。或者,具有其所有稜角在一格中的特徵(例如:在格 1的特徵A)需要在相鄰格中轉列其襯線的部分者(例如:在 格2的特徵A襯線)。 為了對付此’不同方式是可能的: •與相鄰隔共用關於襯線棱角的資訊。 •一旦外部襯線稜爲具有在將格轉列上的影響,就將 76 201123254 額外資訊封裝(複製)在格定義中。 •將襯線描述為正常稜角。此解決辦法明顯增加每格 (極度可變)的稜角數目。 IL量網格(grid、 除了特徵幾何形狀之外,劑量率是重要的系統參數, 在微尺度上為關係重大。劑量資訊可藉由提供劑量網格所 描述,劑量網格含有每格的一個劑量率(劑量資訊可用其他 方式所提供,例如··藉由對於各個特徵關聯一劑量值卜格 尺寸疋典型為等於或小於期望的臨界尺寸(CD)。理論上, 劑量網格是無關於轉列格網格。 用於處理該二種網格的二個選項為: •將二種網格定義而無關於彼此。 •將二種網格對準且選用式整合。 對於FPGA處理,將劑量網格與轉列格網格結合是可能 有利的。劑量網格尺寸是典型為小於轉列網格的尺寸。^ 可例如藉由將(3x3) 9個劑量格嵌入在轉列格之内而達成。 灰階值可為以0.2%步進而在1〇〇%與5〇%之間變化。因此, 每個劑量格是需要8位元。 然而,結果是連結二個獨立的概念。每當間距值改變, 亦具有對於劑量格尺寸的必然結果。 像素網枚 像素格尺寸與位置是較佳為彈性。像素可為非方形, 但在一個條帶/通道内將總是具有相同的尺度。像素可為由 (最差情形)4個轉列格所轉列。因為子像素的移位,每列可 77 201123254 使用不同(Y方向)對準。 _入格式規格 以下的規格是對於一個實施例所提供。轉列格包含其 含有尚達4個稜角之64乘64 nm的區塊與額外資訊。邊緣 是在稜角中所起始的向量,Edge丨或Edge2,且從Edge 1到 Edge2的順時針角度定義現用侧。稜角是在格之中的特徵的 稜角。當一線為橫越過格而無實際稜角,稜角可具有18〇 度的角度。假設每個轉列格最多為4個稜角❶ 例的稜角數據的規格是楛供在下+ : 名稱 位元數目 基本原理 X位置 8 64 nm®〇.5 nm Y位置 8 64 nm®0.5 nm Edgel方向 3 8個方向 Edge2方向 3 7個方向可能,等於Edge 1是 特例:項目未使用 襯線尺寸 5 0意指截止(〇ff) 總計 27 為了從其場域值來計算襯線尺寸,可使用不同策略, 例如:查表’丨中,場域值是使用作為在預先定義表格中 的索引,或是藉由計算(例如··襯線尺寸=值*0.5 nm,因此 ,、範圍為〇 . 15 5@G 5nm,假設為正襯線尺寸)。 對於一個實施例的轉列格數據的規格是提供在下表: 78 201123254 名稱 每個單元的位元 單元數目 總位元 稜角 27 4 108 劑量映射3x3 8 9 72 總計 180 據 下表是總結當使用上述格式時的數據量。對於此數 假設是不具有縫綴。 -- - --•m ,〜 f ^ 名稱 公式 結果 每個格的位元組數目 180位元/8 23位元組 每個條帶的格數目 3 3 mm/64nm*2jLtm 16E6 格 /64nm ^的格數目 13000*16E6 209E9 格 芝個場域的位元組數目 209E9*23 5T位元組 個條帶的位元組數目 16E6*23 3 70Μ位元組 可能存在數據壓縮的機會。舉例來說,預期的是:多 個格是含有少於4個稜角,且對於所有劑量格,劑量率= 月t*為相同值0 疋義固定尺寸的數據結構將減輕FpGA設計(定址以及 載入)的任務,但是具有對於記憶體的後果。對於通訊以 (磁碟)儲存,可使用標準的墨縮技術來將數據_。此^ 使用記錄為填充相同值(例⑹:對於未使用稜角是均為' 而报有效。壓縮亦對於重覆值為有效,如同對於劑旦:零) 的類似值。 里映射 對於上述實施例的一些設計問題是: 79 201123254 ♦每格最多為4個棱角可能是不夠的; *在相鄰格中尋找襯線是在處理時間與記憶體為昂貴 且若是可能而應避免; *襯線可能為不同於預期者的形狀; •每格為固定個稜角是合意於硬體實施; •每格為大的固定數目個稜角造成龐大的數據量; •每格為小的固定數目個棱角造成缺乏靈活性; •由資訊的理論觀點,將所有稜角編碼是過度資訊, 但是顯著為有利於在硬體中的實施; •稜角的解析度是較佳為0.25 nm而非0.5 nm; •僅將半數的稜角編碼可能是足夠的。 將較大區塊共同编石i 作為在大與小的固定數目個稜角之間的折衷方案,一 個可能性是限制對於較大數據區塊的最大棱角數目,例 如:在機械掃描方向為約較大16倍。假設的是,在此較大 區塊的一個區域中的局部最大棱角數目將由在此區塊的另 一個區域中的較少稜角數目所補償。 在最大棱角數目之高於4的上限是不合意的,歸因於 a己憶體使用的増加》然而,使用下限將不會涵蓋所有可能 情形。作為中間的解決辦法,考慮以下的方案:以較目前 格為大的區塊將數據編碼(例如:一次為i 6格的區塊),且 將稜角數目限制在該區塊内’其中’局部最大棱角數目可 為更高。在此方案中,襯線是如同棱角其本身所編碼,此 有助於實施。 80 201123254 為了實施此實施例,可對上述實施例作出以下的改變: •定義一個區塊,其在γ方向(偏轉方向)為62 5 nm且 在X方向(機械掃描方向)為nm; •格/區塊的Y尺寸是從64減小到62 5 ηηι。此具有二 個優點.16*62.5 = 1000 nm 且 62.5/0.25=250,其可為以 8 位元而有效率編碼; •费度圖可具有解析度為3 1 25 X 31 25 nm (1〇〇〇nm的 1/32); •稜角的最大數目是設定為每個區塊64個(平均每個 62,5 X 62.5 nm的格為4個稜角); •襯線是在數據内編碼,如同稜角其本身。 g下規格是對於此實施例所提供: 名稱 — 丨 1 值 轉列區塊 62.5乘1〇〇〇 nm的區塊,其含有64個棱角與 劑量資訊。 邊緣 在稜角起始的向量。Edgel或Edge2。從Edgei 到Edge2的順時針角度定義現用側。 棱角 在格中的特徵的稜角。假使一線為橫越過格而無 實際稜角,亦可能為具有角度180度的稜角。 假設每個轉列格最多為4個稜角。 對於此實施例的稜角數據的規格是提供在下表:L S 75 201123254 • Specify a larger minimum feature spacing for the 仏45 degree line, at least equal to the length of the diagonal of the grid (for example: Μ X V^nm for a 64 nm square). • Reduce the grid size so that the grid diagonal is equal to (or less than) the minimum feature spacing (for example: the minimum feature spacing for 64 nm is ^(10)(10)). • Allow a larger number (eg, six) of corners per cell. • Allow a variable number of corners for each cell. In the following description, 'the first option is assumed above. Adjacent effect deer chrome requires proximity effect correction to improve the pattern (especially the corners) after processing the wafer. Proximity effect correction can be counteracted by locally fine-tuning the geometry or dose. It is assumed that the proximity effect correction is performed by geometrical variations, using a serif that surrounds the corners, which typically has a length of (iv). In Fig. 73, an example in which the serif is attached to some of its corners is shown. Each corner is preferably provided with an option to incorporate the serif at a particular corner. As shown in Fig. 73, an important result of such a technique is that the serif defined in the corner of the lattice (for example, the characteristic B serif of the lattice 2 in the figure) can be partially rotated in the adjacent lattice. (For example: Feature B serif extends to grid 3). Alternatively, a feature having all of its edges in a cell (e.g., feature A in cell 1) requires a portion of its serif to be indexed in the adjacent cell (e.g., feature A serif in cell 2). In order to deal with this, different ways are possible: • Share information about the edges of the serifs with adjacent partitions. • Once the external serif edge has the effect of shifting the grid, the additional information is encapsulated (copied) in the grid definition. • Describe the serif as a normal corner. This solution significantly increases the number of edges (extremely variable) per division. The IL quantity grid (grid, in addition to the characteristic geometry, the dose rate is an important system parameter, which is significant on the micro scale. The dose information can be described by providing a dose grid, the dose grid containing one per grid Dose rate (dose information may be provided in other ways, for example by correlating a dose value for each feature with a dose size 疋 typically equal to or less than a desired critical dimension (CD). In theory, the dose grid is irrelevant The grid grid. The two options for working with the two grids are: • Define the two grids without regard to each other. • Align the two grids and select the integration. For FPGA processing, the dose is It may be advantageous to combine a grid with a reticle grid. The dose grid size is typically smaller than the size of the reticle grid. ^ can be done, for example, by embedding (3x3) 9 dose grids within the transition grid. The grayscale value can vary between 1% and 5〇% in 0.2% steps. Therefore, each dose cell requires 8 bits. However, the result is to link two separate concepts. The pitch value changes, also has The inevitable result of the size of the dose grid. The pixel grid size and position are preferably elastic. The pixels can be non-square, but will always have the same scale in one strip/channel. The pixel can be (the worst) Case) Four sub-columns are rotated. Because of the sub-pixel shift, each column can be used with different (Y-direction) alignments. 2011-11254 The following specifications are provided for one embodiment. Contains blocks and additional information with up to 4 corners of 64 by 64 nm. Edges are vectors starting in the corners, Edge丨 or Edge2, and the clockwise angle from Edge 1 to Edge2 defines the active side. The corners are the corners of the features in the grid. When a line is crossed and there is no actual corner, the corners can have an angle of 18 degrees. Assuming that each of the transitions has a maximum of 4 corners, the specification of the angular data is楛下下+ : Name Number of Bits Basic Principle X Position 8 64 nm®〇.5 nm Y Position 8 64 nm®0.5 nm Edgel Direction 3 8 Directions Edge2 Direction 3 7 Directions Possible, Equal to Edge 1 is a Special Case: Project No serif size used 5 0 means cutoff (〇ff) total 27 In order to calculate the size of the serif from its field value, different strategies can be used, for example: look up the table '丨, the field value is used as an index in a predefined table, Or by calculation (for example, serif size = value * 0.5 nm, therefore, the range is 〇. 15 5@G 5nm, assuming a positive serif size). Specifications for the reticle data of one embodiment Is provided in the following table: 78 201123254 Name of the number of bit cells per cell Total bit angle 27 4 108 Dose mapping 3x3 8 9 72 Total 180 The following table summarizes the amount of data when using the above format. For this number, the assumption is that there is no stitching. -- - --•m ,~ f ^ Name formula result number of bytes per cell 180 bits / 8 23 bytes Number of cells per strip 3 3 mm / 64 nm * 2jLtm 16E6 grid / 64 nm ^ The number of cells is 13000*16E6 209E9 The number of bytes in the field of Gezhi 209E9*23 The number of bytes in the band of 5T bytes is 16E6*23 3 The number of bytes in the 70-bit byte may exist. For example, it is expected that multiple grids contain less than 4 corners, and for all dose grids, the dose rate = month t* is the same value. 0 The fixed-size data structure will alleviate the FpGA design (addressing and loading) Into the task, but with consequences for the memory. For communication to (disk) storage, the standard ink reduction technique can be used to transfer the data _. This ^ usage record is filled with the same value (example (6): is valid for unused edges). Compression is also valid for repeated values, as for agent Dan: zero). Some of the design problems for the above embodiments are: 79 201123254 ♦ Up to 4 corners per cell may not be sufficient; * Finding serifs in adjacent cells is expensive in processing time and memory and if possible Avoid; *The serifs may be different from the expected shape; • Fixed angles per grid are desirable for hardware implementation; • A fixed number of corners per grid creates a large amount of data; • Each grid is small A fixed number of corners creates a lack of flexibility; • From the theoretical point of view of the information, encoding all the corners is over-information, but significantly beneficial for implementation in hardware; • The resolution of the corners is preferably 0.25 nm instead of 0.5 Nm; • Encoding only half of the edges may be sufficient. The larger block co-chops i as a compromise between a large number of large and small fixed number of corners, one possibility is to limit the maximum number of edges for larger data blocks, for example: in the mechanical scanning direction is about Big 16 times. It is assumed that the local maximum number of corners in a region of this larger block will be compensated for by the smaller number of corners in another region of the block. An upper limit of more than 4 at the maximum number of corners is undesirable due to the use of a memory. However, the lower limit of use will not cover all possible situations. As an intermediate solution, consider the following scheme: encode the data in a block larger than the current cell (for example, a block of i 6 cells at a time), and limit the number of edges to the 'where' part of the block. The maximum number of corners can be higher. In this scenario, the serif is coded as if it were an edge, which is helpful. 80 201123254 In order to implement this embodiment, the following changes can be made to the above embodiment: • Define a block having a γ direction (deflection direction) of 62 5 nm and an X direction (mechanical scanning direction) of nm; The Y size of the block is reduced from 64 to 62 5 ηηι. This has two advantages. 16*62.5 = 1000 nm and 62.5/0.25=250, which can be efficiently encoded in 8-bit; • The fee map can have a resolution of 3 1 25 X 31 25 nm (1〇 • 1/32 of 〇〇nm); • The maximum number of corners is set to 64 per block (average of 4, 62, 6 x 62.5 nm for 4 corners); • The serif is encoded within the data, Like the edges and corners themselves. The g lower specification is provided for this embodiment: Name — 丨 1 Value Transfer block 62.5 by 1 〇〇〇 nm block containing 64 angular and dose information. Edge The vector at the beginning of the corner. Edgel or Edge2. The clockwise angle from Edgei to Edge2 defines the active side. Angular angle of a feature in a lattice. If the line is traversed without the actual angularity, it may also be an angle with an angle of 180 degrees. Assume that each transition column has a maximum of 4 corners. The specifications for the angular data for this embodiment are provided in the following table:

81 201123254 名稱 位元數目 基本原理 X位置 12 1 〇〇0 nm(Sy〇.25 nm Y位置 8 • --- 62.5 nm(®y〇.25 nm Edgel方向 3 — 一 8個方向 Edge2方向 3 7個方向可能,等於Edgel是 特例:項目未使用 總計 26 ~— 一 ----^ gig:此實施例的轉列格數據的規格是提供在下表 名稱 每個單元的位元 單元數目 總位元 棱角 26 64 1664 劑量映射3x3 8 64 512 總計 1 2176 下表是總結當使用上述格式時的數據量。對於此數據 置表格的假設是不具有缝綴。此估計並未考量捨入,其當 將資訊儲存在實際RAM時而發生。 名稱 公式 結果 每個區塊的位元組數目 2176位元/8 272位元組 每個條帶的區塊數目 3 3mm/l 000nm*2/tm /62.5nm 1056000 每個場域的區塊數目 13000*1E6 13.7E9區塊 每個場域的位元組數目 13E9*272 3.4T位元組 每個條帶的位元組數目 1E6*272 274M位元組 存在壓縮的機會》舉例來說’預期的是:多個區塊是 82 201123254 含有少於64個稜角,且對於相鄰的劑量格,劑量率將具有 類似值。然而,壓縮亦導致較複雜的實施。數據可能是在 被輸送通過系統時而作壓縮。 a由資訊的理論觀點,將具有所有座標的所有稜角蝙螞 是不必要的。然而,此戲劇性地減少在實施中的計算工作 量。亦將區塊邊界交又編碼是可為裨益的增加棱角的 數目,但是將在FPGA中的計算工作量減少更多。此外,應 納入考量的是:整個轉列過程應可從數據的二端所執行。 在一個方向將一歧“明顯”眘邻么、土 .., —貝貝訊嚙去,可能當在其他方向 掃描時而引起問題。 區塊是同樣可能定位在偏轉掃描方向。存在為何不應 如此的二個理由。實施的平行性必須在條帶内的數個條帶 處理數據’且若是以此方歧位數據,此將是不可能的。 此外二在偏轉掃描方向的粒度將是1GG()nm,㈣於縫綴為 不合意。纟目前情況’包括縫綴區域之條帶寬度的粒度是 62.5 nm ° 將數據封包在記憶體應該得到一些想法。若對於劑量 映射的數據是儲存在不同於稜角數據的個別位元路徑可能 會有所助益。 使用先前段落的方式具有以下的裨益: •數據量較小(例如:3.5 TB而非5 TB); •特徵解析度較尚(例如:〇 2 5 n m而dt Λ c \ m nm 而非 0.5 nm); •對於襯線且對於在局部銘固μ社& a 你句0丨靶圍的稜角數目,彈性較高; •實施較不複雜。 83 201123254 圊型束微影系统 圖74顯示一種帶電粒子多個小束微影系統1的實施例 的簡化示意圖,微影系統1是基於其不具有所有電子小束 的共同交越的電子束光學系統。此光學系統是詳述於美國 專利申請案序號第61/045,243號,其整體為以參照方式而 納入於本文。 此種微影系統適當包含:小束產生器,其產生複數個 小束;小束調變器,其將該等小束圖型化為調變的小束; 及,小束投射器’其用於將該等小束投射到目標表面上。 小束產生器是典型包含源與至少一個孔隙陣列。小束調變 器是典型為小束熄滅器,其具有熄滅偏轉器陣列與束光闌 陣列。小束投射器典型包含掃描偏轉器與投射透鏡系統。 圖74並未明確顯示本發明的晶圓定位與支撐結構。 微影系統1尤為適以結合如本文所述的所謂的兩次或 夕人知&來實施几餘掃描功能性。到目標表面上的掃描線 準確度的此達成改良致使能夠實現第二次掃描,其將在第 一次掃描順序中所留空的間隙填滿。 在圖74所示的實施例中,微影系統包含電子源' 3 用於產生同質、擴展的電子束 10 kev的範圍中維持相當低β 較佳為低,電子源較佳為相關 在約-1到-l〇kV之間,雖然亦 來自電子源3的電子束4 使電子束4準直的準直器透鏡 4。束能量較佳為在約1到 為了達成此舉,加速電壓是 於在接地電位的目標而保持 可使用其他設定。 通過雙重八極及隨後為用於 5。如將瞭解,準直器透鏡5 84 201123254 可為任何型式的準直光學系統,隨後’電子束4撞擊於分 束器,其在一個適合實施例為孔隙陣列6A。孔隙陣列6A 將部分束阻斷且允許複數個子束2〇通過孔隙陣列6A。孔隙 陣列較佳包含其具有通孔的板。因此,產生複數個平行的 電子子束20。 第二孔隙陣列6B從各個子束產生若干個小束7。系統 產生大虽小束7 ’較佳為約1〇,〇〇〇到loo ooo個小朿,雖然 誠然可能使用較多或較少的小束。注意,亦可使用其他的 已知方法來產生準直的小束。 此允許子束的操控,其結果是對系統操作為有益,尤 其是當小束的數目增加到5,0〇〇或更多個時。此類操控是例 如由聚光透鏡、準直器或其將子束會聚到光軸(例如:在投 射透鏡的平面)的透鏡結構所實現。 聚光透鏡陣列21 (或一組聚光透鏡陣列)被納入在子束 產生孔隙陣列6A的後方,用於將子束2〇聚焦朝向在束光 闌陣列10中的對應開口。第二孔隙陣列6B從子束2〇產生 小束7。小束產生孔隙陣列6B較佳納入為結合小束熄滅器 陣列9;舉例來說,可將二者組裝在一起以形成子組件。在 圖74之中,扎隙陣列6B從各個子束2〇產生三個小束7, 其撞擊在束光闌陣列10的對應開口,使得三個小朿為由在 末端模組22 t的投射透鏡系統所投射到目 對於在末端模組22中的各個投射透鏡系統 6B產生更大量的小束。在一個實施例中, (排列在7x7陣列中的)49個小束且將其指 標上。實際上, ’可由孔隙陣列 從各個子束產生 引通過單個投射81 201123254 Name Number of Bits Basic Principle X Position 12 1 〇〇0 nm (Sy〇.25 nm Y Position 8 • --- 62.5 nm (®y〇.25 nm Edgel Direction 3 - One 8 Directions Edge2 Direction 3 7 The direction is possible, equal to Edgel is a special case: the project is not used for a total of 26 ~ - one --- ^ gig: the specification of the transfer column data of this embodiment is the total number of bit units provided in each unit of the table below name Angle 26 64 1664 Dose Mapping 3x3 8 64 512 Total 1 2176 The following table summarizes the amount of data when using the above format. The assumption for this table is that there is no stitching. This estimate does not consider rounding, which will The information is stored in the actual RAM. Name formula result The number of bytes per block 2176 bits / 8 272 bytes The number of blocks per strip 3 3mm / l 000nm * 2 / tm / 62.5nm 1056000 Number of blocks per field 13000*1E6 Number of bytes per field in 13.7E9 block 13E9*272 3.4T bytes Number of bytes per strip 1E6*272 274M bytes exist The opportunity to compress", for example, 'expected: multiple blocks are 82 201123254 contains less than 64 corners, and the dose rate will have similar values for adjacent dose grids. However, compression also leads to more complex implementations. Data may be compressed as it is transported through the system. Theoretically, it would be unnecessary to have all the angular bats with all the coordinates. However, this dramatically reduces the computational effort in the implementation. Also coding the block boundaries is the number of added edges that can be beneficial, but The computational effort in the FPGA will be reduced more. In addition, it should be taken into consideration that the entire transfer process should be performed from the second end of the data. In one direction, it will be "obvious" and be careful. , —Bebe, it may cause problems when scanning in other directions. The block is also likely to be positioned in the direction of the deflection scan. There are two reasons why this should not be the case. The parallelism of the implementation must be within the strip. Several strips process the data' and this would be impossible if the data is in this way. In addition, the granularity in the deflection scan direction will be 1GG() nm, and (4) the stitching is undesired.纟 The current situation 'the granularity of the strip width including the stitching area is 62.5 nm °. The data should be encapsulated in the memory should get some ideas. If the data for the dose mapping is stored in an individual bit path different from the angular data, there may be The benefits of using the previous paragraphs are as follows: • Small amount of data (eg 3.5 TB instead of 5 TB); • Feature resolution is better (eg 〇2 5 nm and dt Λ c \ m nm) Instead of 0.5 nm); • For the serif and for the local number of the corners of the target, the elasticity is higher; • The implementation is less complicated. 83 201123254 圊-beam lithography system FIG. 74 shows a simplified schematic diagram of an embodiment of a charged beam multi-beam lithography system 1 based on electron beam optics that does not have all electron beamlets system. The optical system is described in detail in U.S. Patent Application Serial No. 61/045,243, the entire disclosure of which is incorporated herein by reference. Such a lithography system suitably includes: a beamlet generator that generates a plurality of beamlets; a beamlet modulator that maps the beamlets into modulated beamlets; and, a beamlet projector Used to project the beamlets onto the target surface. The beamlet generator is typically comprised of a source and at least one array of pores. The beamlet modulator is typically a beamlet extinguisher with an extinguishing deflector array and a beam stop array. Beamlet projectors typically include a scanning deflector and a projection lens system. The wafer positioning and support structure of the present invention is not explicitly shown in FIG. The lithography system 1 is particularly adapted to perform several scan functions in combination with so-called two-time or singularity & This improved accuracy of the scan line to the target surface enables a second scan that fills the gap left in the first scan sequence. In the embodiment shown in FIG. 74, the lithography system includes an electron source '3' for generating a homogenous, extended electron beam 10 kev that maintains a relatively low temperature, preferably low, and the electron source is preferably correlated. Between 1 and -l〇kV, although the electron beam 4 from the electron source 3 also collimates the collimator lens 4 of the electron beam 4. The beam energy is preferably at about 1 to achieve this, the accelerating voltage is maintained at the target of the ground potential and other settings can be used. Pass double octet and subsequently for 5. As will be appreciated, the collimator lens 5 84 201123254 can be any type of collimating optics, and then the 'electron beam 4 strikes the beam splitter, which in one suitable embodiment is the aperture array 6A. The aperture array 6A blocks a partial beam and allows a plurality of beamlets 2 to pass through the aperture array 6A. The aperture array preferably comprises a plate having a through hole. Thus, a plurality of parallel electron beamlets 20 are produced. The second aperture array 6B produces a number of beamlets 7 from the respective beamlets. The system produces a large beam 7 ', preferably about 1 〇, and loo ooo, although it is certainly possible to use more or less beamlets. Note that other known methods can also be used to generate a collimated beamlet. This allows manipulation of the beamlets, which results in a benefit to system operation, especially when the number of beamlets is increased to 5,0 or more. Such manipulation is achieved, for example, by a concentrating lens, a collimator, or a lens structure that converges the beamlets onto an optical axis (e.g., in the plane of the projection lens). A concentrating lens array 21 (or a group of concentrating lens arrays) is incorporated behind the beamlet generation aperture array 6A for focusing the beamlets 2 朝向 toward corresponding openings in the beam pupil array 10. The second aperture array 6B produces a beamlet 7 from the beamlet 2〇. The beamlet generation aperture array 6B is preferably incorporated into the beamlet extinguisher array 9; for example, the two can be assembled together to form a subassembly. In Fig. 74, the lash array 6B generates three small bundles 7 from the respective sub-beams 2 , which impinge on corresponding openings of the beam aperture array 10 such that three small pupils are projected by the end module 22 t The projection of the lens system to the target produces a greater amount of beamlets for each of the projection lens systems 6B in the end module 22. In one embodiment, 49 small bundles (arranged in a 7x7 array) are indexed. In fact, ' can be generated from each sub-beam by a pore array through a single projection

85 201123254 透鏡系統’雖然可將每個子束的小束數目增加到2〇〇個或 更多者。 從束4透過子束20的中間階段逐步產生小束7具有優 點在於,可用相當有限的子束20且位在相當遠離目標處來 實現主要光學操作。一個此類操作將子束會聚到對應於一 個投射透鏡系統的一點《較佳而言,在操作與會聚點之間 的距離是大於在會聚點與目標之間的距離。最適合的是, 在此結合利用靜電投射透鏡。此會聚操作致使系統能夠符 合縮小光點尺寸、增大電流、及縮小點展開的需求,以在 進階節點完成可靠的帶電粒子束微影,尤其是在具有小於 90 nm的臨界尺度的節點。 小束7接著通過調變器陣列9。此調變器陣列9可包含 具有複數個熄滅器的小東熄滅器陣列,各個熄滅器為能夠 將或多個電子小束7偏轉。媳滅器是較明確為提供第一 與第二電極的靜電偏轉器,第二電極是接地或共同電極。 小束熄滅器陣列9與東光闌陣列1〇是構成調變裝置。基於 小束控制數據,調變機構8將圖型添加到電子小束7。^型 將藉由存在於末端模組22之内的構件而投射到目標24。 在此實施例中,束光闌陣列1G包含用於允許小束通過 的孔隙陣列。I光闌陣列在其基本的形式中包含基板,其 提供通孔,典型為圓孔’雖然亦可使用其他形狀。在一個 實施例中,束光闌陣列H)的基板是由具有規律間隔的通孔 陣列的石夕晶圓所形成’且可用金屬的表面層所塗覆以防止 表面充電。在-個實施例中’該金屬是不形成天然氧化物 86 201123254 表層的-種型式者,諸如:CrM〇。 實知例中,束光闌陣列10的通路被對準在小束 熄滅器陣列9中的方,^ ^ D ^ β ,孔。小束尨滅器陣列9與小束光闌陣列 10二-起操作以將小束7阻斷或讓小束7通過。若小束媳 車J 9將小束偏轉,則其將不會通過在小束光鬧陣列 1/的對應孔隙,而是將會由小束阻斷陣列1〇@基板所阻 斷。但右是小束熄滅器陣列9未將小束偏轉,則其將通過 在小束光闌陣列1G的對應孔隙且將接著投射為目標24的 目標表面1 3上的光點。 2影系統進-步包含用於將小束控制數據供應到小束 德滅益陣列的數據途徑。可用光纖來傳送小束控制數據。 來自各個光纖端的調變光束是投射在小束熄滅器陣歹"的 光敏元件上。各個光束持有用於控制其耦合到光敏元件的 .一或多個調變器的部分圖型數據。 隨後,電子小束7進入末端模組。下文,術語“小束” 疋關於已經調變後的小束。此類的調變小束是有效包含依 時間方式的順序部分。此等順序部分的一些者可具有較低 強度且較佳為具有零強度,即:止在束光闌的部分。為了 允许小束定位到對於隨後掃描週期的起始位置,一些部分 將具有零強度。 末端模組22較佳構成為可插入、可更換式單元,其包 含種種構件。在此實施例中,末端模組包含束光闌陣列i 〇、 掃描偏轉器陣列11、及投影透鏡配置丨2,雖然並不是所有 此等者為均須納入在末端模組中且其可為以不同方式配 87 201123254 置。 在通過小束^料列1G之後,調變的小束7通過掃福 偏轉器陣列U,其提供各個小束7在X-及/或Y-方向(實^ 為二直;未偏轉小束7的方向)的偏轉。在此實施例中,偏 轉器陣列11掃描靜電偏轉器,其致使能夠施加相當小的驅 動電壓,如將在下文所解說。 接著,小束通過投影透鏡配置12且投射到目標平面中 的目‘(八典型為晶圓)的目標表面24上。對於微影應用, 目私通承包含其提供帶電粒子敏感層或抗蝕劑層的晶圓。 投影透鏡配置12將小束聚焦,較佳為造成其直徑約10到 3〇奈米的幾何光點尺寸。以此類設計的投影透鏡配置ι2較 佳提供約100到500倍的縮小。在此較佳實施例中,投影 透鏡配置12有利為位在接近於目標表面。 在一些實施例中,束保護器可位在目標表面24與聚焦 投影透鏡配置12之間。束保護器可為提供必要孔隙的箔片 或板,用於將從晶圓所釋放的抗蝕劑粒子在其可能到達微 影系統中的任何敏感元件前而吸收。替代或附加而言,掃 描偏轉陣列11可提供在投影透鏡配置12與目標表面24之 間。 概略而言,投影透鏡配置12將小束7聚焦到目標表面 24 ^此外,進一步確保單個像素的光點尺寸為正確。掃描 偏轉器11將小束7偏轉為遍及目標表面24。此外,須確保 在目標表面24上的像素位置於微刻度為正確。尤其,掃〆 偏轉器11的操作必須確保將像素適當相配到像素網格,其 88 201123254 隶後為構成在目標表面24上的圖型。將瞭解的是,在目標 表面上的像素的微刻度定位是適合為由其在目標24下方的 晶圓定位系統所致能。 此類的高品質投射是相關以得到其提供可複製結果的 微影系統。通常,目標表面24包含在基板頂部上的抗蝕劑 膜。部分的抗蝕劑膜將由帶電粒子(即:電子)小束的施加以 化學方式修改。此結果為,膜的照射部分將多少有些為可 溶於顯影劑,造成在晶圓上的抗蝕劑圖型。在晶圓上的抗 融劑圖型可隨後為轉移到下層,即:藉由實施在半導體製 造技術中所習知的蝕刻及/或沉積步驟。明顯地,若照射為 不均勻,抗敍劑可旎為並非以均勻方式顯影,導致在圖型 中的錯誤《甚者,諸多的此類微影系統利用複數個小束。 偏轉步驟不應該造成在照射中的任何差異。 在此類光學系統的一個實施例申,在來自相鄰子束2〇 的第一與第二群組的小束7之間保留空間。此外,系統定 義為含有束區域51與非束區域52,如在圖75所示。成為 束區域51與非束區域52的劃分是存在調變裝置♦以及末 端模組(例如:投射透鏡系統)内。可利用非束區域52在浐 射透鏡系統中,用於提供機械支撐結構以使得任何振動^ 應最小化。對應於非束區域52的空間可被填滿,例如:= 定圖型為在轉移過程的後續步驟中轉移到目標上的空間。 此後續步驟是在相對於柱而將目標移動後而實行。填滿命 間的特定順序亦稱為寫入策略。 二 本發明已經關於上文論述的某些實施例所描述。應注 89 201123254 意,已經描述種種結構與替代者,其可為連同本文所述的 任何實施例所使用,如將為熟悉此技術人士所習知。再者, 將 < 可的疋,在沒有脫離本發明的精神與範疇的情況下, 此等實施例容許對於熟悉此技術人士為眾所週知的種種修 改與替代形式。是以’雖然已經描述特定實施例,此等實 施例僅為舉例且不限制本發明的料,其為定義在隨附的 申請專利範圍中。 【圖式簡單說明】 本發明的種種觀點與本發明的實施例的某些實例是在 圖式中說明,其中: 圖1是顯示無遮罩微影系統的概念圖; 圖2A是帶電粒子微影系統的實施例的簡化示意圖; 圖2B是在數據途徑中的元件的簡化圖; 圖3與4顯示小束熄滅器陣列的一部分; 圖5是顯示在分割為場域的晶圓上的寫入方向的圖例; 圖6是顯示掃描線位元框與小束偏轉的圖例; 圖7是說明圖型偏移與圖型定標的實例的圖例; 圖8是顯示對於使用四個小束來寫入條帶的可能交插 方案的實例的圖例; 圖9是簡化的四小束熄滅器陣列與掃描線圖型的圖例; 圖1 〇疋對於因數κ與在掃描線間的距離的值的表格; array 圖1 1是九個小束的陣列的圖例,顯示束間距pb、投射 間距Pproj、網格寬度Wpr〇j與傾斜角 90 201123254 圖12是框起始指示位元的圖例; 圖13是具有X個處理單元的節點的示意圖; 圖14是每個掃描的通道位置的概念圖; 圖15與16是對於二個掃描的處理單元對通道的分配 的概念圖; 圖17至23是曲線圖,說| 數據途徑容量的模擬實驗結果; 5兒明關於微影機器容量而改變 圖24是顯示在微影系統中的處理的相依性的流程圖 圖25與26是說明\與丫圖型移位的實例的圖例; 圖 2785 201123254 Lens system 'Although the number of beamlets per sub-beam can be increased to 2 or more. The stepwise generation of the beamlet 7 from the intermediate stage of the beam 4 through the beamlet 20 has the advantage that a relatively limited beamlet 20 can be used and positioned relatively far from the target to achieve the primary optical operation. One such operation concentrates the beamlets to a point corresponding to a projection lens system. Preferably, the distance between the operation and the convergence point is greater than the distance between the convergence point and the target. Most suitably, an electrostatic projection lens is used in combination here. This convergence operation enables the system to meet the need to reduce spot size, increase current, and reduce dot unwinding to achieve reliable charged particle beam lithography at advanced nodes, especially at nodes with critical dimensions less than 90 nm. The beamlet 7 then passes through the modulator array 9. The modulator array 9 can include an array of small east extinguishers having a plurality of extinguishers, each of which is capable of deflecting one or more electron beamlets 7. The quencher is an electrostatic deflector that is more specifically providing first and second electrodes, and the second electrode is a ground or common electrode. The beamlet extinguisher array 9 and the east beam array 1 are constituting a modulation device. Based on the beamlet control data, the modulation mechanism 8 adds the pattern to the electron beamlet 7. The ^ type will be projected onto the target 24 by means of components present within the end module 22. In this embodiment, the beam stop array 1G includes an array of apertures for allowing the beamlets to pass. The I pupil array comprises, in its basic form, a substrate that provides a via, typically a circular aperture, although other shapes may be used. In one embodiment, the substrate of the beam stop array H) is formed by a stone wafer of regularly spaced via arrays and can be coated with a metal surface layer to prevent surface charging. In one embodiment, the metal is a type that does not form a natural oxide 86 201123254 skin layer, such as: CrM〇. In the example, the path of the beam stop array 10 is aligned to the square of the beamlet extinguisher array 9, ^ ^ D ^ β , the hole. The beamlet quencher array 9 and the beamlet array 10 operate to block the beamlets 7 or pass the beamlets 7. If the small beam J 9 deflects the beam, it will not pass through the corresponding aperture in the beamlet array 1/, but will be blocked by the beamlet blocking array 1@@ substrate. But to the right, the beamlet extinguisher array 9 does not deflect the beamlets, which will pass through the corresponding apertures in the beamlet array 1G and will then be projected as the spot on the target surface 13 of the target 24. The 2-shadow system step-by-step includes a data path for supplying beamlet control data to the small bundle of de-emphasis arrays. The fiber optic can be used to transmit the beamlet control data. The modulated beam from each fiber end is projected onto the photosensor of the beamlet extinguisher array. Each beam holds partial pattern data for controlling one or more modulators that are coupled to the photosensitive element. Subsequently, the electron beamlet 7 enters the end module. Hereinafter, the term "small beam" 疋 refers to the small beam that has been modulated. This type of modulation beamlet is effectively included in a sequential part of the time mode. Some of these sequential portions may have a lower intensity and preferably have a zero intensity, i.e., stop at the portion of the beam stop. In order to allow the beamlet to be positioned to the beginning of the subsequent scan cycle, some portions will have zero intensity. The end module 22 is preferably constructed as an insertable, replaceable unit that includes a variety of components. In this embodiment, the end module includes a beam aperture array i 〇, a scanning deflector array 11 , and a projection lens arrangement 丨 2, although not all of them are included in the end module and In different ways, 87 201123254 is placed. After passing through the beamlet 1G, the modulated beamlets 7 pass through the bail deflector array U, which provides the respective beamlets 7 in the X- and/or Y-direction (solid ^ two straight; undeflected beamlets) Deflection of the direction of 7). In this embodiment, the deflector array 11 scans the electrostatic deflector, which enables a relatively small drive voltage to be applied, as will be explained below. The beamlets are then passed through the projection lens arrangement 12 and projected onto the target surface 24 of the target ' (typically wafer) in the target plane. For lithography applications, it is intended to include wafers that provide a charged particle sensitive layer or resist layer. The projection lens arrangement 12 focuses the beamlets, preferably to a geometric spot size having a diameter of about 10 to 3 nanometers. Projection lens configuration ι2 of this type preferably provides a reduction of about 100 to 500 times. In the preferred embodiment, the projection lens arrangement 12 is advantageously positioned proximate to the target surface. In some embodiments, the beam protector can be positioned between the target surface 24 and the focused projection lens configuration 12. The beam protector can be a foil or plate that provides the necessary porosity for the resist particles released from the wafer to be absorbed before it can reach any sensitive elements in the lithography system. Alternatively or in addition, the scan deflection array 11 can be provided between the projection lens arrangement 12 and the target surface 24. In summary, the projection lens arrangement 12 focuses the beamlet 7 to the target surface 24^ In addition, it is further ensured that the spot size of a single pixel is correct. Scanning deflector 11 deflects beamlet 7 throughout target surface 24. In addition, it must be ensured that the pixel position on the target surface 24 is correct on the micro scale. In particular, the operation of the broom deflector 11 must ensure that the pixels are properly matched to the pixel grid, which is followed by the pattern formed on the target surface 24. It will be appreciated that the micro-scale positioning of the pixels on the target surface is suitable for enabling the wafer positioning system underneath the target 24. High quality projection of this type is related to get a lithography system that provides reproducible results. Typically, target surface 24 contains a resist film on top of the substrate. A portion of the resist film will be chemically modified by the application of a small beam of charged particles (i.e., electrons). As a result, the irradiated portion of the film will be somewhat soluble in the developer, resulting in a resist pattern on the wafer. The resist pattern on the wafer can then be transferred to the lower layer, i.e., by performing etching and/or deposition steps as is conventional in semiconductor fabrication techniques. Obviously, if the illumination is not uniform, the anti-synthesis agent may not be developed in a uniform manner, resulting in errors in the pattern. "Our many such lithography systems utilize a plurality of small beams. The deflection step should not cause any difference in the illumination. In one embodiment of such an optical system, space is reserved between the first and second groups of beamlets 7 from adjacent beamlets 2A. Further, the system is defined to include a beam region 51 and a non-beam region 52, as shown in FIG. The division into the beam region 51 and the non-beam region 52 is in the presence of the modulation device ♦ and the end module (e.g., projection lens system). A non-beam region 52 can be utilized in the lenticular lens system for providing a mechanical support structure to minimize any vibration. The space corresponding to the non-beam region 52 can be filled, for example: = The pattern is the space that is transferred to the target in a subsequent step of the transfer process. This subsequent step is performed after moving the target relative to the column. The specific order in which the commands are filled is also known as the write strategy. The present invention has been described in relation to certain embodiments discussed above. It is to be noted that various structures and alternatives have been described which may be used in conjunction with any of the embodiments described herein, as will be apparent to those skilled in the art. Furthermore, the embodiments are susceptible to various modifications and alternative forms that are well known to those skilled in the art without departing from the spirit and scope of the invention. The present invention is intended to be illustrative only and not limiting of the invention, which is defined in the scope of the appended claims. BRIEF DESCRIPTION OF THE DRAWINGS Various aspects of the present invention and some examples of embodiments of the present invention are illustrated in the drawings, in which: FIG. 1 is a conceptual diagram showing a maskless lithography system; FIG. 2A is a charged particle micro A simplified schematic of an embodiment of a shadow system; FIG. 2B is a simplified diagram of elements in a data path; FIGS. 3 and 4 show a portion of a beamlet blanker array; FIG. 5 is a diagram showing writes on a wafer that is divided into fields Fig. 6 is a diagram showing a scanning line bit frame and beam deflection; Fig. 7 is a diagram illustrating an example of pattern offset and pattern scaling; Fig. 8 is a diagram showing the use of four beamlets for writing Figure for an example of a possible interleaving scheme for strips; Figure 9 is a simplified illustration of a four beamlet extinguisher array and scan line pattern; Figure 1 表格 Table for the value of the factor κ and the distance between scan lines Array Figure 1 is an illustration of nine small beam arrays showing beam spacing pb, projection pitch Pproj, grid width Wpr〇j and tilt angle 90 201123254 Figure 12 is an illustration of the box start indicator bit; Figure 13 Schematic diagram of a node with X processing units Figure 14 is a conceptual diagram of the channel position of each scan; Figures 15 and 16 are conceptual diagrams of the allocation of channels for two scanned processing units; Figures 17 to 23 are graphs, showing the results of simulation experiments of data path capacity 5 shows a change in the lithography machine capacity. FIG. 24 is a flow chart showing the dependence of the processing in the lithography system. FIGS. 25 and 26 are diagrams illustrating an example of the \ and 丫 pattern shift; FIG.

塊圖; 圖2 8是數據途徑的簡化功能方塊圖; 圖29是重疊在條帶上的佈局圖型特徵的圖例; 圖30是遞色過程的圖例; 圖31是在位元框中的位元移位的圖例; 圖32是對於參數N=4與κ=3的小束位置的圖例; 圖33是顯不數據途徑的數據處理與儲存元件的示意方 流程圖;Figure 2 is a simplified functional block diagram of the data path; Figure 29 is an illustration of the layout pattern features overlaid on the strip; Figure 30 is a legend of the dithering process; Figure 31 is a bit in the bit box Figure for the meta-shift; Figure 32 is a graphical representation of the beamlet position for the parameters N=4 and κ=3; Figure 33 is a schematic flow diagram of the data processing and storage elements of the data-only path;

圖36是圖塑串流器旬 圖37是顯示在圖36 郎點的元件的方塊圖; 據流的功能圖; 之圖型串流器節點的元件間的數Figure 36 is a block diagram of the components of Figure 33. Figure 37 is a block diagram of the elements of Figure 36; the functional diagram of the stream; the number of elements between the graphs of the streamer node

I S I 91 201123254 圖38是顯示數據途徑的處理與傳輸元件的細節的方塊 团 · 圖, 圖39疋包括壓縮與解壓縮功能的數據途徑的一部分的 功能方塊圖; 圖4〇說明遞色的單色測試影像的實例; 圖41疋包括在通道轉列後的壓縮與解壓縮功能的數據 途徑的一部分的功能方塊圖; 圖42顯示-格的轉列位元映射的實例; 圖43疋小網格輸入像素與大輸出像素的概念圖; 圖44 $數據途徑的另—個實施例的功能方塊圖; 圖45疋』不對於圖44之數據途徑的處理的相依性的 流程圖; 圖 46是圖型串流器節點的元件的方塊圖; 圖47與48是顯示在圖46 的替代數據流的功能圖; 之圖型串流器節點的元件間 圖49是顯示數據途徑的 圖5 0是顯示在圖型串流 的功能圖; 疋件間的通訊的示意圖; 器節點的元件間的替代數據流 圖51是用於數據途徑的咖的内部架構的圖例; 圖二2是顯示在圖型串流器節點的元件 的功能圖; 圖5 3疋數據途徑的另一個管始也丨认丄 θ θ 回貫施例的功能方塊圖; 圖54疋顯示數據途徑的處 圖 兴得輸7L件的細節的方塊 92 201123254 圖55是具有交插/多工的子通道的數據途徑的示意圖; 圖56是使用列選擇器與行選擇器的解多工方案的示意 圖, 圖57是像素尺寸與網格寬度的表格,取決於每個圖型 束的小束數目(Npat beams)、陣列傾斜角(aarray)、投射間距 (Pproj)、與 K 因數; 圖58A是說明智慧型邊界策略的圖例; 圖58B是說明軟邊緣策略的圖例; 圖59是使用離線柵格化的數據途徑的實施例的功能流 程圖; 圖60是使用線内栅格化的數據途徑的實施例的功能流 程圖; 圖61是使用線内柵格化的數據途徑的另一個實施例的 功能流?程圖; 圖62是使用即時柵格化的數據途徑的實施例的功能流 程圖; 圖63是說明四個小束的陣列的圖例; 圖64是說明縫綴方案的圖例; 圖65是說明具有因數K=1與κ=3的寫入策略的圖例; 圖66是對於具有4個小束的圖㉟束的可能&值的圖例; 圖67是說明圖型佈局的實例的圖例; 圖6 8是說明稜角概念的圖例; 圖69是說明向量方位的圖例; 圖70是說明方形特徵的編碼的圖例; 93 201123254 圖71 f說明複雜特徵形狀的編碼的圖例,· 圖72疋說明小於一 ^ L 、 格的對角線長度之最小特徵間距的 實例的圖例; 些稜角的襯線之特徵的實 圖73是說明具有附加到其一 例的圖例; 圖74是顯示一 的簡化示意圖;以及 種帶電粒子多個小束微影系統的實施例 圖75是 疋顯示成為束區域與非束區域的劃分的圖 例。 【主要元件符號說明 1 微影糸統 電子源 電子束 準直器透鏡 孔隙陣列 孔隙陣列 孔隙陣列 小束 小束熄滅器陣列 束光闌陣列 子束 聚光透鏡陣列 末端模組 目標 3 4 5 6ISI 91 201123254 Figure 38 is a block diagram showing details of the processing and transport elements of the data path, Figure 39 is a functional block diagram of a portion of the data path including compression and decompression functions; Figure 4 is a diagram illustrating a dithered monochrome An example of a test image; Figure 41 is a functional block diagram of a portion of the data path including compression and decompression functions after channel tracing; Figure 42 shows an example of a truncated bit map of a cell; A conceptual diagram of input pixels and large output pixels; Figure 44 is a functional block diagram of another embodiment of the data path; Figure 45 is a flow chart showing the dependency of the processing of the data path of Figure 44; Figure 46 is a diagram Block diagram of elements of a type of streamer node; Figures 47 and 48 are functional diagrams of the alternate data stream shown in Figure 46; Figure 49 of the diagram of the streamer node is shown in Figure 49. A functional diagram of a stream of graphics; a schematic diagram of communication between components; an alternative data flow between components of a node; Figure 51 is an illustration of the internal architecture of a coffee channel for data access; Figure 2 is a graphical representation of the graphics flow Figure 5 shows the function block diagram of the θ θ back-through example; Figure 54 shows the details of the data path Block 92 201123254 Figure 55 is a schematic diagram of a data path with interleaved/multiplexed subchannels; Figure 56 is a schematic diagram of a demultiplexing scheme using column selectors and row selectors, and Figure 57 is a pixel size and grid width The table depends on the number of beamlets (Npat beams), the array tilt angle (aarray), the projected pitch (Pproj), and the K factor for each of the pattern bundles; Fig. 58A is a diagram illustrating the smart boundary strategy; Fig. 58B is An illustration of a soft edge strategy is illustrated; Figure 59 is a functional flow diagram of an embodiment of a data path using offline rasterization; Figure 60 is a functional flow diagram of an embodiment of a data path using inline rasterization; Figure 61 is a Functional flow diagram of another embodiment of an in-line rasterized data path; Figure 62 is a functional flow diagram of an embodiment of a data path using instant rasterization; Figure 63 is an illustration of four small beam arrays Legend; Figure 64 A legend illustrating a stitching scheme; Fig. 65 is a diagram illustrating a writing strategy having a factor of K = 1 and κ = 3; Fig. 66 is a diagram of a possible & value of the bundle of Fig. 35 having 4 small bundles; Fig. 6 is a diagram illustrating an example of an angular layout; Fig. 69 is a diagram illustrating a vector orientation; Fig. 70 is a diagram illustrating encoding of a square feature; 93 201123254 Fig. 71 f illustrates a complex feature shape Encoded legend, Fig. 72A illustrates an example of an example of a minimum feature spacing of diagonal lengths less than one L, and lattice; a real graph 73 of features of the angular serrations is illustrated with an example attached to an example thereof; Figure 74 is a simplified schematic diagram showing one; and an embodiment of a plurality of small beam lithography systems with charged particles. Figure 75 is a diagram showing the division of the beam region and the non-beam region. [Main component symbol description 1 lithography electron source electron beam collimator lens pore array pore array pore array small beam small beam extinguisher array beam 阑 array sub-beam concentrating lens array end module target 3 4 5 6

6A6A

6B 7 9 10 20 21 22 24 94 201123254 51 52 100 101 102 103 104 110 111 112 113 114 115 116 117 118 119 120 130 131 ' 140 143 145 146 束區域 非束區域 帶電粒子微影系統 晶圓定位糸統 電子光學柱 數據途徑 目標 帶電粒子源 孔隙陣列 聚光透鏡陣列 準直器透鏡系統 XY偏轉器陣列 第二孔隙陣列 第二聚光透鏡陣列 束熄滅器陣列 束光闌陣列 束偏轉器陣列 投射透鏡陣列 電子束 132 ' 133 小束 預處理單元 電光轉換裝置 光纖 光束 95 201123254 147 透鏡 148 面鏡 149 光電轉換裝置 966B 7 9 10 20 21 22 24 94 201123254 51 52 100 101 102 103 104 110 111 112 113 114 115 116 117 118 119 120 130 131 ' 140 143 145 146 Beam area non-beam area charged particle lithography system wafer positioning system Electro-optical column data path target charged particle source aperture array concentrating lens array collimator lens system XY deflector array second aperture array second concentrating lens array beam extinguisher array beam 阑 array beam deflector array projection lens array electron Beam 132 ' 133 small beam pre-processing unit electro-optical conversion device fiber beam 95 201123254 147 lens 148 mirror 149 photoelectric conversion device 96

Claims (1)

201123254 七、申請專利範圍·· l —種用於定義使用微影處理以寫入在目標上的特徵 的方法,該種方法包含: 定義格陣列,該等特徵佔用一或多個格;以及 對於各個格,描述其屬於該格内的特徵的任何稜角。 2.如申請專利範圍第丨項之方法,其中各個稜角是由稜 帛肖里與第二向量所描述,該二個向量是起自 5茨位置。 由 申明專利1&圍第2項之方法’其中該等稜角位置是 由一個座標所描述。 4. 如申請專利範圍第2或 ¥ ^ Λ * Λ ^ , 飞項之方去,其中該等棱角位 置疋由直角座標所描述。 5. 如申請專利範圍第2頊 指定對於該向量的方向的方, 其中各個向量是由其 』里町万向的方位碼所描述。 6. 如申請專利範圍第2 由當從第-向量以預定方 =、4特徵疋定義為 與格邊S所界;t的區域。° ”二向量時的該等向量 7. 如申請專利範圍第6 時針方向。 項之方法,其中該預定方向是順 8·如申請專利範圍第1 其部分屬於一格内而不具 ,-中虛擬稜角是對於 定義。 、 以格内的任何稜角的特徵所 9.如申請專利範圍第8項 彼此方位為180度的第—鱼 ,/、中該虛擬棱角是由 /、第二向量所描述。201123254 VII. Scope of application for patents · · A method for defining features that use lithography to write on a target, the method comprising: defining a grid array, the features occupying one or more cells; Individual cells, describing any corners that belong to the features within the cell. 2. The method of claim 2, wherein each of the corners is described by a ridge and a second vector, the two vectors being from a position of 5 s. The method of claim 1 & 2, wherein the angular position is described by a coordinate. 4. If the scope of patent application is 2 or ¥ ^ Λ * Λ ^, the party of the flying item goes, where the angular position is described by a rectangular coordinate. 5. For example, the scope of the patent application is specified in the direction of the vector, where each vector is described by its position code. 6. If the scope of the patent application is 2, the region from the first vector is defined by the predetermined square =, 4 features 与 with the edge S; ° "The vector in the case of two vectors. 7. For example, the scope of the patent application is 6 o'clock. The method of the item, wherein the predetermined direction is shun. 8. If the patent application scope is the first part, the part belongs to a cell without a The angular angle is for the definition. The characteristics of any of the corners in the grid are as follows: 9. In the eighth item of the patent application, the first fish is oriented at 180 degrees to each other, and the virtual corner is described by the second vector. 97 201123254 10. 如申請專利範圍第項之方法,其中該等向量可僅具 有平行於格邊界或垂直於格邊界的方向。 八 11. 如申請專利範圍第2項之方法,其中該等向量可僅 具有平行於格邊界、垂直於格邊界或對於格邊界為45度的 方向。 間距 距》 12·如申請專利範圍第1項 ’且其中該等格具有尺寸 之方法,其中定義最小特徵 為等於或小於該最小特徵間 ’其中定義最小特徵 小於2的平方根乘 13.如申請專利範圍第1項之方法 間距,且其中該等格具有尺寸為等於或 以該最小特徵間距的一半。 門距、:申請專利範圍第1〇項之方法’其中定義最小特徵 0以八有尺寸為等於或大於該等格的尺寸乘以2的平方 根。 在心ί專利範圍第1G項之方法,其中對於具有方位 二=格邊界為45度的邊緣的特徵或部分特徵,定義最小 :微間距以具有尺寸為等於或大於該等格的尺寸…的 1項之方法,其中對於各個格可 16.如申請專利範圍第 定義最大數目個稜角。 各個格可含有 各個格包含對 17. 如申請專利範圍第1項之方法,其中 一或多個特徵及/或一或多個特徵的一邹分。 18. 如申請專利範圍第1項之方法,其中 於晶圓的部分場域的圖型數據。 98 201123254 .如申凊專利範圍第18項之方法,装中夂加从 具中各個格包令坊 日日圓的場域條帶的圖型數據。 20.-種處理用於微影法的圖型數據的方法,該種方法 包含· 〆 以向量格式提供該圖型數據; 將該向量圖型數據變換以產生基於格的格式的 據;以及 双 將該基於格的圖型數據柵格化以產生用於該 二階圖型數據。 ’的 21. 如申請專利範圍帛20社方法,其中該基於格的圖 型數據包含描述其佔用格陣列的一或多個格之特徵的格數 據'•亥格數據對於各個格描述其屬於該格内的特徵的 棱角。 · 22. 如申請專利範圍第2〇或21項之方法,其中將該基 於格的圖型數據柵格化是在實行該微影法時以即時處理二 實行。 23. 如申請專利範圍第2〇項之方法’其中將該基於格的 圖型數據柵格化包含: 將s亥基於格的圖型數據轉列以產生多階圖型數據; 將3玄多階圖型數據遞色以產生該二階圖型數據。 24. —種使用帶電粒子微影機器根據圖型數據以將晶圓 曝光的方法,該帶電粒子微影機器產生複數個帶電粒子小 束以將該晶圓曝光,該種方法包含: 以向量格式提供該圖型數據; 99 201123254 將該向董圖型數據變換以產生基於格的格式的圖型數 據; 將該基於格的圖型數據栅格化以產生二階圖型數據; 將該二階圖型數據串流到小束熄滅器陣列以將該帶電 粒子微影機器所產生的該等小束接通及切斷;以及 基於該二階圖型數據以將該等小束接通及切斷。 25. 如申請專利範圍第24項之方法,其中該基於格的圖 型數據包含描述其佔用格陣列的一或多個格之特徵的格數 據,該格數據對於各個格描述其屬於該格内的特徵的任何 稜角。 26. 如申請專利範圍第24或25項之方法,其中將該基 於格的圖型數據柵格化是在該微影機器正將該晶圓曝光時 以即時處理所實行。 W·如申請專利範圍第24項之方法,其中將該基於格的 圖型數據栅格化包含: 將该基於格的圖型數據轉列以產生多階圖型數據; 將該多階圖型數據遞色以產生該二階圖型數據。 八、圖式: (如次頁) 10097 201123254 10. The method of claim 2, wherein the vectors may only have directions parallel to the lattice boundary or perpendicular to the lattice boundary. 8. The method of claim 2, wherein the vectors may have only a direction parallel to the lattice boundary, perpendicular to the lattice boundary, or 45 degrees to the lattice boundary. Pitch "12", as in the scope of claim 1, and wherein the cell has a size, wherein the minimum feature is defined to be equal to or less than the minimum feature, wherein the minimum feature is less than 2 square root multiplied by 13. The method spacing of item 1 of the range, and wherein the element has a size equal to or half of the minimum feature spacing. Door distance, the method of claim 1 of the patent scope, wherein the minimum feature is defined as 0 having a size equal to or greater than the size of the cell multiplied by the square root of 2. The method of claim 1G, wherein for a feature or a partial feature having an edge having an azimuth two = lattice boundary of 45 degrees, a minimum is defined: the micro-spacing to have a size having a size equal to or greater than the size of the element... The method wherein each of the cells is 16. The maximum number of corners is defined as defined in the scope of the patent application. Each of the cells may contain a pair of methods, such as the method of claim 1, wherein one or more features and/or one or more features are recited. 18. The method of claim 1, wherein the pattern data of a portion of the field of the wafer. 98 201123254. For the method of claim 18 of the patent scope, the pattern data of the field strips of the various days of the box is added. 20. A method of processing pattern data for lithography, the method comprising: providing the pattern data in a vector format; transforming the vector pattern data to produce a grid-based format; and The grid-based pattern data is rasterized to produce data for the second-order pattern. 21. The method of claim 20, wherein the grid-based pattern data includes lattice data describing characteristics of one or more of its occupied grid arrays. The angularity of the features within the grid. 22. The method of claim 2, wherein the grid-based pattern data is rasterized in an instant processing when the lithography method is implemented. 23. The method of claim 2, wherein the rasterizing the grid-based pattern data comprises: translating the pattern data of the grid based on the grid to generate multi-level graph data; The meta-pattern data is dithered to produce the second-order graph data. 24. A method of using a charged particle lithography machine to expose a wafer based on pattern data, the charged particle lithography machine generating a plurality of charged particle beamlets to expose the wafer, the method comprising: Providing the graphic data; 99 201123254 transforming the data to the Dong pattern to generate the pattern data of the grid-based format; rasterizing the grid-based pattern data to generate the second-order pattern data; The data stream is streamed to the beamlet extinguisher array to turn the beamlets generated by the charged particle lithography machine on and off; and the beamlets are turned on and off based on the second order pattern data. 25. The method of claim 24, wherein the grid-based pattern data includes lattice data describing features of one or more of its occupied grid arrays, the grid data describing each of the grids belonging to the grid Any angular feature of the feature. 26. The method of claim 24, wherein the rasterizing the pattern data of the grid is performed by immediate processing when the lithography machine is exposing the wafer. W. The method of claim 24, wherein the grid-based pattern data rasterization comprises: translating the grid-based pattern data to generate multi-level pattern data; The data is dithered to produce the second order pattern data. Eight, the pattern: (such as the next page) 100
TW99116035A 2009-05-20 2010-05-20 Cell based vector format TW201123254A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US17976609A 2009-05-20 2009-05-20

Publications (1)

Publication Number Publication Date
TW201123254A true TW201123254A (en) 2011-07-01

Family

ID=45046624

Family Applications (1)

Application Number Title Priority Date Filing Date
TW99116035A TW201123254A (en) 2009-05-20 2010-05-20 Cell based vector format

Country Status (1)

Country Link
TW (1) TW201123254A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109709765A (en) * 2019-03-04 2019-05-03 江苏维普光电科技有限公司 Eliminate the defect inspection method and device of design layout tolerance
CN112859327A (en) * 2019-11-27 2021-05-28 成都理想境界科技有限公司 Image output control method and optical fiber scanning imaging system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109709765A (en) * 2019-03-04 2019-05-03 江苏维普光电科技有限公司 Eliminate the defect inspection method and device of design layout tolerance
CN109709765B (en) * 2019-03-04 2022-04-01 江苏维普光电科技有限公司 Defect detection method and device for eliminating tolerance of design layout
CN112859327A (en) * 2019-11-27 2021-05-28 成都理想境界科技有限公司 Image output control method and optical fiber scanning imaging system
CN112859327B (en) * 2019-11-27 2022-10-18 成都理想境界科技有限公司 Image output control method and optical fiber scanning imaging system

Similar Documents

Publication Publication Date Title
TWI528411B (en) Dual pass scanning
CN101427184B (en) Lithography system and projection method
TWI485530B (en) Pattern data conversion for lithography system
JP7178841B2 (en) A method of illuminating a target using a definitive positioning grid
US8378319B2 (en) System and method for generating direct-write pattern
KR20190022402A (en) Dose­related feature reshaping in an exposure pattern to be exposed in a multi­beam writing apparatus
TW201123254A (en) Cell based vector format
TW201207653A (en) A parallel image processing system
TWI547766B (en) Multi-tone rasterization
TWI515759B (en) Data path for a charged particle lithography apparatus
EP4095882A1 (en) Pattern data processing for programmable direct-write apparatus
JP7299705B2 (en) Correction of edge placement dependent on non-linear dose and blur
JP4563682B2 (en) Image forming method and apparatus