JP2014178229A

JP2014178229A - Teacher data creation method, image classification method and image classification device

Info

Publication number: JP2014178229A
Application number: JP2013052755A
Authority: JP
Inventors: Akira Matsumura; 明松村
Original assignee: Dainippon Screen Manufacturing Co Ltd
Current assignee: Dainippon Screen Manufacturing Co Ltd
Priority date: 2013-03-15
Filing date: 2013-03-15
Publication date: 2014-09-25

Abstract

PROBLEM TO BE SOLVED: To provide technology for effectively performing learning even when the sufficient number of teacher images cannot be prepared for all of a plurality of classification categories.SOLUTION: A teacher data creation method comprises: a teaching step of associating one classification category into which each of a plurality of teacher images should be classified among a plurality of classification categories with each of the plurality of teacher images; a primary creation step of using at least one of the plurality of classification categories as a target category and making a set of values of a plurality of kinds of feature quantities calculated for the teacher image associated with the target category into teacher data associated with the target category; and a secondary creation step of generating a set of values of new feature quantities corresponding to a point at which distance from a point occupied by the teacher data is within a predetermined distance in a feature quantity space consisting of the plurality of kinds of feature quantities based on one piece of teacher data and making the values of feature quantities as new teacher data associated with the target category.

Description

この発明は、複数の教師画像に基づく画像分類技術に関するものであり、特に教師画像に基づいて教師データを作成する技術に関する。 The present invention relates to an image classification technique based on a plurality of teacher images, and more particularly to a technique for creating teacher data based on teacher images.

例えば半導体やプリント基板等の製造技術分野では、製品に含まれる欠陥を検出しこれを分析・評価するために、評価対象物を顕微鏡等を介して撮像し、得られた画像について複数種の特徴量を算出して自動分類を行うことが研究されている（例えば、特許文献１参照）。この種の技術では、予め収集された欠陥画像（教師画像）のそれぞれに対して、オペレータにより、欠陥の種別に対応して予め設定された複数の分類カテゴリのいずれに分類されるべきかの関連付け（教示）が行われる。そして、各教師画像について求めた複数種の特徴量の値と、当該教師画像に関連付けられた分類カテゴリとに基づいて適宜の学習手法、例えば判別分析やロジスティック回帰分析、ニューラルネットワーク、遺伝的プログラミングまたはサポートベクタマシン等により機械学習が行われて、判別木または分類器が構成される。こうして構成された判別木や分類器を用いて未知の欠陥画像が分類される。 For example, in the field of manufacturing technology such as semiconductors and printed circuit boards, in order to detect defects contained in products and analyze / evaluate them, the object to be evaluated is imaged through a microscope, etc. Research has been conducted to calculate the amount and perform automatic classification (see, for example, Patent Document 1). In this type of technology, each of the defect images (teacher images) collected in advance is associated with one of a plurality of classification categories set in advance corresponding to the type of defect by the operator. (Teaching) is performed. Based on the values of the plurality of types of feature values obtained for each teacher image and the classification category associated with the teacher image, an appropriate learning method such as discriminant analysis or logistic regression analysis, neural network, genetic programming or Machine learning is performed by a support vector machine or the like, and a discrimination tree or a classifier is configured. An unknown defect image is classified using the discriminant tree and classifier constructed in this way.

この種の技術では、学習に用いられる教師画像が多いほど統計的には分類精度が向上する。そのため、欠陥の種別ごとにできるだけ多くの教師画像を収集することが求められる。しかしながら、例えば欠陥の発生頻度の差異に起因して、全ての欠陥種別に対して十分な数の教師画像を用意することができない場合がある。少数の教師画像を用いて機械学習を行った場合、過学習と呼ばれる学習の過剰適合現象が生じ、欠陥画像を適切に分類することができなくなるおそれがある。 In this type of technique, the classification accuracy improves statistically as the number of teacher images used for learning increases. Therefore, it is required to collect as many teacher images as possible for each type of defect. However, there may be a case where a sufficient number of teacher images cannot be prepared for all defect types due to, for example, a difference in the occurrence frequency of defects. When machine learning is performed using a small number of teacher images, an over-matching phenomenon called over-learning may occur, and defect images may not be properly classified.

一方、このような問題に対応するものとして、例えば特許文献２および３に記載された技術がある。特許文献２に記載の技術では、欠陥分類の過程で分類不能な未知の欠陥画像が出現したとき、当該欠陥画像を新たに教師画像に追加したり必要に応じ新たな分類カテゴリを創設して再学習が行われる。また、特許文献３に記載の技術では、欠陥のない良品である検査対象物を撮像した画像に何らかの加工を施すことによって疑似的な欠陥画像を生成する。そして、これを実際に得られた欠陥画像とともに教師画像とすることで教師画像の数を増大させて学習に供することにより、機械学習の効果向上が図られている。 On the other hand, there are technologies described in Patent Documents 2 and 3, for example, to deal with such problems. In the technique described in Patent Document 2, when an unknown defect image that cannot be classified appears in the defect classification process, the defect image is newly added to the teacher image, or a new classification category is created and re-created as necessary. Learning is done. In the technique described in Patent Document 3, a pseudo defect image is generated by performing some processing on an image obtained by imaging a non-defective non-defective inspection object. Then, by using this as a teacher image together with the actually obtained defect image, the number of teacher images is increased and used for learning, thereby improving the effect of machine learning.

特許第４１５５４９７号公報（例えば、図５）Japanese Patent No. 4155497 (for example, FIG. 5) 特開２０００−０５７３４９号公報（例えば、段落００５２）JP 2000-057349 A (for example, paragraph 0052) 特開２０１１−２１４９０３号公報（例えば、段落００３７）JP 2011-214903 A (for example, paragraph 0037)

上記した通り、機械学習に基づく画像分類技術においては種々の欠陥のそれぞれに対して多くの教師画像が準備されることが望ましいが、必ずしもそれが達成されない場合がある。一方、特許文献２に記載の技術では、教師画像として適切な欠陥画像が相当数出現するまで上記問題は解決されない。また特許文献３に記載の技術における疑似欠陥画像は当然に実在する欠陥を表すものではないため、その作成方法によっては、実際には発生し得ないような特徴を有する画像が教師画像として学習に供されてしまうおそれがある。すなわち、このような不適切な画像を教師画像とすることにより、学習結果に基づく分類精度が却って低下してしまうという問題が残されている。 As described above, in the image classification technique based on machine learning, it is desirable to prepare many teacher images for each of various defects, but this may not always be achieved. On the other hand, with the technique described in Patent Document 2, the above problem cannot be solved until a considerable number of defect images suitable as teacher images appear. In addition, since the pseudo defect image in the technique described in Patent Document 3 does not naturally represent an actual defect, an image having characteristics that cannot be actually generated is learned as a teacher image depending on the creation method. There is a risk of being served. That is, there remains a problem that the classification accuracy based on the learning result is lowered by using such an inappropriate image as a teacher image.

この発明は上記課題に鑑みなされたものであり、複数の分類カテゴリの全てに対して十分な数の教師画像を準備することができない場合でも、効果的に学習を行うことのできる技術を提供することを目的とする。 The present invention has been made in view of the above problems, and provides a technique capable of effectively learning even when a sufficient number of teacher images cannot be prepared for all of a plurality of classification categories. For the purpose.

この発明にかかる教師データ作成方法は、上記目的を達成するため、複数の教師画像のそれぞれに対して、複数の分類カテゴリのうち当該教師画像が分類されるべき一の分類カテゴリを関連付ける教示工程と、前記複数の分類カテゴリのうち少なくとも１つを対象カテゴリとして、当該対象カテゴリに関連付けられた前記教師画像について求めた複数種の特徴量の値の組を、当該対象カテゴリに関連付けた教師データとする一次作成工程と、一の前記教師データに基づき、前記複数種の特徴量からなる特徴量空間において当該教師データが占める点からの距離が所定距離以内である点に対応する新たな前記特徴量の値の組を生成し、それらの特徴量の値と前記対象カテゴリとを関連付けて新たな教師データとする二次作成工程とを備えている。 In order to achieve the above object, the teaching data creation method according to the present invention relates to each of a plurality of teacher images, a teaching step for associating one classification category in which the teacher image should be classified among a plurality of classification categories; , Using at least one of the plurality of classification categories as a target category, a set of a plurality of feature value values obtained for the teacher image associated with the target category is used as teacher data associated with the target category. Based on the primary creation step and the one teacher data, a new feature amount corresponding to a point that is within a predetermined distance from a point occupied by the teacher data in the feature amount space composed of the plurality of types of feature amounts. A secondary generation step of generating a set of values and associating the feature value with the target category to form new teacher data

このように構成された発明では、実在する教師画像から教師データが求められるとともに、教師データに対応する特徴量空間内の点からの距離が所定距離以内である点に対応する特徴量の組が、元の教師データと同じ分類カテゴリに関連付けられた新たな教師データとされる。 In the invention configured as described above, teacher data is obtained from an actual teacher image, and a set of feature amounts corresponding to a point within a predetermined distance from a point in the feature amount space corresponding to the teacher data is obtained. The new teacher data is associated with the same classification category as the original teacher data.

上記従来技術のように画像に何らかの加工を施して疑似欠陥画像を生成して教師画像を補充した場合、特徴量空間内では真の欠陥画像と疑似欠陥画像とが大きく離隔することがあり得る。特に特徴量空間の次元が高くなると、見かけ上は僅かな差異でも特徴量空間内では大きな差となることがある。このように特徴量空間上で本来の欠陥画像から遠い位置に生成された疑似欠陥画像を教師画像に含めることは、特徴量空間における分類カテゴリ同士の境界を不明確にし、分類精度の低下をもたらす。 When a pseudo defect image is generated by performing some processing on the image as in the above prior art and the teacher image is supplemented, the true defect image and the pseudo defect image may be largely separated in the feature amount space. In particular, when the dimension of the feature amount space is increased, even a slight difference in appearance may be a large difference in the feature amount space. Including the pseudo defect image generated at a position far from the original defect image in the feature amount space in the teacher image in this way makes the boundary between the classification categories in the feature amount space unclear and causes a reduction in classification accuracy. .

一方、本発明では、教師画像を疑似的に生成するのではなく、特徴量空間上において、既に存在する一の教師データに対応する点の周囲に新たな点を生成することで教師データを補充する。このように、元の教師データからの特徴量空間における距離が規定された教師データを新たに生成することにより、特徴量空間における分類カテゴリ間の境界を不明確にすることのない必要十分な数の教師データを補充することができる。そして、これらの教師データを学習に供することで、各分類カテゴリに対して十分な数の教師画像を準備することができない場合でも効果的な学習が可能となる。 On the other hand, in the present invention, teacher data is supplemented by generating new points around a point corresponding to one existing teacher data in the feature amount space, instead of generating a teacher image in a pseudo manner. To do. In this way, a necessary and sufficient number that does not obscure the boundaries between classification categories in the feature amount space by newly generating teacher data in which the distance in the feature amount space from the original teacher data is defined Can be supplemented with teacher data. By using these teacher data for learning, even when a sufficient number of teacher images cannot be prepared for each classification category, effective learning can be performed.

この発明において、対象カテゴリに関連付けられた教師画像が複数ある場合、例えば、それらに対応する教師データのそれぞれについて、二次作成工程を実行するようにしてもよい。こうすることにより、特定の教師データのみに基づく偏った教師データの補充を未然に回避することができ、補充された教師データを用いて適切に学習を行うことが可能となる。 In this invention, when there are a plurality of teacher images associated with the target category, for example, the secondary creation step may be executed for each of the teacher data corresponding to them. By doing so, it is possible to avoid biased supplementation of teacher data based only on specific teacher data, and it is possible to perform appropriate learning using the supplemented teacher data.

また例えば、二次作成工程により作成した新たな教師データの少なくとも１つに基づく二次作成工程をさらに実行してもよい。実在の教師画像に対応する教師データに基づく教師データの補充のみでは教師データの数が不足する場合がある。このような場合、さらに多くの教師データを補充することが望ましいが、教師画像に対応する教師データの周囲にのみ多くの教師データを作成するよりも、教師画像に対応する教師データから作成された教師データの周囲にも新たな教師データを作成する方が、より効果的な学習が可能となる点で好ましい。 Further, for example, a secondary creation process based on at least one of new teacher data created by the secondary creation process may be further executed. There may be a case where the number of teacher data is insufficient only by supplementing teacher data based on teacher data corresponding to an actual teacher image. In such a case, it is desirable to replenish more teacher data, but it was created from the teacher data corresponding to the teacher image rather than creating a lot of teacher data only around the teacher data corresponding to the teacher image. It is preferable to create new teacher data around the teacher data because more effective learning is possible.

また例えば、二次作成工程において生成する新たな教師データは、基となる教師データに対応する点を体心とする特徴量空間における超立方体の頂点に対応するものであってもよい。基となる教師データから特徴量空間において所定距離内にある点を特定する方法としては種々のものが考えられる。このうち、当該教師データに対応する点を体心とする超立方体の頂点とする方法では、当該教師データを表す各種特徴量の少なくとも１つにおいてその値に所定値を加算することのみによって新たな教師データを生成することができるので、簡単な処理で教師データを補充することができる。 Further, for example, the new teacher data generated in the secondary creation step may correspond to the vertices of the hypercube in the feature amount space whose center is the point corresponding to the base teacher data. Various methods can be considered as a method for identifying a point within a predetermined distance in the feature amount space from the base teacher data. Among these, in the method of using the point corresponding to the teacher data as a vertex of the hypercube, a new value is obtained only by adding a predetermined value to the value in at least one of various feature quantities representing the teacher data. Since the teacher data can be generated, the teacher data can be supplemented by a simple process.

また例えば、少なくとも１つの教師画像が関連付けられた分類カテゴリのうち、関連付けられた教師画像の数が最も少ないものを対象カテゴリの１つとするようにしてもよい。教師画像の絶対数が十分に多くても、一部の分類カテゴリにおいて当該分類カテゴリに関連付けられる教師画像の数が少なければ、教師画像数の不足に起因する過学習等の問題は依然として残る。欠陥分類の分野においては、欠陥の発生頻度の差異に起因してこのような教師画像数の偏りが生じやすい。このように教師画像の数が少ない分類カテゴリを対象カテゴリとして教師データの補充を行うようにすれば、教師データ数の不均衡に起因する過学習等の不都合を効果的に回避することが可能である。 Further, for example, among the category categories associated with at least one teacher image, the category having the smallest number of associated teacher images may be set as one of the target categories. Even if the absolute number of teacher images is sufficiently large, problems such as over-learning due to the shortage of the number of teacher images still remain if the number of teacher images associated with the classification category is small in some classification categories. In the field of defect classification, such a deviation in the number of teacher images tends to occur due to a difference in the frequency of occurrence of defects. By replenishing teacher data with a classification category having a small number of teacher images as a target category, it is possible to effectively avoid inconveniences such as over-learning due to an imbalance in the number of teacher data. is there.

また、この発明にかかる画像分類方法は、上記目的を達成するため、上記したいずれかの教師データ作成方法により作成した教師データと、対象カテゴリ以外の分類カテゴリに関連付けられた教師画像のそれぞれに対応し複数の特徴量の値の組からなる複数の教師データとに基づく学習を行って分類器を構成する工程と、構成された分類器を用いて分類対象画像を分類カテゴリのいずれかに分類する工程とを備えている。 Further, in order to achieve the above object, the image classification method according to the present invention corresponds to each of the teacher data created by any one of the teacher data creation methods described above and the teacher image associated with a classification category other than the target category. A classifier is configured by performing learning based on a plurality of teacher data composed of a plurality of feature value sets, and the classification target image is classified into one of the classification categories using the configured classifier. Process.

このように構成された発明では、上記のように構成された教師データ作成方法により作成された教師データを用いることで効果的に学習を行うことができるので、学習により構成された分類器を用いた分類対象画像の分類を精度よく行うことができる。 In the invention configured as described above, learning can be performed effectively by using the teacher data created by the teacher data creation method configured as described above. Therefore, the classifier configured by learning is used. The classification target images can be classified with high accuracy.

また、この発明にかかる画像分類装置は、上記目的を達成するため、複数の教師画像を取得する画像取得手段と、前記教師画像のそれぞれに対して複数の分類カテゴリのうち当該教師画像が分類されるべき一の分類カテゴリを関連付ける教示情報を受け付ける受付手段と、前記教師画像のそれぞれについて複数種の特徴量を算出する特徴量算出手段と、前記教師画像に基づき教師データを作成する教師データ作成手段と、前記教示情報と前記教師データとに基づく学習を行って分類器を構成し、該分類器を用いて分類対象画像を分類する分類手段とを備え、前記教師データ作成手段は、前記複数の教師画像のそれぞれについて求められた前記複数種の特徴量の値の組を、それぞれ当該教師画像が関連付けられた前記分類カテゴリに関連付けて１つの前記教師データとする一方、少なくとも１つの前記教師データに基づき、当該教師データが前記複数種の特徴量からなる特徴量空間において占める点からの距離が所定距離以内である点に対応する新たな前記特徴量の値の組を生成し、それらの特徴量の値の組を前記対象カテゴリに関連付けて１つの前記教師データとする。 In order to achieve the above object, an image classification apparatus according to the present invention classifies an image acquisition unit that acquires a plurality of teacher images, and the teacher image among a plurality of classification categories for each of the teacher images. Receiving means for receiving teaching information for associating one classification category to be obtained; feature quantity calculating means for calculating a plurality of types of feature quantities for each of the teacher images; and teacher data creating means for creating teacher data based on the teacher images A classifier configured to perform learning based on the teaching information and the teacher data, and classify a classification target image using the classifier, wherein the teacher data creating unit includes the plurality of classifiers. Associating the plurality of types of feature value values obtained for each of the teacher images with the classification category to which the teacher image is associated On the other hand, based on at least one of the teacher data, a new distance corresponding to a point that the teacher data occupies in the feature amount space composed of the plurality of types of feature amounts is within a predetermined distance. A set of feature value values is generated, and the set of feature value values is associated with the target category as one piece of the teacher data.

このように構成された発明では、上記した原理に基づく教師データの補充が可能となっており、実在する教師画像に対応する教師データとこうして補充された教師データとを用いた学習結果に基づく画像分類を行うことで、複数の分類カテゴリに対して十分な数の教師画像を準備することができない場合でも、分類対象画像の分類を精度よく行うことが可能である。この発明においても、教師画像の数が少ない分類カテゴリに対して教師データの補充が行われることが望ましい。 In the invention configured as described above, supplementation of teacher data based on the above-described principle is possible, and an image based on a learning result using teacher data corresponding to an existing teacher image and teacher data supplemented in this way. By performing the classification, it is possible to classify the classification target images with high accuracy even when a sufficient number of teacher images cannot be prepared for a plurality of classification categories. Also in this invention, it is desirable that supplementation of teacher data is performed for classification categories with a small number of teacher images.

この発明によれば、各分類カテゴリに対して十分な数の教師画像が準備できない場合でも、教師データを適切に補充して、画像の分類に供するための学習を効果的に行うことが可能となる。 According to the present invention, even when a sufficient number of teacher images cannot be prepared for each classification category, it is possible to effectively supplement the teacher data and effectively perform learning for image classification. Become.

この発明を好適に適用可能な検査システムの概略構成を示す図である。1 is a diagram showing a schematic configuration of an inspection system to which the present invention can be preferably applied. 欠陥検出装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a defect detection apparatus. 欠陥分類装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a defect classification device. 欠陥分類装置により実行される事前学習処理を示すフローチャートである。It is a flowchart which shows the prior learning process performed by the defect classification device. 特徴量空間における欠陥画像の分布の一例を示す図である。It is a figure which shows an example of distribution of the defect image in feature-value space. この実施形態における教師データ補充方法の原理を説明するための図である。It is a figure for demonstrating the principle of the teacher data supplement method in this embodiment. 教師データ補充の結果を例示する図である。It is a figure which illustrates the result of teacher data supplement.

図１はこの発明を好適に適用可能な検査システムの概略構成を示す図である。この検査システム１は、検査対象である半導体基板Ｓの外観に現れたピンホールや異物等の欠陥検査を行い、検出された欠陥の自動分類を行う検査システムである。検査システム１は、基板Ｓ上の検査対象領域から欠陥を検出する欠陥検出装置２と、欠陥検出装置２により検出された欠陥をより詳細に分析して欠陥の種類を特定する欠陥分類装置３とを備えている。欠陥検出装置２は基板Ｓの製造ラインに組み込まれ、検査システム１はいわゆるインライン型のシステムとなっている。 FIG. 1 is a diagram showing a schematic configuration of an inspection system to which the present invention can be preferably applied. This inspection system 1 is an inspection system that performs inspection of defects such as pinholes and foreign matters appearing on the appearance of a semiconductor substrate S to be inspected, and automatically classifies detected defects. The inspection system 1 includes a defect detection device 2 that detects a defect from an inspection target region on the substrate S, a defect classification device 3 that analyzes the defect detected by the defect detection device 2 in more detail and identifies the type of defect, and It has. The defect detection device 2 is incorporated in the production line of the substrate S, and the inspection system 1 is a so-called in-line system.

欠陥検出装置２は、基板Ｓ上の検査対象領域の画像を取得する画像取得ユニット２０と、画像取得ユニット２０からの画像データを解析して欠陥検出を行う検出ユニット２１とを備えている。画像取得ユニット２０は、例えばＣＣＤセンサまたはＣＭＯＳセンサなどの撮像素子を有し基板Ｓ上の検査対象領域を撮像することにより画像データを取得する撮像部２０１、基板Ｓを保持するステージ２０２、および、撮像部２０１に対してステージ２０２を相対的に移動させるステージ駆動部２０３を備えている。ステージ駆動部２０３はボールねじ、ガイドレールおよびモータにより構成される。撮像部２０１およびステージ駆動部２０３は検出ユニット２１に設けられた機構制御部２１１により制御されており、機構制御部２１１からの制御指令に応じて動作することで、基板Ｓ上の検査対象領域を撮像する。 The defect detection apparatus 2 includes an image acquisition unit 20 that acquires an image of an inspection target area on the substrate S, and a detection unit 21 that analyzes the image data from the image acquisition unit 20 and detects a defect. The image acquisition unit 20 has an imaging element such as a CCD sensor or a CMOS sensor, for example, and acquires an image data by acquiring an image of an inspection target area on the substrate S, a stage 202 that holds the substrate S, and A stage driving unit 203 that moves the stage 202 relative to the imaging unit 201 is provided. The stage driving unit 203 includes a ball screw, a guide rail, and a motor. The imaging unit 201 and the stage driving unit 203 are controlled by a mechanism control unit 211 provided in the detection unit 21, and operate according to a control command from the mechanism control unit 211, so that an inspection target region on the substrate S is changed. Take an image.

撮像部２０１による撮像で得られた画像データに対して欠陥検出部２１２が適宜のデータ処理を施して欠陥検出を行う。欠陥検出部２１２は、撮像部２０１から与えられた画像データに対して適宜のデータ処理、例えばノイズ除去処理や平滑化処理などを施し、こうして処理された画像データに基づき欠陥検出を行う。より具体的には、実際の基板Ｓにおいて撮像された画像と、記憶部２１３に予め記憶された無欠陥の基板の画像とを比較して、その差異が予め定められた基準を超える場合には当該画像に欠陥が含まれると判定する。欠陥検出部２１２は、検出した欠陥の画像データをインターフェース（Ｉ／Ｆ）２１４を介して欠陥分類装置３に送信する。 The defect detection unit 212 performs appropriate data processing on the image data obtained by imaging by the imaging unit 201 to detect defects. The defect detection unit 212 performs appropriate data processing such as noise removal processing or smoothing processing on the image data given from the imaging unit 201, and performs defect detection based on the image data thus processed. More specifically, when an image captured on the actual substrate S is compared with an image of a defect-free substrate stored in advance in the storage unit 213, and the difference exceeds a predetermined reference It is determined that the image includes a defect. The defect detection unit 212 transmits the detected defect image data to the defect classification device 3 via the interface (I / F) 214.

欠陥分類装置３は、欠陥検出装置２から与えられる欠陥データに基づき、検出された欠陥の種別を判定する。すなわち、欠陥分類装置３は欠陥検出装置２から与えられる欠陥画像データに基づいて当該欠陥が属すべき分類カテゴリへと欠陥を自動分類（ＡＤＣ；Automatic Defect Classification）する機能を有している。この欠陥分類装置３は、この発明にかかる画像分類装置の一実施形態であり、本発明にかかる教師データ作成方法および画像分類方法を実行可能な装置である。 The defect classification device 3 determines the type of the detected defect based on the defect data given from the defect detection device 2. That is, the defect classification device 3 has a function of automatically classifying defects (ADC: Automatic Defect Classification) into classification categories to which the defect belongs based on the defect image data given from the defect detection device 2. The defect classification device 3 is an embodiment of the image classification device according to the present invention, and is a device capable of executing the teacher data creation method and the image classification method according to the present invention.

欠陥分類装置３は、予め読み込まれた制御プログラムを実行することにより、図１に示す各機能ブロックを専用ハードウェアおよびソフトウェアにより実現する。欠陥分類装置３は、欠陥検出装置２から欠陥画像データを受信するためのインターフェース（Ｉ／Ｆ）３１、欠陥画像データに基づく自動分類により、当該欠陥の種別を判定する欠陥分類部（ＡＤＣ）３２を備えている。インターフェース３１は例えばＬＡＮ（Local Area Network）を介して他の外部機器と通信が可能となっている。 The defect classification device 3 implements each functional block shown in FIG. 1 by dedicated hardware and software by executing a control program read in advance. The defect classification device 3 includes an interface (I / F) 31 for receiving defect image data from the defect detection device 2, and a defect classification unit (ADC) 32 that determines the type of the defect by automatic classification based on the defect image data. It has. The interface 31 can communicate with other external devices via, for example, a LAN (Local Area Network).

欠陥検出装置２から与えられる画像データおよび欠陥分類部３２における処理過程で生じる種々の処理データは必要に応じて記憶部３３に随時保存される。欠陥分類部３２は、与えられた欠陥画像データに基づき特徴量算出部３４が算出した複数種の特徴量の値に基づき、当該欠陥画像を、欠陥種別に対応して予め設定された複数の分類カテゴリのいずれかに分類する。 The image data given from the defect detection device 2 and various processing data generated in the process in the defect classification unit 32 are stored in the storage unit 33 as needed. The defect classifying unit 32 classifies the defect image into a plurality of classifications set in advance corresponding to the defect type based on the values of the plurality of types of feature amounts calculated by the feature amount calculating unit 34 based on the given defect image data. Sort into one of the categories.

さらに、欠陥分類装置３は、ユーザからの操作入力を受け付けるキーボードおよびマウスなどの入力受付部３５および操作手順や処理結果等のユーザ向け視覚情報を表示する表示部３６を備えている。 Furthermore, the defect classification device 3 includes an input receiving unit 35 such as a keyboard and a mouse that receives an operation input from a user, and a display unit 36 that displays visual information for the user such as an operation procedure and a processing result.

欠陥分類部３２は、検出された欠陥をＳＶＭ（サポートベクタマシン；Support Vector Machine）、ニューラルネットワーク、決定木、判別分析等の学習アルゴリズムを利用して分類する処理をソフトウェア的に実行する。具体的には、欠陥検査装置２から与えられて記憶部３３に記憶保存された欠陥画像データと、当該欠陥画像が分類されるべき分類カテゴリに関するユーザからの教示情報とに基づいて予め適宜の学習アルゴリズムによる機械学習を行っておく。そして、未分類の欠陥画像に対応する画像データが新たに与えられると、学習結果に基づく自動分類を行って新たな欠陥画像をいずれかの分類カテゴリに分類する。 The defect classification unit 32 executes a process of classifying the detected defect using a learning algorithm such as SVM (Support Vector Machine), neural network, decision tree, discriminant analysis, and the like in software. Specifically, appropriate learning is performed in advance based on defect image data given from the defect inspection apparatus 2 and stored and stored in the storage unit 33, and teaching information from a user regarding a classification category in which the defect image should be classified. Perform machine learning using an algorithm. When new image data corresponding to an unclassified defect image is given, automatic classification based on the learning result is performed to classify the new defect image into any one of the classification categories.

図２は欠陥検出装置の動作を示すフローチャートである。欠陥検出装置２は、検査対象基板Ｓを撮像し、その撮像結果から欠陥を検出するための動作として以下の欠陥検出処理を実行する。最初に検査対象基板Ｓが画像取得ユニット２０のステージ２０２にロードされると（ステップＳ１０１）、機構制御部２１１が撮像部２０１およびステージ駆動部２０３を制御して、基板Ｓ上の所定の検査対象位置を撮像し画像を取得する（ステップＳ２０２）。こうして得られた画像データに対して、欠陥検出部２１２は、必要に応じて例えば輝度調整やノイズ除去等の処理を行う。さらに、予め記憶部２１３に記憶されている無欠陥の基板の画像データのうち、当該検査対象位置に対応する画像データを読み出して、撮像により得られた画像データとの画素単位の差分を算出する（ステップＳ１０３）。 FIG. 2 is a flowchart showing the operation of the defect detection apparatus. The defect detection apparatus 2 images the inspection target substrate S, and executes the following defect detection processing as an operation for detecting defects from the imaging result. When the inspection target substrate S is first loaded on the stage 202 of the image acquisition unit 20 (step S101), the mechanism control unit 211 controls the imaging unit 201 and the stage driving unit 203 to perform a predetermined inspection target on the substrate S. The position is imaged and an image is acquired (step S202). For the image data obtained in this way, the defect detection unit 212 performs processes such as brightness adjustment and noise removal as necessary. Further, the image data corresponding to the inspection target position is read out from the image data of the defect-free substrate stored in the storage unit 213 in advance, and the difference in pixel units from the image data obtained by imaging is calculated. (Step S103).

撮像された画像に欠陥が含まれなければ予め記憶されていた無欠陥の画像との差は小さい一方、欠陥が含まれていれば大きな差異が現れる。このことを利用して、両者の差分を求めることで欠陥の有無を判定することが可能である。具体的には、得られた差分画像に対して適宜の平滑化処理および二値化処理を行って（ステップＳ１０４）、有意な差が認められる領域があればその部分を欠陥部分として抽出する（ステップＳ１０５）。欠陥が検出された画像については（ステップＳ１０６）、当該画像を欠陥画像として、インターフェース２１４を介して欠陥分類装置３に送信する（ステップＳ１０７）。上記処理を、検査対象位置を順次変えながら基板Ｓの全体について行うことで（ステップＳ１０８）、基板Ｓの検査が完了する。 If the captured image does not include a defect, the difference from the previously stored defect-free image is small, but if a defect is included, a large difference appears. By utilizing this fact, it is possible to determine the presence or absence of a defect by obtaining the difference between the two. Specifically, appropriate smoothing processing and binarization processing are performed on the obtained difference image (step S104), and if there is a region where a significant difference is recognized, that portion is extracted as a defective portion ( Step S105). For an image in which a defect is detected (step S106), the image is transmitted as a defect image to the defect classification device 3 via the interface 214 (step S107). By performing the above processing on the entire substrate S while sequentially changing the inspection target position (step S108), the inspection of the substrate S is completed.

欠陥検出装置２は、検査対象基板Ｓの各位置を検査して欠陥の有無を判定する。一方、欠陥分類装置３は、検出された欠陥の種別を判定して、欠陥の発生傾向を分析してその結果を製造ラインにフィードバックしたり、欠陥検出装置２による欠陥検出処理の処理内容の調整を行うのに用いられる。検出された欠陥の分析を欠陥分類装置３により行うことで、欠陥検出装置２は欠陥検出に特化した処理を実行することができるので、高速での欠陥検出が可能となる。 The defect detection device 2 inspects each position of the inspection target substrate S to determine whether there is a defect. On the other hand, the defect classification device 3 determines the type of detected defect, analyzes the tendency of occurrence of the defect, and feeds back the result to the production line, or adjusts the processing content of the defect detection processing by the defect detection device 2 Used to do By analyzing the detected defect by the defect classification device 3, the defect detection device 2 can execute a process specialized for defect detection, so that a defect can be detected at high speed.

図３は欠陥分類装置の動作を示すフローチャートである。上記した通り、欠陥分類装置３は欠陥検出装置２から出力される欠陥画像を、その欠陥種別に応じた分類カテゴリに分類する。より具体的には、欠陥分類部３２が予め収集された欠陥画像データに基づき事前学習（後述）を行っておき（ステップＳ２０１）、未分類の欠陥画像が欠陥検出装置２から新たに与えられると（ステップＳ２０２）、特徴量算出部３４が当該欠陥画像を特徴付ける複数の特徴量の値を算出する（ステップＳ２０３）。そして、欠陥分類部３２が、算出された特徴量の値に基づき、事前学習により構成された分類器を用いて当該欠陥画像を複数の分類カテゴリのいずれかに分類する（ステップＳ２０４）。 FIG. 3 is a flowchart showing the operation of the defect classification apparatus. As described above, the defect classification device 3 classifies the defect image output from the defect detection device 2 into a classification category corresponding to the defect type. More specifically, when the defect classification unit 32 performs pre-learning (described later) based on the defect image data collected in advance (step S201), an unclassified defect image is newly given from the defect detection device 2. (Step S202), the feature quantity calculation unit 34 calculates a plurality of feature quantity values that characterize the defect image (Step S203). Then, based on the calculated feature value, the defect classification unit 32 classifies the defect image into one of a plurality of classification categories using a classifier configured by prior learning (step S204).

分類カテゴリは当該基板Ｓにおいて発生し得る複数の欠陥種別に対応して複数種が予め設定されており、例えば異物、傷、パターン断線、短絡などが代表的なものであるが、これらに限定されるものではない。 A plurality of types of classification categories are set in advance corresponding to a plurality of types of defects that may occur in the substrate S, and for example, foreign materials, scratches, pattern disconnections, short circuits, and the like are representative, but are not limited thereto. It is not something.

図４は欠陥分類装置により実行される事前学習処理を示すフローチャートである。事前学習処理は、種々の欠陥を撮像した教師画像と、ユーザから教示される、当該教師画像がどの分類カテゴリに分類すべきものであるかを示す教示情報とに基づいて行われる。具体的には、まず最初に教師画像の収集を行う。ここでは欠陥検出装置２から出力される欠陥画像を教師画像としてこれを順次記憶部３３に保存することで収集を行うものとするが（ステップＳ３０１）、例えば予め用意された画像ライブラリや外部のストレージに収集された教師画像データを取得する態様であってもよい。 FIG. 4 is a flowchart showing a pre-learning process executed by the defect classification apparatus. The pre-learning process is performed based on a teacher image obtained by imaging various defects and teaching information indicating which classification category the teacher image should be classified, which is taught by the user. Specifically, teacher images are collected first. Here, the defect images output from the defect detection apparatus 2 are collected as teacher images, which are sequentially stored in the storage unit 33 (step S301). For example, an image library prepared in advance or an external storage is used. It is also possible to obtain the teacher image data collected in the above.

こうして収集された多数の教師画像のそれぞれについて、特徴量算出部３４が特徴量の算出を行う（ステップＳ３０２）。ここで算出される特徴量の種類は、未知の欠陥画像の分類を行うときに用いられるものと同じである。算出された特徴量の値は、各教師画像に対応する教師データの一部として記憶部３３に記憶保存される。 For each of the many teacher images collected in this way, the feature amount calculation unit 34 calculates a feature amount (step S302). The types of feature quantities calculated here are the same as those used when classifying unknown defect images. The calculated feature value is stored and saved in the storage unit 33 as part of the teacher data corresponding to each teacher image.

次に、各教師画像に対して、当該教師画像が分類されるべき分類カテゴリを関連付ける。より具体的には、収集された教師画像を順に表示部３６に表示させ、当該教師画像がどの欠陥種別に属するものであるかについて、入力受付部３５を介してユーザから教示入力を受け付ける。そして、教師画像と、教示された欠陥種別に対応する分類カテゴリとを関連付けて（ステップＳ３０３）、当該教師画像の特徴量の値と、当該教師画像に関連付けられた分類カテゴリを指定するユーザからの教示情報とをセットにして、これを当該教師画像に対応する教師データとして記憶部３３に記憶保存しておく。 Next, a classification category in which the teacher image is to be classified is associated with each teacher image. More specifically, the collected teacher images are sequentially displayed on the display unit 36, and a teaching input is received from the user via the input receiving unit 35 regarding which defect type the teacher image belongs to. Then, the teacher image is associated with the classification category corresponding to the taught defect type (step S303), and the feature value of the teacher image and the user who designates the classification category associated with the teacher image are specified. The teaching information is set as a set and stored in the storage unit 33 as teacher data corresponding to the teacher image.

続いて、こうして準備された教師データ、すなわち教師画像を表す特徴量の値の組と教示情報とに基づき、欠陥分類部３２が適宜の学習アルゴリズムによる学習を行い、分類器を構成する（ステップＳ３０４）。そして、学習結果を検証するため、構成された分類器を用いて教師データ自身の分類を行い（ステップＳ３０５）、その分類成績を算出する（ステップＳ３０６）。分類成績は適宜の性能評価尺度を用いて表すことができ、例えば分類カテゴリ毎のpurityおよびaccuracyにより表すことができる。こうして算出された分類成績が表示部３６に表示されることで、ユーザは学習の適切さを評価することができる。 Subsequently, based on the teacher data prepared in this way, that is, based on the set of feature value values representing the teacher image and the teaching information, the defect classification unit 32 performs learning using an appropriate learning algorithm to configure a classifier (step S304). ). Then, in order to verify the learning result, the teacher data itself is classified using the configured classifier (step S305), and the classification result is calculated (step S306). The classification results can be expressed using an appropriate performance evaluation scale, and can be expressed by, for example, purity and accuracy for each classification category. The classification result calculated in this way is displayed on the display unit 36, so that the user can evaluate the appropriateness of learning.

統計的には用意された教師画像の数が多いほど有効な学習を行うことができるが、欠陥の発生頻度はその種別によってまちまちであり、欠陥種別ごとに必ずしも十分な数の欠陥画像を収集することができるとは限らない。特に、製造プロセスを新たに立ち上げた直後では、ある欠陥種別については多くの欠陥画像が収集されても他の欠陥種別については僅かしか欠陥画像が得られないことがある。 Statistically, the greater the number of prepared teacher images, the more effective learning can be performed, but the frequency of defects varies depending on the type of defect, and a sufficient number of defect images are collected for each defect type. It is not always possible. In particular, immediately after a new manufacturing process is started, even if many defect images are collected for a certain defect type, only a few defect images may be obtained for other defect types.

図５は特徴量空間における欠陥画像の分布の一例を示す図である。欠陥画像の分類に用いられる特徴量としては種々のものが知られており、実際の欠陥分類においても多種類の特徴量が用いられる。このため、使用される複数種の特徴量のそれぞれを一の座標軸とする特徴量空間も多次元空間となるが、ここでは原理を理解しやすくするために、２種類の特徴量Ｘ1，Ｘ2からなる２次元の特徴量空間を考える。収集された欠陥画像（教師画像）をその特徴量の値に応じて特徴量空間にプロットすると、例えば図５に示すように、類似した特徴を有する欠陥画像がある程度まとまっていくつかのクラスタを形成する。図５における各点は、欠陥画像を特徴量で表したときそれらの値を特徴量空間における座標値として持つ点を表しており、それぞれの点が１つの欠陥画像に対応する。同じ欠陥種別に分類される欠陥画像は当然に互いに類似した画像上の特徴を持つため、特徴量空間内でも互いに近接すると考えられる。すなわち、同じ欠陥種別に属する欠陥画像が特徴量空間内で１つのクラスタを形成する。 FIG. 5 is a diagram illustrating an example of a defect image distribution in the feature amount space. Various types of feature values are known for use in defect image classification, and many types of feature values are used in actual defect classification. For this reason, the feature amount space in which each of a plurality of types of feature amounts is used as one coordinate axis is also a multidimensional space. Here, in order to facilitate understanding of the principle, two types of feature amounts X1 and X2 are used. Consider the following two-dimensional feature space. When the collected defect image (teacher image) is plotted in the feature amount space according to the feature value, for example, as shown in FIG. 5, defect images having similar features gather to some extent to form several clusters. To do. Each point in FIG. 5 represents a point having these values as coordinate values in the feature amount space when the defect image is represented by a feature amount, and each point corresponds to one defect image. Since defect images classified into the same defect type naturally have similar features on the images, it is considered that they are close to each other in the feature amount space. That is, defect images belonging to the same defect type form one cluster in the feature amount space.

欠陥種別ごとの特徴量空間におけるクラスタの広がりが既知であれば、未知の欠陥画像が与えられたときに、当該欠陥画像に対応する点が占める特徴量空間の位置から当該欠陥画像の欠陥種別を判定することができる。これを可能とするために、教師画像に基づく学習を行う。 If the spread of clusters in the feature amount space for each defect type is known, when an unknown defect image is given, the defect type of the defect image is determined from the position of the feature amount space occupied by the point corresponding to the defect image. Can be determined. In order to make this possible, learning based on a teacher image is performed.

ここで、例えば図５に例示する欠陥種別ＡおよびＢにはそれぞれ比較的多くの教師画像が関連付けられており、これらに対応する多くの点によりクラスタが形成されている。したがって、当該欠陥種別の類型の多くが出揃っていると考えられ、クラスタの広がり範囲を比較的高い精度で推定することができる。したがって、特徴量空間内で当該広がり範囲内の位置を占める未知の欠陥画像が、高い確度で当該欠陥種別に属するものと判断することができる。また欠陥種別ＡとＢとの境界が特定しやすく、これらの間での誤分類の確率を低くすることができる。 Here, for example, relatively many teacher images are associated with the defect types A and B illustrated in FIG. 5, and a cluster is formed by many points corresponding to these. Therefore, it is considered that many types of the defect types are available, and the spread range of the cluster can be estimated with relatively high accuracy. Therefore, it is possible to determine that an unknown defect image that occupies a position within the spread range in the feature amount space belongs to the defect type with high accuracy. Further, it is easy to specify the boundary between the defect types A and B, and the probability of misclassification between them can be reduced.

一方、例えば図５に示す欠陥種別Ｃのように、関連付けられる教師画像の数が少ない場合、特徴量空間において当該欠陥種別が分布する範囲を特定することが困難である。すなわち、例えば一点鎖線で示すように比較的小さな広がりであるのか、あるいは二点鎖線で示すようにより大きな広がりであるのかについて、現状の学習アルゴリズムでは既存の教師画像のみで的確に判断することができない。そのため、特徴量空間における欠陥種別ＡとＣとの境界、および欠陥種別ＢとＣとの境界の特定が困難となり、結果として誤分類が増えてしまうことになる。このような現象は過学習と呼ばれる学習の過剰適合現象によるものであり、分類カテゴリ間で教師画像の数の不均衡が大きいときに顕著となる。このような場合に対応するために、この実施形態の事前学習処理は、必要に応じて教師データを補充して再学習を行うことができるように構成されている。 On the other hand, when the number of associated teacher images is small, such as the defect type C shown in FIG. 5, it is difficult to specify a range in which the defect type is distributed in the feature amount space. That is, for example, whether it is a relatively small spread as indicated by a one-dot chain line or a larger spread as indicated by a two-dot chain line cannot be accurately determined only by an existing teacher image. . Therefore, it becomes difficult to specify the boundary between the defect types A and C and the boundary between the defect types B and C in the feature amount space, and as a result, misclassification increases. Such a phenomenon is due to an overfitting phenomenon of learning called overlearning, and becomes prominent when there is a large imbalance in the number of teacher images between classification categories. In order to cope with such a case, the pre-learning process of this embodiment is configured such that teacher data can be supplemented and re-learning can be performed as necessary.

この問題を解決するために前述した従来技術の技術思想を適用すると、教師画像数の少ない欠陥種別の欠陥を模した画像を疑似的に作成し、該画像を教師画像として補充し学習を行うこととなる。しかしながら、たとえ欠陥の発生メカニズムを模したとしても実際に発生した欠陥ではないため、見かけ上は僅かな画像の差異であっても多次元特徴量空間上では大きな差異となり、各欠陥種別に属する欠陥画像が作るクラスタの広がりが却って不明瞭となってしまうことがあり得る。 Applying the above-described prior art technical idea to solve this problem, it is possible to artificially create an image simulating a defect of a defect type with a small number of teacher images, and to supplement the image as a teacher image and perform learning It becomes. However, even if the defect generation mechanism is imitated, it is not an actual defect, so even a slight difference in the image is a large difference in the multidimensional feature space, and the defect belongs to each defect type. The spread of the clusters created by the image may be obscured.

そこで、この実施形態では、既に存在する教師データが特徴量空間において占める点の近傍に、補充により作成される新たな教師データに対応する点を派生させることにより、教師データの補充を行う。すなわち、実画像として欠陥画像を模した画像を作成することはしない。このため、特徴量空間における既存の教師データからの新たな教師データまでの距離を限定されたものとすることができる。 Therefore, in this embodiment, teacher data is replenished by deriving a point corresponding to new teacher data created by replenishment in the vicinity of a point occupied by already existing teacher data in the feature amount space. That is, an image imitating a defect image is not created as a real image. For this reason, the distance from the existing teacher data to the new teacher data in the feature amount space can be limited.

図６はこの実施形態における教師データ補充方法の原理を説明するための図である。図６（ａ）に示すように、既存の教師データに対応する特徴量空間内の点Ｐ（ｖ1，ｖ2）を考える。この点Ｐの座標値ｖ1およびｖ2は、それぞれこの点に対応する教師画像について求めた特徴量Ｘ1およびＸ2の値に相当する。そして、当該点Ｐから特徴量空間内で所定の距離内に新たな点Ｑを設定し、当該点Ｑの座標値に対応する特徴量の値と、作成の基になった点Ｐの分類カテゴリとを関連付けて、これを新たな教師データとする。 FIG. 6 is a view for explaining the principle of the teacher data supplementing method in this embodiment. As shown in FIG. 6A, a point P (v1, v2) in the feature amount space corresponding to existing teacher data is considered. The coordinate values v1 and v2 of the point P correspond to the values of the feature amounts X1 and X2 obtained for the teacher image corresponding to this point, respectively. Then, a new point Q is set within a predetermined distance in the feature amount space from the point P, and the feature value corresponding to the coordinate value of the point Q and the classification category of the point P that is the basis of the creation Are used as new teacher data.

原理的には、例えば点Ｐを中心とし所定の半径Ｒを有する特徴量空間内の超球面（二次元特徴量空間では破線で示される円）上の１点を点Ｑとすることができるが、より簡易的には、点Ｐを体心とする特徴量空間内の超立方体（二次元特徴量空間では点線で示される正方形）の１つの頂点として点Ｑを設定することができる。多次元特徴量空間における超球面は多数の特徴量の関数により表されるので、超球面上の点の座標を求めるための計算が煩雑である。これに対して、特徴量空間内の超立方体の頂点の座標は、超立方体の辺を各座標軸の方向と平行としておけば、各座標軸の座標値に一定値を加算するのみで求めることができる。 In principle, for example, one point on a hypersphere (a circle indicated by a broken line in the two-dimensional feature space) having a predetermined radius R with the point P as the center can be set as the point Q. More simply, the point Q can be set as one vertex of a hypercube (a square indicated by a dotted line in the two-dimensional feature amount space) in the feature amount space having the point P as a body center. Since the hypersphere in the multi-dimensional feature quantity space is represented by a large number of feature quantity functions, the calculation for obtaining the coordinates of the points on the hypersphere is complicated. On the other hand, the coordinates of the vertices of the hypercube in the feature amount space can be obtained only by adding a constant value to the coordinate values of each coordinate axis if the sides of the hypercube are parallel to the direction of each coordinate axis. .

二次元特徴量空間を表した図６（ａ）の例では、基となる点ＰのＸ1座標値ｖ1に所定値Δｖ1を加算または減算した値を、点ＱのＸ1座標値とすることができる。また、点ＰのＸ2座標値ｖ2に所定値Δｖ2を加算または減算した値を、点ＱのＸ2座標値とすることができる。図６（ａ）では点Ｐに対して右上の頂点に点Ｑ（ｖ1＋Δｖ1，ｖ2＋Δｖ2）を作成しているが、他の頂点に新たな点を作成してもよい。また２以上の頂点をそれぞれ新たな点としてもよい。 In the example of FIG. 6A showing the two-dimensional feature amount space, a value obtained by adding or subtracting a predetermined value Δv1 to the X1 coordinate value v1 of the base point P can be used as the X1 coordinate value of the point Q. . A value obtained by adding or subtracting the predetermined value Δv2 to the X2 coordinate value v2 of the point P can be used as the X2 coordinate value of the point Q. In FIG. 6A, the point Q (v1 + Δv1, v2 + Δv2) is created at the upper right vertex with respect to the point P, but a new point may be created at another vertex. Two or more vertices may be used as new points.

なお、各特徴量はそれぞれ個別の算出基準に基づき数値化されるので、特徴量ごとに数値のスケールが異なることが当然にあり得る。この点に鑑み、特徴量空間内の超球または超立方体を有意なものとするために、この場合の各特徴量の値は正規化されたものであることが好ましい。特徴量の正規化演算は、例えば、点Ｐに対応する教師データにおける特徴量の値を、全教師データ間での当該特徴量の最大値で除することにより行うことができる。例えば点ＰのＸ1座標値ｖ1については、点Ｐに対応する教師データにおける特徴量Ｘ1の値をｖ0、同特徴量Ｘ1の全ての教師データ間での最大値をｖmaxとしたとき、
ｖ1 ＝ｖ0／ｖmax … （式１）
により求めることができる。 In addition, since each feature-value is digitized based on each separate calculation standard, naturally the scale of a numerical value may differ for every feature-value. In view of this point, in order to make the hypersphere or hypercube in the feature quantity space significant, it is preferable that the value of each feature quantity in this case is normalized. The feature amount normalization calculation can be performed, for example, by dividing the feature amount value in the teacher data corresponding to the point P by the maximum value of the feature amount among all the teacher data. For example, for the X1 coordinate value v1 of the point P, when the value of the feature quantity X1 in the teacher data corresponding to the point P is v0 and the maximum value among all the teacher data of the feature quantity X1 is vmax,
v1 = v0 / vmax (Formula 1)
It can ask for.

また例えば、点Ｐに対応する教師データにおける特徴量の値を、全教師データ間での当該特徴量の最大値と最小値との差で除することにより行うことができる。例えば点ＰのＸ1座標値ｖ1については、点Ｐに対応する教師データにおける特徴量Ｘ1の値をｖ0、同特徴量Ｘ1の全ての教師データ間での最大値をｖmax、最小値をｖminとしたとき、
ｖ1＝ｖ0／（ｖmax−ｖmin） … （式２）
により求めることができる。特徴量の数値を結果として求めることを目的としないため、算出ルールが各特徴量間で統一されていれば、（式１）、（式２）のいずれの算出方法によってもよい。 Further, for example, it can be performed by dividing the feature value in the teacher data corresponding to the point P by the difference between the maximum value and the minimum value of the feature value among all the teacher data. For example, for the X1 coordinate value v1 of the point P, the value of the feature quantity X1 in the teacher data corresponding to the point P is v0, the maximum value among all the teacher data of the feature quantity X1 is vmax, and the minimum value is vmin. When
v1 = v0 / (vmax−vmin) (Expression 2)
It can ask for. Since the purpose is not to obtain the numerical value of the feature value as a result, any calculation method of (Expression 1) or (Expression 2) may be used as long as the calculation rule is unified among the feature values.

上記のようにして点Ｐから派生的に作成される点Ｑを、以下の説明では点Ｐの「子」と表現することがある。また、基となった点Ｐを点Ｑの「親」と表現することがある。また点Ｐに対応する教師データを「親データ」、点Ｑに対応する教師データを「子データ」と表現することがある。 In the following description, the point Q that is derived from the point P as described above may be expressed as a “child” of the point P. In addition, the base point P may be expressed as the “parent” of the point Q. The teacher data corresponding to the point P may be expressed as “parent data”, and the teacher data corresponding to the point Q may be expressed as “child data”.

図４に戻って、事前学習処理の説明を続ける。ステップＳ３０６で求められ表示部３６に表示された分類成績が不十分であるとユーザが判断したときには、入力受付部３５を介して教師データの補充を指示することができる。教師データの補充が必要とされたとき（ステップＳ３０７においてＹＥＳ）、続いて教師データ数の不足している欠陥種別がユーザ入力により指定される（ステップＳ３０８）。また、欠陥種別ごとの教師データ数に応じて、データを補充すべき欠陥種別が自動的に選択されるようにしてもよい。例えば教師データ補充の効果は元の教師データ数の少ない欠陥種別において特に顕著であることから、教師データ数の少ない欠陥種別から順に補充を行うようにすることができる。この場合、例えば何番目の欠陥種別まで補充を行うかをユーザに指定させてもよい。 Returning to FIG. 4, the description of the pre-learning process will be continued. When the user determines that the classification result obtained in step S306 and displayed on the display unit 36 is insufficient, it is possible to instruct supplementation of teacher data via the input receiving unit 35. When it is necessary to supplement teacher data (YES in step S307), a defect type for which the number of teacher data is insufficient is designated by user input (step S308). Further, the defect type to be supplemented with data may be automatically selected according to the number of teacher data for each defect type. For example, since the effect of supplementing teacher data is particularly noticeable for defect types with a small number of original teacher data, supplementation can be performed in order from a defect type with a small number of teacher data. In this case, for example, the user may be allowed to specify how many defect types are replenished.

そして、指定された欠陥種別に対して教師データの補充を行う。データの補充は、上記した通り、特徴量空間において既存の教師データの近傍に新たな教師データを派生させることにより行う。このとき、特定の教師データの周囲にのみ新しい教師データが生成されるのを防止するために、「子」を持たない教師データのみを「親」として、新たな子データを生成する（ステップＳ３０９）。子を持たない全ての教師データについて子データを派生させることで、教師データの数が増大する。１つの親データからＮ個の子データを生成した場合、教師データの数は元のＮ倍となる。 Then, teacher data is supplemented for the designated defect type. As described above, the supplementation of data is performed by deriving new teacher data near the existing teacher data in the feature amount space. At this time, in order to prevent new teacher data from being generated only around specific teacher data, new child data is generated by setting only the teacher data having no “child” as “parent” (step S309). ). By deriving child data for all teacher data having no children, the number of teacher data increases. When N child data are generated from one parent data, the number of teacher data is N times the original data.

多次元特徴量空間における超立方体の頂点は多数あり、その全てに子を生成すると膨大な数の教師データが生成されてしまい、欠陥種別ごとの教師データ数が却って不均衡となる。１つの親から生成する子の生成数を予め定めておき、多数の頂点からその生成数に対応する数の頂点を選択してそれらの頂点に子を生成するのが好ましい。選択のルールは任意であり、例えば乱数を用いて無作為に決定することができる。ただし、各頂点のうちその位置またはその近傍に既に教師データに対応する点があるものについては、選択から除外されることが好ましい。こうすることで、教師データの偏在を防止することができる。 There are many vertices of the hypercube in the multi-dimensional feature amount space, and if a child is generated for all of them, a huge number of teacher data is generated, and the number of teacher data for each defect type is unbalanced. Preferably, the number of children to be generated from one parent is determined in advance, and a number of vertices corresponding to the number of generations are selected from a large number of vertices, and children are generated at those vertices. The selection rule is arbitrary, and can be determined at random using, for example, a random number. However, among the vertices, those that already have a point corresponding to the teacher data at or near that position are preferably excluded from selection. In this way, uneven distribution of teacher data can be prevented.

教師データを作成しようとする点の近傍に他の教師データが存在するか否かについては、例えば特徴量空間におけるユークリッド距離により判断することができる。ただし、既存の全ての教師データについて演算を行うことは計算量が膨大となるため現実的でない。これに代えて、例えば当該点を内部に含むまたは当該点を体心とする適宜のサイズの超立方体を仮定し、その中に含まれる他の点を探索する方法を用いることができる。このようにした場合、座標軸ごとの、すなわち特徴量ごとの数値の相互比較により近傍点を探索することができるので、必要な演算量を大幅に軽減することが可能である。 Whether or not there is other teacher data in the vicinity of the point where the teacher data is to be created can be determined, for example, from the Euclidean distance in the feature amount space. However, it is not practical to perform calculations on all existing teacher data because the calculation amount is enormous. Instead of this, for example, it is possible to use a method of searching for other points included in a hypercube having an appropriate size including the point inside or having the point as a body center. In this case, it is possible to search for neighboring points by mutual comparison of numerical values for each coordinate axis, that is, for each feature amount, so that it is possible to greatly reduce the necessary calculation amount.

上記のようにして新たに生成された教師データについては、元の教師データと同じ分類カテゴリ（欠陥種別）に属するものとして、以後は元の教師データと区別することなく扱う。続いて、新たに生成された教師データと元の教師データとを用い、性能評価を行う（ステップＳ３１０）。この性能評価は、前述したステップＳ３０４ないしＳ３０６の処理と同じものである。このとき分類成績に変化がなければ（ステップＳ３１１においてＮＯ）、教師データの補充が十分でないとみなし、ステップＳ３０９に戻って再びデータ補充を行う。 The teacher data newly generated as described above is treated as being belonging to the same classification category (defect type) as the original teacher data without being distinguished from the original teacher data. Subsequently, performance evaluation is performed using the newly generated teacher data and the original teacher data (step S310). This performance evaluation is the same as the processing in steps S304 to S306 described above. At this time, if there is no change in the classification result (NO in step S311), it is considered that the supplementation of the teacher data is not sufficient, and the process returns to step S309 and the data supplement is performed again.

このとき、先のデータ補充で親となった教師データについては再びこれを親とせず、先のデータ補充で新たに生成された子を新たな親としてデータ補充を行う。すなわち、図６（ｂ）に示すように、先のデータ補充で点Ｐを親として子Ｑ１が生成されたとき、次のデータ補充では、点Ｑ１を親として子Ｑ２を生成する。またこの場合、点Ｑ１を体心とする超立方体の頂点のうち、点Ｐとは異なる頂点に子Ｑ２を生成する。 At this time, the teacher data that became the parent in the previous data supplementation is not used as the parent again, and the data newly supplemented by using the child newly generated in the previous data supplementation as the new parent. That is, as shown in FIG. 6B, when the child Q1 is generated with the point P as the parent in the previous data supplementation, the child Q2 is generated with the point Q1 as the parent in the next data supplementation. Further, in this case, a child Q2 is generated at a vertex different from the point P among the vertices of the hypercube having the point Q1 as a body center.

このように、教師データを補充するに際しては、既存の教師データのうち過去のデータ補充において親となっていないもの、すなわち子を持たない教師データから新たな子を生成する。また、親を体心とする超立方体の頂点のうち、その位置または近傍位置に教師データが存在しない点に教師データを補充する。このようにすることで、収集された教師画像に対応する教師データの周囲に偏りのない教師データを補充し、特徴量空間において当該欠陥種別が占める空間を拡充してゆくことがことができる。 Thus, when supplementing teacher data, a new child is generated from the existing teacher data that is not a parent in the past data supplement, that is, teacher data having no children. Also, the teacher data is supplemented to the point where the teacher data does not exist at or near the vertex of the hypercube having the parent as the body. By doing so, it is possible to supplement teacher data with no bias around the teacher data corresponding to the collected teacher image, and to expand the space occupied by the defect type in the feature amount space.

データ補充後の性能評価で分類成績に変化が見られたとき（図４のステップＳ３１１においてＹＥＳ）、その結果をユーザに提示してさらなる教師データの補充が必要か否かを判断させる（ステップＳ３０７）。そして、補充が必要ないと判断されるまで上記処理を繰り返すことにより、教師データの補充が完了する。 When there is a change in the classification result in the performance evaluation after data supplementation (YES in step S311 in FIG. 4), the result is presented to the user to determine whether further supplementation of teacher data is necessary (step S307). ). Then, by repeating the above processing until it is determined that supplementation is not necessary, supplementation of teacher data is completed.

図７は教師データ補充の結果を例示する図である。欠陥種別Ｃを対象として教師データの補充を行ったとき、特徴量空間上では、収集された教師画像に対応する教師データ（黒丸印で示す）の周辺に補充された教師データ（白丸印で示す）が偏りなく分布した状態となる。これにより、欠陥種別Ｃに属するクラスタの外延が図５に示す補充前の状態よりも明確となる。その結果、これらの教師データに基づく学習が効果的に行われ、誤分類の少ない分類器を構成することが可能となる。 FIG. 7 is a diagram illustrating the result of teacher data supplementation. When teacher data is supplemented for defect type C, teacher data (indicated by white circles) supplemented around the teacher data (indicated by black circles) corresponding to the collected teacher image in the feature space. ) Are distributed evenly. Thereby, the extension of the cluster belonging to the defect type C becomes clearer than the state before replenishment shown in FIG. As a result, learning based on these teacher data is effectively performed, and a classifier with few misclassifications can be configured.

以上のように、この実施形態では、教師画像に基づく教師データとともに補充された教師データを用いて学習を行うことで、過学習の弊害が抑制され、誤分類の確率を低くすることができる。そして、教師データの補充は、特徴量空間上で既存の教師データの近傍に新たな教師データを作成する方法によって行う。教師画像を疑似的に補充する方法では特徴量空間でのばらつきが却って大きくなる懸念があるが、本実施形態では特徴量空間上での位置が管理された教師データが補充されるので、このような問題がなく、学習の精度を効果的に高めることが可能である。 As described above, in this embodiment, learning is performed using the teacher data supplemented together with the teacher data based on the teacher image, so that the adverse effects of over-learning can be suppressed and the probability of misclassification can be reduced. The teacher data is supplemented by a method of creating new teacher data in the vicinity of the existing teacher data in the feature amount space. In the method of supplementing the teacher image in a pseudo manner, there is a concern that the variation in the feature amount space is increased, but in this embodiment, teacher data whose position in the feature amount space is managed is supplemented. Therefore, it is possible to effectively improve the learning accuracy.

以上説明したように、この実施形態では、欠陥分類装置３が本発明の「画像分類装置」として機能し、欠陥検出装置２、およびこれから欠陥画像データを受信する欠陥分類装置３のインターフェース３１が本発明の「画像取得手段」として機能している。また、入力受付部３５が本発明の「受付手段」として機能している。また、特徴量算出部３４が本発明の「特徴量算出手段」として機能する一方、欠陥分類部３２が本発明の「分類手段」として機能している。そして、欠陥分類部３２と特徴量算出部３４とが一体として、本発明の「教師データ作成手段」として機能している。 As described above, in this embodiment, the defect classification device 3 functions as the “image classification device” of the present invention, and the defect detection device 2 and the interface 31 of the defect classification device 3 that receives defect image data from the defect detection device 2 are the present. It functions as the “image acquisition means” of the invention. Further, the input receiving unit 35 functions as the “receiving unit” of the present invention. The feature quantity calculation unit 34 functions as the “feature quantity calculation unit” of the present invention, while the defect classification unit 32 functions as the “classification unit” of the present invention. The defect classification unit 32 and the feature amount calculation unit 34 function as a “teacher data creation unit” of the present invention.

図４のフローチャートにおいて、ステップＳ３０３が本発明の「教示工程」に相当し、ステップＳ３０２とＳ３０３とが本発明の「一次作成工程」に相当している。一方、ステップＳ３０９が本発明の「二次作成工程」に相当している。また上記実施形態では、欠陥種別Ａ，ＢおよびＣがいずれも本発明の「分類カテゴリ」に相当し、このうち教師画像数が最も少なくデータ補充の対象となる欠陥種別Ｃが、本発明の「対象カテゴリ」に相当している。 In the flowchart of FIG. 4, step S303 corresponds to the “teaching process” of the present invention, and steps S302 and S303 correspond to the “primary creation process” of the present invention. On the other hand, step S309 corresponds to the “secondary creation process” of the present invention. In the above embodiment, the defect types A, B, and C all correspond to the “classification category” of the present invention, and among these, the defect type C that has the smallest number of teacher images and is the object of data supplementation is “ It corresponds to “Target category”.

なお、本発明は上記した実施形態に限定されるものではなく、その趣旨を逸脱しない限りにおいて上述したもの以外に種々の変更を行うことが可能である。例えば、上記実施形態では、教師データの作成後、作成した教師データに基づく学習を行っている。これは教師データの妥当性を検証するためであるが、本発明の教師データ作成方法において学習のプロセスは必須の要件ではない。例えば、教師画像の収集とユーザによる教示作業が終了した後、欠陥種別ごとの教師画像の数からデータ補充の要否を判断し、必要とされる欠陥種別を対象としてデータ補充を行うようにしてもよい。この場合には学習作業を行わずに教師データが作成される。そして、教師データの補充が完了した後に学習を行うことで、未知の欠陥画像の分類が可能となる。 The present invention is not limited to the above-described embodiment, and various modifications other than those described above can be made without departing from the spirit of the present invention. For example, in the above embodiment, learning is performed based on the created teacher data after the teacher data is created. This is to verify the validity of the teacher data, but the learning process is not an essential requirement in the teacher data creation method of the present invention. For example, after the collection of teacher images and the teaching work by the user is completed, it is determined whether or not data supplement is necessary from the number of teacher images for each defect type, and data supplement is performed for the required defect types. Also good. In this case, teacher data is created without performing learning work. Then, learning is performed after the supplementation of teacher data is completed, so that unknown defect images can be classified.

また、上記実施形態では、教師データ補充の要否をユーザに判断させているが、例えば次のようにすることでユーザ判断を介在させることなく教師データを作成することも可能である。すなわち、教師データの必要数を予め定めておき、教示作業の結果、当該必要数に教師画像数が達しない分類カテゴリがあればそれを対象カテゴリとして、教師データ数が必要数以上になるまでデータ補充を行うようにしてもよい。 In the above embodiment, the user determines whether or not it is necessary to supplement teacher data. For example, teacher data can be created without intervention by the user by performing the following, for example. In other words, the required number of teacher data is determined in advance, and if there is a classification category in which the number of teacher images does not reach the required number as a result of the teaching work, this is used as a target category until the number of teacher data exceeds the required number. Replenishment may be performed.

また、上記実施形態では、親となる教師データに対応する特徴量空間上の点を体心とする超立方体の頂点を新たな教師データに対応する点としているが、教師データの生成ルールはこれに限定されるものではない。例えば特徴量空間内の超球を仮定してもよいことは先に説明した通りであり、またこのような超立方体または超球の内部に新たな点を派生させるようにしてもよい。このように特徴量空間内において親からの距離が制限された位置に子を派生させることが、本発明の主たる技術思想である。 In the above embodiment, the vertex of the hypercube centered on the point on the feature space corresponding to the parent teacher data is the point corresponding to the new teacher data. It is not limited to. For example, as described above, a hypersphere in the feature amount space may be assumed, and a new point may be derived inside such a hypercube or hypersphere. Thus, the main technical idea of the present invention is to derive the child at a position where the distance from the parent is limited in the feature amount space.

また、上記実施形態では対応していないが、事前学習処理において、ユーザが教示作業の内容を変更する場合がある。すなわち、各教師画像にいったん付与した欠陥種別の情報を、分類成績等を検証した結果としてユーザが事後的に変更することがある。このような場合、当該教師画像に基づき派生的に作成された教師データについても同様に、関連付けられた欠陥種別を変更することで、有効な教師データとして引き続き使用することが可能である。 Although not supported in the above embodiment, the user may change the content of the teaching work in the pre-learning process. In other words, the defect type information once given to each teacher image may be changed afterwards by the user as a result of verifying the classification result. In such a case, the teacher data derived based on the teacher image can be continuously used as effective teacher data by changing the associated defect type.

また、上記実施形態は、欠陥検出装置２と欠陥分類装置３とを備える検査システム１であるが、欠陥検出機能とその分類機能とを一体化させた装置に対しても、本発明を適用することが可能である。また、撮像機能を備えず欠陥分類のみを専門的に行う装置であってもよい。また例えば、本発明にかかる教師データ作成方法を例えばパーソナルコンピュータなどの演算処理装置上のソフトウェアとして実現し、該演算処理装置が検査装置により取得される欠陥画像を教師画像として教師データを作成し、該教師データを検査装置に与えて検査装置で欠陥分類を行う構成としてもよい。 Moreover, although the said embodiment is the inspection system 1 provided with the defect detection apparatus 2 and the defect classification apparatus 3, this invention is applied also to the apparatus which integrated the defect detection function and its classification function. It is possible. Moreover, the apparatus which is not provided with an imaging function and specialized in only defect classification may be used. Also, for example, the teacher data creation method according to the present invention is realized as software on an arithmetic processing device such as a personal computer, and the arithmetic processing device creates teacher data using a defect image acquired by the inspection device as a teacher image, The teacher data may be given to the inspection apparatus and defect classification may be performed by the inspection apparatus.

また、上記実施形態は半導体基板の欠陥を検査・分類する画像分類装置であるが、本発明の適用対象たる画像分類装置は、半導体基板を検査する装置だけでなく、他の対象物、例えばプリント基板やガラス基板等を検査する装置や、各種材料の表面状態を検査する表面検査装置であってもよい。 Moreover, although the said embodiment is an image classification apparatus which test | inspects and classifies the defect of a semiconductor substrate, the image classification apparatus which is an application object of this invention is not only the apparatus which test | inspects a semiconductor substrate but other objects, for example, a print It may be a device for inspecting a substrate, a glass substrate or the like, or a surface inspection device for inspecting the surface state of various materials.

また、上記実施形態は、半導体基板等から欠陥を検出し分類する検査システムに本発明を適用し、欠陥種別に対応して分類カテゴリが規定されるものであるが、本発明が対象とする画像はこのような欠陥画像に限定されず、特徴量に基づいて評価される種々の画像を扱う技術全般に対して、本発明を適用することが可能である。 In the above embodiment, the present invention is applied to an inspection system that detects and classifies defects from a semiconductor substrate or the like, and classification categories are defined corresponding to defect types. However, the present invention is not limited to such a defect image, and the present invention can be applied to all techniques for handling various images evaluated based on feature amounts.

本発明は複数の特徴量によって表される画像を扱う技術に好適であり、特に、教師データに基づく機械学習を伴う技術分野に対して好適に適用することが可能である。 The present invention is suitable for a technique for handling an image represented by a plurality of feature amounts, and can be suitably applied particularly to a technical field involving machine learning based on teacher data.

１検査システム
２欠陥検出装置（画像取得手段）
３欠陥分類装置（画像分類装置）
３１インターフェース（画像取得手段）
３２欠陥分類部（分類手段、教師データ作成手段）
３４特徴量算出部（特徴量算出手段、教師データ作成手段）
３５入力受付部（受付手段）
Ｓ３０２一次作成工程
Ｓ３０３教示工程、一次作成工程
Ｓ３０９二次作成工程 DESCRIPTION OF SYMBOLS 1 Inspection system 2 Defect detection apparatus (image acquisition means)
3. Defect classification device (image classification device)
31 Interface (Image acquisition means)
32 Defect classification unit (classification means, teacher data creation means)
34 feature amount calculation unit (feature amount calculation means, teacher data creation means)
35 Input reception part (reception means)
S302 Primary creation step S303 Teaching step, Primary creation step S309 Secondary creation step

Claims

A teaching step of associating each of the plurality of teacher images with one classification category in which the teacher image is to be classified among the plurality of classification categories;
First, using at least one of the plurality of classification categories as a target category, a set of feature value values obtained for the teacher image associated with the target category is used as teacher data associated with the target category. Creation process,
Based on the one teacher data, a new set of feature value values corresponding to a point within a predetermined distance from a point occupied by the teacher data in the feature amount space composed of the plurality of types of feature amounts is generated. And a secondary creation step of associating the feature value and the target category as new teacher data.

The teacher data creation method according to claim 1, wherein the secondary creation step is executed for each of the teacher data corresponding to the plurality of teacher images associated with the target category.

The teacher data creation method according to claim 1 or 2, further comprising executing the secondary creation step based on at least one of the new teacher data created by the secondary creation step.

The new teacher data generated in the secondary creation step corresponds to a vertex of a hypercube in the feature amount space having a point corresponding to the teacher data as a body as a body. Teacher data creation method described in 1.

The teacher data creation according to any one of claims 1 to 4, wherein among the classification categories associated with at least one of the teacher images, the one with the smallest number of the associated teacher images is one of the target categories. Method.

A plurality of feature value values corresponding to each of the teacher data created by the teacher data creation method according to claim 1 and the teacher image associated with the classification category other than the target category. A step of configuring a classifier by performing learning based on a plurality of teacher data composed of
A method for classifying images to be classified into any of the classification categories using the classifier.

Image acquisition means for acquiring a plurality of teacher images;
Receiving means for receiving teaching information that associates one of the plurality of classification categories with which one of the teacher images is to be classified;
Feature quantity calculating means for calculating a plurality of types of feature quantities for each of the teacher images;
Teacher data creating means for creating teacher data based on the teacher image;
A classifier is configured by performing learning based on the teaching information and the teacher data, and includes a classifying unit that classifies a classification target image using the classifier,
The teacher data creating means associates the plurality of types of feature value values obtained for each of the plurality of teacher images with each of the teacher data in association with the classification category to which the teacher image is associated. On the other hand, based on at least one of the teacher data, a new value of the feature amount corresponding to a point that is less than a predetermined distance from a point occupied by the teacher data in the feature amount space including the plurality of types of feature amounts. An image classification apparatus characterized by generating a set of feature values and associating the set of feature value values with the target category as one piece of the teacher data.

The teacher data creation means includes the teacher data associated with at least one of the classification categories including the one with the smallest number of the associated teacher images among the classification categories associated with at least one of the teacher images. The image classification apparatus according to claim 7, wherein a new set of feature amount values is generated based on the image.