JP2017040522A

JP2017040522A - Teaching support method and image classification method

Info

Publication number: JP2017040522A
Application number: JP2015161593A
Authority: JP
Inventors: 松村　明; Akira Matsumura; 明松村
Original assignee: Screen Holdings Co Ltd
Current assignee: Screen Holdings Co Ltd
Priority date: 2015-08-19
Filing date: 2015-08-19
Publication date: 2017-02-23
Anticipated expiration: 2035-08-19
Also published as: JP6596260B2

Abstract

PROBLEM TO BE SOLVED: To provide a support for enabling a user to perform efficient teaching operation that the user performs with a reduced user burden.SOLUTION: A two-dimensional map space is set as a square on an XY plane. At two diagonal vertices of the square, a reference vector Rmin as a zero vector and a reference vector Rmax such that values of normalized elements are all 1 are arranged, and at the other two vertices, reference vectors R1, R2 representing two typical examples corresponding to two classification categories that a user specifies are arranged. A poisson equation is solved within the map space based upon them as boundary conditions so as to determine initial values of reference vectors at respective nodes indicated by grating points. The respective nodes which are thus initialized are learnt with a tutor image so as to generate a self-organization map, and similarities between the typical examples and an unlearnt tutor image are visualized.SELECTED DRAWING: Figure 5

Description

この発明は、画像処理技術を応用した画像の分類技術に関するものであり、特に、複数の画像に対し分類カテゴリを付与するユーザの教示作業を支援する技術に関する。 The present invention relates to an image classification technique to which an image processing technique is applied, and more particularly to a technique for supporting a user's teaching work of assigning a classification category to a plurality of images.

複数の画像を画像相互の類似度に応じて複数のカテゴリに自動分類する技術として、教師あり学習を用いたものがある。この技術では、事前に収集された教師画像の特徴量分布（標本分布）から各分類カテゴリの特徴量分布を推定し、誤分類を最小にするようなカテゴリ間境界を求めることで分類器（識別関数）を構築する。代表的な例としては、線形判別法やニューラルネットワーク、サポートベクタマシン（ＳＶＭ）などが知られている。 As a technique for automatically classifying a plurality of images into a plurality of categories according to the degree of similarity between images, there is a technique using supervised learning. This technology estimates the feature value distribution of each classification category from the feature value distribution (sample distribution) of the teacher image collected in advance, and determines the classifier (discriminator) by finding the boundary between categories that minimizes misclassification. Function). As typical examples, a linear discrimination method, a neural network, a support vector machine (SVM), and the like are known.

この技術では、教師画像に対して当該画像が分類されるべき分類カテゴリが適切に付与されることが、分類精度を向上させる上で必要となる。しかしながら、教師画像に対する分類カテゴリの付与は、ユーザ（熟練者）の目視判断に基づく教示作業により行われており、その作業負担は極めて大きく、またある程度の誤教示も避けられない。 In this technique, it is necessary to improve the classification accuracy that a classification category in which the image is to be classified is appropriately given to the teacher image. However, the assignment of the classification category to the teacher image is performed by a teaching work based on the visual judgment of the user (expert), the work burden is extremely large, and a certain amount of erroneous teaching is unavoidable.

誤教示に起因する分類精度の低下を抑制するための技術としては、例えば特許文献１、２に記載されたものがある。これらの技術では、教示作業により各分類カテゴリに割り振られた画像の特徴量のばらつきが分類カテゴリごとに評価され、教示の結果の妥当性が判断される。 For example, Patent Documents 1 and 2 disclose techniques for suppressing a decrease in classification accuracy caused by erroneous teaching. In these techniques, the variation in the feature amount of the image assigned to each classification category by the teaching work is evaluated for each classification category, and the validity of the teaching result is determined.

特開２０１３−１６７５４５号公報JP2013-167545A 特開２０１４−０７０９４４号公報Japanese Patent Application Laid-Open No. 2014-070944

上記従来技術では、不適切な教示がなされた可能性のある画像を抽出することはできたとしても、それをどのように修正すればよいかの手がかりは示されない。例えば、当該画像が同じ分類カテゴリが付与された他の画像とどの程度乖離しているのか、あるいは他の分類カテゴリが付与された画像とどの程度類似しているのかといった情報が、ユーザに提示されない。また、最初に全ての画像に対し教示が必要である点については通常の教師あり学習と差異がない。このため、教示作業を行うユーザの負担軽減には必ずしもつながらない。 In the above prior art, even though an image that may have been inappropriately taught can be extracted, there is no clue as to how to correct it. For example, information such as how far the image is different from other images with the same classification category or how similar the image is with other classification categories is not presented to the user. . In addition, there is no difference from normal supervised learning in that teaching is necessary for all images first. For this reason, it does not necessarily reduce the burden on the user who performs the teaching work.

また、ユーザの主観に頼らない分類方法として、教師なし学習を用いる方法も考えられる。しかしながら、初期条件の与え方によって結果が変わるため、画像をその外観に基づき分類するという目的では、結局ユーザが経験に基づき適当な初期条件を与えることがいずれかの段階で必要となり、学習アルゴリズムはそのような要求に応えるものではなく、そのために必要な情報の提供もなされない。 As a classification method that does not depend on the user's subjectivity, a method using unsupervised learning is also conceivable. However, since the results vary depending on how the initial conditions are given, the purpose of classifying images based on their appearance is that the user must eventually give appropriate initial conditions based on experience, and the learning algorithm It does not meet such demands, nor does it provide necessary information.

これらの理由から、ユーザが複数の画像に分類カテゴリを付与する教示作業を行うのに際して、例えばコンピュータ処理により、ユーザ負担を軽減し、効率的な教示作業を可能とするための支援が提供されることが望ましいが、そのような技術は確立されるに至っていない。 For these reasons, when a user performs a teaching operation for assigning a classification category to a plurality of images, for example, by computer processing, a support for reducing the user burden and enabling an efficient teaching operation is provided. Although desirable, no such technology has been established.

この発明は上記課題に鑑みなされたものであり、ユーザが行う教示作業に対し、ユーザ負担を軽減し、効率的な教示作業を可能とするための支援を提供することを目的とする。 This invention is made in view of the said subject, and it aims at providing the support for reducing a user burden with respect to the teaching work which a user performs, and enabling efficient teaching work.

この発明にかかる教示支援方法は、Ｎ（Ｎは２以上の整数）種の分類カテゴリのそれぞれに対応するＮ枚の典型画像を含むＭ（ＭはＮより大きい整数）枚の画像に対し分類カテゴリを付与する教示作業をコンピュータが支援する教示支援方法であって、前記コンピュータが、前記分類カテゴリの数に応じた次元を有するマップ空間内の各ノードの参照ベクトルを初期化する初期化工程と、前記画像の各々を前記マップ空間において当該画像の特徴ベクトルと最も近い参照ベクトルを有するノードに分類し、分類結果に基づいて各ノードの参照ベクトルを更新することで各ノードを学習させる学習工程と、前記学習工程で得られた前記マップ空間における前記画像の分布を可視化して提示出力する出力工程とを実行する。 The teaching support method according to the present invention provides a classification category for M (M is an integer greater than N) images including N typical images corresponding to each of N (N is an integer of 2 or more) types of classification categories. An initialization process in which the computer initializes a reference vector of each node in a map space having a dimension corresponding to the number of classification categories; A learning step of classifying each of the images into nodes having a reference vector closest to the feature vector of the image in the map space, and learning each node by updating the reference vector of each node based on the classification result; An output step of visualizing and presenting the distribution of the image in the map space obtained in the learning step;

これらの各工程は、Ｔ．コホネンにより提案された自己組織化マップ（Self-organizing map；ＳＯＭ）の処理アルゴリズムに類似している。ＳＯＭでは、多次元の入力ベクトルを任意の次元のマップ空間に写像して可視化することができ、マップ空間内の各ノードを示す参照ベクトルを入力ベクトルによって学習させる教師なし学習により、類似する入力ベクトルがマップ空間上で近接配置されるようになる。入力ベクトルが画像を表す特徴ベクトルであるとき、類似した特徴を有する画像がマップ空間上で近接配置されクラスタを構成することになる。このことを利用して、複数の画像を相互間の類似の程度に応じ可視化して提示することが可能である。 Each of these steps is described in T.W. It is similar to the self-organizing map (SOM) processing algorithm proposed by Kohonen. In SOM, a multi-dimensional input vector can be mapped and visualized in an arbitrary dimensional map space, and a similar input vector can be obtained by unsupervised learning in which a reference vector indicating each node in the map space is learned by the input vector. Are arranged close to each other on the map space. When the input vector is a feature vector representing an image, images having similar features are arranged close to each other in the map space to form a cluster. By utilizing this, it is possible to visualize and present a plurality of images according to the degree of similarity between them.

原理的に教師データを用いない従来のＳＯＭでは、マップ空間における各ノードの参照ベクトルの初期値がランダムに発生される。このため、複数の画像からなる同一の画像セットであっても、マップ空間上に形成されるクラスタの位置は初期値に依存することになる。このような性質は、教示作業を支援するという目的には不向きである。 In the conventional SOM that does not use teacher data in principle, the initial value of the reference vector of each node in the map space is randomly generated. For this reason, even in the same image set composed of a plurality of images, the position of the cluster formed on the map space depends on the initial value. Such a property is not suitable for the purpose of supporting the teaching work.

そこで、本発明の初期化工程では、上記目的を達成するため、（Ｎ＋２）以上の頂点を有する超立方体を前記マップ空間とし、前記頂点のうち、第１の頂点に、特徴ベクトルを構成する複数の特徴量の値の各々が、前記Ｍ枚の画像それぞれを表す特徴ベクトルにおいて対応する特徴量の値のうちの最小値以下である特徴ベクトルを、前記第１の頂点から最も遠い第２の頂点に、特徴ベクトルを構成する複数の特徴量の値の各々が、前記Ｍ枚の画像それぞれを表す特徴ベクトルにおいて対応する特徴量の値のうちの最大値以上である特徴ベクトルを、前記第１および第２の頂点以外の頂点に、前記典型画像の各々に対応する特徴ベクトルを、それぞれ当該頂点に対応するノードの参照ベクトルとして配置し、前記頂点以外の各ノードの参照ベクトルの初期値が、前記各頂点の参照ベクトルを境界条件とするポアソン方程式の解として与えられる。 Therefore, in the initialization process of the present invention, in order to achieve the above object, a hypercube having (N + 2) or more vertices is used as the map space, and among the vertices, a plurality of feature vectors are formed at the first vertex. A feature vector whose feature value is equal to or smaller than the minimum value among the corresponding feature values in the feature vectors representing the M images, the second vertex farthest from the first vertex In addition, a feature vector in which each of a plurality of feature value values constituting the feature vector is equal to or greater than a maximum value among the corresponding feature value values in the feature vector representing each of the M images. A feature vector corresponding to each of the typical images is arranged as a reference vector of a node corresponding to the vertex at a vertex other than the second vertex, and a reference vector of each node other than the vertex is set. The initial value of Le is given as a solution of Poisson's equation for a reference vector boundary conditions of the respective vertices.

このように構成された発明では、各参照ベクトルの初期値が形成するベクトル場が、あたかも電荷の分布を与えられた空間におけるポテンシャル場のようにマップ空間に滑らかな超曲面を一意に形成する。予め与えられたＮ枚の典型画像に対応する参照ベクトルは、マップ空間をなす超立方体の頂点に配置され、互いに十分な距離が確保される。また、特徴ベクトルを構成する特徴量の全てが各画像に比べて小さな値を有する参照ベクトルと、特徴量の全てが各画像に比べて大きな値を有する参照ベクトルとがマップ空間上で最も遠い頂点（第１の頂点および第２の頂点）に配置されることで、第１の頂点と第２の頂点とを結ぶ軸に沿った方向では各画像の特徴ベクトルはマップ空間内の広範囲に分布することになる。つまり、マップ空間全体を有効に使って広いダイナミックレンジで画像の分布を示すことができる。 In the invention thus configured, the vector field formed by the initial value of each reference vector uniquely forms a smooth hypersurface in the map space as if it were a potential field in a space to which a charge distribution is given. Reference vectors corresponding to N typical images given in advance are arranged at the vertices of a hypercube forming a map space, and a sufficient distance is secured. In addition, a reference vector in which all the feature values constituting the feature vector have a smaller value than each image and a reference vector in which all the feature values have a larger value than each image are the farthest vertices in the map space. By being arranged at (first vertex and second vertex), the feature vectors of each image are distributed over a wide range in the map space in the direction along the axis connecting the first vertex and the second vertex. It will be. That is, the entire map space can be used effectively to show the image distribution with a wide dynamic range.

このようにして形成されるマップ空間では、互いに類似した特徴を有する画像が集まってクラスタが形成される。したがって、各頂点に配置されたＮ枚の典型画像の周囲には、当該典型画像に類似した特徴を有する画像が集まる。また、典型画像のいずれとも類似しない画像であっても、相互に類似する画像同士がマップ空間内で集まってクラスタを形成することがある。 In the map space formed in this way, images having similar characteristics are gathered to form a cluster. Therefore, images having characteristics similar to the typical image are collected around the N typical images arranged at each vertex. Even if the images are not similar to any of the typical images, images similar to each other may gather together in the map space to form a cluster.

ＳＯＭは多次元特徴空間内のベクトルとして表される多数のデータをより低次元のマップ空間に写像して可視化表現するのに適した手法であり、種々の具体的表現方法が提案されている。本発明においても、そのような表現方法の適宜のものを用いて、上記のように分類された画像の分布を可視化しユーザに提示することで、ユーザによる教示作業を支援することができる。 SOM is a technique suitable for visualizing and expressing a large number of data represented as vectors in a multidimensional feature space in a lower dimensional map space, and various specific expression methods have been proposed. Also in the present invention, the teaching work by the user can be supported by visualizing and presenting to the user the distribution of the images classified as described above using an appropriate one of such expression methods.

具体的には、本発明では、Ｍ（＞Ｎ）枚の画像からＮ種の分類カテゴリの各々に対して少なくとも１つずつ典型画像が特定されれば、残りの画像については典型画像に対しどの程度類似しているかが自動的に判定されその結果が可視化される。したがって、ユーザは複数の画像からいくつかの分類カテゴリに対応する典型画像を指定（教示）するだけで、それらの典型画像に類似する画像がどれだけあるか、あるいは典型画像とは異なる位置でクラスタを形成する画像があるか等、教示作業を続けるために有用な情報を得ることが可能となる。すなわち、未教示の画像に対して付与すべき分類カテゴリの示唆や、教示済みの画像に対して付与された分類カテゴリの適否などがユーザに示される。 Specifically, in the present invention, if at least one typical image is specified for each of the N classification categories from M (> N) images, which of the remaining images is compared with the typical image. The degree of similarity is automatically determined and the result is visualized. Therefore, the user simply designates (teaches) typical images corresponding to several classification categories from a plurality of images, and how many images are similar to the typical images or clusters at positions different from the typical images. It is possible to obtain useful information for continuing the teaching work, such as whether there is an image forming the image. That is, the user is shown the suggestion of the classification category to be assigned to the untaught image and the suitability of the classification category assigned to the taught image.

例えば、ある典型画像の近傍に多数の他の画像があるとき、ユーザはそれらを一括して当該典型画像に対応する分類カテゴリへと教示を行うことができる。また、ある典型画像の近傍に分布する画像が少なければ、当該分類カテゴリを廃止したり典型画像を選び直したりすることができる。また、いずれの典型画像の近傍とも異なる位置にクラスタが形成されていれば、当該クラスタに対応する分類カテゴリを新たに創設することができる。 For example, when there are many other images in the vicinity of a typical image, the user can teach them to the classification category corresponding to the typical image at once. If there are few images distributed in the vicinity of a certain typical image, the classification category can be abolished or the typical image can be selected again. Further, if a cluster is formed at a position different from the vicinity of any typical image, a classification category corresponding to the cluster can be newly created.

このように、本発明の教示支援方法によれば、いくつかの分類カテゴリに対し少なくとも１つの典型画像が特定されれば、それらの典型画像に対する他の画像の類似度が可視化されて提示される。そのため、ユーザに対し教示作業を少ない作業負担で効率的に行うための効果的な支援を提供することが可能である。 As described above, according to the teaching support method of the present invention, when at least one typical image is specified for several classification categories, the similarity of other images to the typical image is visualized and presented. . Therefore, it is possible to provide effective support for the user to efficiently perform the teaching work with a small work load.

また、この発明にかかる画像分類方法は、複数の画像を取得する取得工程と、上記した教示支援方法により、前記複数の画像の前記マップ空間における分布を提示し、前記複数の画像各々について分類カテゴリの教示入力を受け付ける教示工程と、前記複数の画像および教示された分類カテゴリを教師データとして学習した分類器を用いて、画像を分類する分類工程とを備えている。 Further, the image classification method according to the present invention presents the distribution in the map space of the plurality of images by the acquisition step of acquiring a plurality of images and the teaching support method described above, and classifies each of the plurality of images with a classification category. And a classification step of classifying images using a classifier that has learned the plurality of images and the taught classification category as teacher data.

このように構成された発明では、画像を分類するための予備段階としての教示作業におけるユーザ負担が軽減されているので、画像分類の処理コストを低減することが可能である。また、教示作業に対する適切な支援が与えられることで教示結果の信頼性を高めることができるため、画像分類の精度向上を図ることが可能になる。 In the invention configured as described above, since the burden on the user in the teaching work as a preliminary stage for classifying the images is reduced, it is possible to reduce the processing cost of the image classification. Moreover, since the reliability of the teaching result can be increased by providing appropriate support for the teaching work, it is possible to improve the accuracy of image classification.

本発明によれば、未教示の画像に対し付与されるべき分類カテゴリに関する情報や、教示済みの画像に対する分類カテゴリの適否に関する情報などをユーザに提示して、ユーザが行う教示作業に対し、ユーザ負担を軽減し、効率的な教示作業を可能とするための支援を提供することができる。 According to the present invention, information related to a classification category to be assigned to an untaught image, information related to the suitability of a classification category for a taught image, and the like are presented to the user, and the user performs a teaching operation performed by the user. It is possible to provide support for reducing the burden and enabling efficient teaching work.

この発明を好適に適用可能な検査システムの概略構成を示す図である。1 is a diagram showing a schematic configuration of an inspection system to which the present invention can be preferably applied. 欠陥検出装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a defect detection apparatus. 欠陥分類装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of a defect classification device. 欠陥分類装置により実行される事前学習処理を示すフローチャートである。It is a flowchart which shows the prior learning process performed by the defect classification device. 参照ベクトルの初期値を説明する図である。It is a figure explaining the initial value of a reference vector. 分類結果の表示出力例を示す図である。It is a figure which shows the example of a display output of a classification result. 分類結果の表示出力例を示す図である。It is a figure which shows the example of a display output of a classification result. 三次元マップ空間での表示出力例を示す図である。It is a figure which shows the example of a display output in a three-dimensional map space.

図１はこの発明を好適に適用可能な検査システムの概略構成を示す図である。この検査システム１は、検査対象である半導体基板Ｓの外観に現れたピンホールや異物等の欠陥検査を行い、検出された欠陥の自動分類を行う検査システムである。検査システム１は、基板Ｓ上の検査対象領域から欠陥を検出する欠陥検出装置２と、欠陥検出装置２により検出された欠陥をより詳細に分析して欠陥の種類を特定する欠陥分類装置３とを備えている。欠陥検出装置２は基板Ｓの製造ラインに組み込まれ、検査システム１はいわゆるインライン型のシステムとなっている。 FIG. 1 is a diagram showing a schematic configuration of an inspection system to which the present invention can be preferably applied. This inspection system 1 is an inspection system that performs inspection of defects such as pinholes and foreign matters appearing on the appearance of a semiconductor substrate S to be inspected, and automatically classifies detected defects. The inspection system 1 includes a defect detection device 2 that detects a defect from an inspection target region on the substrate S, a defect classification device 3 that analyzes the defect detected by the defect detection device 2 in more detail and identifies the type of defect, and It has. The defect detection device 2 is incorporated in the production line of the substrate S, and the inspection system 1 is a so-called in-line system.

欠陥検出装置２は、基板Ｓ上の検査対象領域の画像を取得する画像取得ユニット２０と、画像取得ユニット２０からの画像データを解析して欠陥検出を行う検出ユニット２１とを備えている。画像取得ユニット２０は、例えばＣＣＤセンサまたはＣＭＯＳセンサなどの撮像素子を有し基板Ｓ上の検査対象領域を撮像することにより画像データを取得する撮像部２０１、基板Ｓを保持するステージ２０２、および、撮像部２０１に対してステージ２０２を相対的に移動させるステージ駆動部２０３を備えている。ステージ駆動部２０３はボールねじ、ガイドレールおよびモータにより構成される。撮像部２０１およびステージ駆動部２０３は検出ユニット２１に設けられた機構制御部２１１により制御されており、機構制御部２１１からの制御指令に応じて動作することで、基板Ｓ上の検査対象領域を撮像する。 The defect detection apparatus 2 includes an image acquisition unit 20 that acquires an image of an inspection target area on the substrate S, and a detection unit 21 that analyzes the image data from the image acquisition unit 20 and detects a defect. The image acquisition unit 20 has an imaging element such as a CCD sensor or a CMOS sensor, for example, and acquires an image data by acquiring an image of an inspection target area on the substrate S, a stage 202 that holds the substrate S, and A stage driving unit 203 that moves the stage 202 relative to the imaging unit 201 is provided. The stage driving unit 203 includes a ball screw, a guide rail, and a motor. The imaging unit 201 and the stage driving unit 203 are controlled by a mechanism control unit 211 provided in the detection unit 21, and operate according to a control command from the mechanism control unit 211, so that an inspection target region on the substrate S is changed. Take an image.

撮像部２０１による撮像で得られた画像データに対して欠陥検出部２１２が適宜のデータ処理を施して欠陥検出を行う。欠陥検出部２１２は、撮像部２０１から与えられた画像データに対して適宜のデータ処理、例えばノイズ除去処理や平滑化処理などを施し、こうして処理された画像データに基づき欠陥検出を行う。より具体的には、実際の基板Ｓにおいて撮像された画像と、記憶部２１３に予め記憶された無欠陥の基板の画像とを比較して、その差異が予め定められた基準を超える場合には当該画像に欠陥が含まれると判定する。欠陥検出部２１２は、検出した欠陥の画像データをインターフェース（Ｉ／Ｆ）２１４を介して欠陥分類装置３に送信する。 The defect detection unit 212 performs appropriate data processing on the image data obtained by imaging by the imaging unit 201 to detect defects. The defect detection unit 212 performs appropriate data processing such as noise removal processing or smoothing processing on the image data given from the imaging unit 201, and performs defect detection based on the image data thus processed. More specifically, when an image captured on the actual substrate S is compared with an image of a defect-free substrate stored in advance in the storage unit 213, and the difference exceeds a predetermined reference It is determined that the image includes a defect. The defect detection unit 212 transmits the detected defect image data to the defect classification device 3 via the interface (I / F) 214.

欠陥分類装置３は、欠陥検出装置２から与えられる欠陥データに基づき、検出された欠陥の種別を判定する。すなわち、欠陥分類装置３は欠陥検出装置２から与えられる欠陥画像データに基づいて当該欠陥が属すべき分類カテゴリへと欠陥を自動分類（ＡＤＣ；Automatic Defect Classification）する機能を有している。この欠陥分類装置３は、この発明の一実施形態を実装した装置である。以下に説明するように、一般的なパーソナルコンピュータが有するハードウェア資源を用いて、本発明にかかる各処理を記述した制御プログラムを実行させることにより、欠陥分類装置３としての機能を実現することが可能である。 The defect classification device 3 determines the type of the detected defect based on the defect data given from the defect detection device 2. That is, the defect classification device 3 has a function of automatically classifying defects (ADC: Automatic Defect Classification) into classification categories to which the defect belongs based on the defect image data given from the defect detection device 2. The defect classification apparatus 3 is an apparatus that implements an embodiment of the present invention. As described below, the function as the defect classification device 3 can be realized by executing a control program describing each process according to the present invention using hardware resources of a general personal computer. Is possible.

欠陥分類装置３は、予め読み込まれた制御プログラムを実行することにより、図１に示す各機能ブロックを専用ハードウェアおよびソフトウェアにより実現する。欠陥分類装置３は、欠陥検出装置２から欠陥画像データを受信するためのインターフェース（Ｉ／Ｆ）３１、欠陥画像データに基づく自動分類により、当該欠陥の種別を判定する欠陥分類部（ＡＤＣ）３２を備えている。インターフェース３１は例えばＬＡＮ（Local Area Network）を介して他の外部機器と通信が可能となっている。 The defect classification device 3 implements each functional block shown in FIG. 1 by dedicated hardware and software by executing a control program read in advance. The defect classification device 3 includes an interface (I / F) 31 for receiving defect image data from the defect detection device 2, and a defect classification unit (ADC) 32 that determines the type of the defect by automatic classification based on the defect image data. It has. The interface 31 can communicate with other external devices via, for example, a LAN (Local Area Network).

欠陥検出装置２から与えられる画像データおよび欠陥分類部３２における処理過程で生じる種々の処理データは必要に応じて記憶部３３に随時保存される。欠陥分類部３２は、与えられた欠陥画像データに基づき特徴量算出部３４が算出した複数種の特徴量の値に基づき、当該欠陥画像を、欠陥種別に対応して予め設定された複数の分類カテゴリのいずれかに分類する。 The image data given from the defect detection device 2 and various processing data generated in the process in the defect classification unit 32 are stored in the storage unit 33 as needed. The defect classifying unit 32 classifies the defect image into a plurality of classifications set in advance corresponding to the defect type based on the values of the plurality of types of feature amounts calculated by the feature amount calculating unit 34 based on the given defect image data. Sort into one of the categories.

さらに、欠陥分類装置３は、ユーザからの操作入力を受け付けるキーボードおよびマウスなどの入力受付部３５および操作手順や処理結果等のユーザ向け視覚情報を表示する表示部３６を備えている。 Furthermore, the defect classification device 3 includes an input receiving unit 35 such as a keyboard and a mouse that receives an operation input from a user, and a display unit 36 that displays visual information for the user such as an operation procedure and a processing result.

欠陥分類部３２は、検出された欠陥をＳＶＭ（サポートベクタマシン；Support Vector Machine）、ニューラルネットワーク、決定木、判別分析等適宜の学習アルゴリズムを利用して分類する処理をソフトウェア的に実行する。具体的には、欠陥検査装置２から与えられて記憶部３３に記憶保存された欠陥画像データと、当該欠陥画像が分類されるべき分類カテゴリに関するユーザからの教示情報とに基づいて予め適宜の学習アルゴリズムによる機械学習を行っておく。そして、未分類の欠陥画像に対応する画像データが新たに与えられると、学習結果に基づく自動分類を行って新たな欠陥画像をいずれかの分類カテゴリに分類する。 The defect classification unit 32 performs, in software, a process of classifying the detected defects using an appropriate learning algorithm such as SVM (Support Vector Machine), neural network, decision tree, discriminant analysis, and the like. Specifically, appropriate learning is performed in advance based on defect image data given from the defect inspection apparatus 2 and stored and stored in the storage unit 33, and teaching information from a user regarding a classification category in which the defect image should be classified. Perform machine learning using an algorithm. When new image data corresponding to an unclassified defect image is given, automatic classification based on the learning result is performed to classify the new defect image into any one of the classification categories.

図２は欠陥検出装置の動作を示すフローチャートである。欠陥検出装置２は、検査対象基板Ｓを撮像し、その撮像結果から欠陥を検出するための動作として以下の欠陥検出処理を実行する。最初に検査対象基板Ｓが画像取得ユニット２０のステージ２０２にロードされると（ステップＳ１０１）、機構制御部２１１が撮像部２０１およびステージ駆動部２０３を制御して、基板Ｓ上の所定の検査対象位置を撮像し画像を取得する（ステップＳ２０２）。こうして得られた画像データに対して、欠陥検出部２１２は、必要に応じて例えば輝度調整やノイズ除去等の処理を行う。さらに、予め記憶部２１３に記憶されている無欠陥の基板の画像データのうち、当該検査対象位置に対応する画像データを読み出して、撮像により得られた画像データとの画素単位の差分を算出する（ステップＳ１０３）。 FIG. 2 is a flowchart showing the operation of the defect detection apparatus. The defect detection apparatus 2 images the inspection target substrate S, and executes the following defect detection processing as an operation for detecting defects from the imaging result. When the inspection target substrate S is first loaded on the stage 202 of the image acquisition unit 20 (step S101), the mechanism control unit 211 controls the imaging unit 201 and the stage driving unit 203 to perform a predetermined inspection target on the substrate S. The position is imaged and an image is acquired (step S202). For the image data obtained in this way, the defect detection unit 212 performs processes such as brightness adjustment and noise removal as necessary. Further, the image data corresponding to the inspection target position is read out from the image data of the defect-free substrate stored in the storage unit 213 in advance, and the difference in pixel units from the image data obtained by imaging is calculated. (Step S103).

撮像された画像に欠陥が含まれなければ予め記憶されていた無欠陥の画像との差は小さい一方、欠陥が含まれていれば大きな差異が現れる。このことを利用して、両者の差分を求めることで欠陥の有無を判定することが可能である。具体的には、得られた差分画像に対して適宜の平滑化処理および二値化処理を行って（ステップＳ１０４）、有意な差が認められる領域があればその部分を欠陥部分として抽出する（ステップＳ１０５）。欠陥が検出された画像については（ステップＳ１０６）、当該画像を欠陥画像として、インターフェース２１４を介して欠陥分類装置３に送信する（ステップＳ１０７）。上記処理を、検査対象位置を順次変えながら基板Ｓの全体について行うことで（ステップＳ１０８）、基板Ｓの検査が完了する。 If the captured image does not include a defect, the difference from the previously stored defect-free image is small, but if a defect is included, a large difference appears. By utilizing this fact, it is possible to determine the presence or absence of a defect by obtaining the difference between the two. Specifically, appropriate smoothing processing and binarization processing are performed on the obtained difference image (step S104), and if there is a region where a significant difference is recognized, that portion is extracted as a defective portion ( Step S105). For an image in which a defect is detected (step S106), the image is transmitted as a defect image to the defect classification device 3 via the interface 214 (step S107). By performing the above processing on the entire substrate S while sequentially changing the inspection target position (step S108), the inspection of the substrate S is completed.

欠陥検出装置２は、検査対象基板Ｓの各位置を検査して欠陥の有無を判定する。一方、欠陥分類装置３は、検出された欠陥の種別を判定して、欠陥の発生傾向を分析してその結果を製造ラインにフィードバックしたり、欠陥検出装置２による欠陥検出処理の処理内容の調整を行うのに用いられる。検出された欠陥の分析を欠陥分類装置３により行うことで、欠陥検出装置２は欠陥検出に特化した処理を実行することができるので、高速での欠陥検出が可能となる。 The defect detection device 2 inspects each position of the inspection target substrate S to determine whether there is a defect. On the other hand, the defect classification device 3 determines the type of detected defect, analyzes the tendency of occurrence of the defect, and feeds back the result to the production line, or adjusts the processing content of the defect detection processing by the defect detection device 2 Used to do By analyzing the detected defect by the defect classification device 3, the defect detection device 2 can execute a process specialized for defect detection, so that a defect can be detected at high speed.

図３は欠陥分類装置の動作を示すフローチャートである。上記した通り、欠陥分類装置３は欠陥検出装置２から出力される欠陥画像を、その欠陥種別に応じた分類カテゴリに分類する。より具体的には、欠陥分類部３２が予め収集された欠陥画像データに基づき事前学習（後述）を行っておき（ステップＳ２０１）、未分類の欠陥画像が欠陥検出装置２から新たに与えられると（ステップＳ２０２）、特徴量算出部３４が当該欠陥画像を特徴付ける複数の特徴量の値を算出する（ステップＳ２０３）。そして、欠陥分類部３２が、算出された特徴量の値に基づき、事前学習により構成された分類器を用いて当該欠陥画像を複数の分類カテゴリのいずれかに分類する（ステップＳ２０４）。 FIG. 3 is a flowchart showing the operation of the defect classification apparatus. As described above, the defect classification device 3 classifies the defect image output from the defect detection device 2 into a classification category corresponding to the defect type. More specifically, when the defect classification unit 32 performs pre-learning (described later) based on the defect image data collected in advance (step S201), an unclassified defect image is newly given from the defect detection device 2. (Step S202), the feature quantity calculation unit 34 calculates a plurality of feature quantity values that characterize the defect image (Step S203). Then, based on the calculated feature value, the defect classification unit 32 classifies the defect image into one of a plurality of classification categories using a classifier configured by prior learning (step S204).

分類カテゴリは当該基板Ｓにおいて発生し得る複数の欠陥種別に対応して複数種が予め設定されており、例えば異物、傷、気泡、ピンホール、パターン断線、短絡などが代表的なものであるが、これらに限定されるものではない。 A plurality of types of classification categories are set in advance corresponding to a plurality of types of defects that may occur in the substrate S. For example, foreign materials, scratches, bubbles, pinholes, pattern disconnections, short circuits, and the like are representative. However, it is not limited to these.

図４は欠陥分類装置により実行される事前学習処理を示すフローチャートである。事前学習処理は、種々の欠陥を撮像した教師画像と、ユーザから教示される、当該教師画像がどの分類カテゴリに分類すべきものであるかを示す教示情報とに基づいて行われる。欠陥の典型的な外観を示す教師画像が多いほど学習効果が向上し、分類精度を高めることが可能である。一方、多数の教師画像の各々に対し分類カテゴリを割り当てる教示作業については、装置が未学習である以上自動的に処理することができず、経験を有するユーザの目視による判断に頼らざるを得ない。教師画像が多いと、単にそのためのユーザの作業量が増大するだけでなく、教示のばらつきに起因する誤分類を抑制するために各教師画像に対し矛盾のない教示を行うことが求められ、このためのユーザの作業負担が著しく重くなる。 FIG. 4 is a flowchart showing a pre-learning process executed by the defect classification apparatus. The pre-learning process is performed based on a teacher image obtained by imaging various defects and teaching information indicating which classification category the teacher image should be classified, which is taught by the user. The more teacher images that show the typical appearance of the defect, the better the learning effect and the higher the classification accuracy. On the other hand, teaching work for assigning a classification category to each of a large number of teacher images cannot be processed automatically as long as the device is unlearned, and the user must rely on visual judgment of an experienced user. . When there are many teacher images, not only does the amount of work for the user increase, but it is also required to teach each teacher image consistently in order to suppress misclassification caused by variations in teaching. Therefore, the work burden on the user is significantly increased.

そこで、この実施形態の事前学習処理は、ユーザ負担を軽減しユーザによる効率的な教示作業を可能とするために、ユーザに対し各種の支援機能が提供されるように構成されている。具体的には、ユーザに対して、分類カテゴリをなす欠陥種別の特定と、それらの分類カテゴリの各々について当該分類カテゴリの特徴が強く現れた典型例（典型画像）を多数の教師画像から１枚ずつ選出する作業とが求められる。これらがユーザにより指定されると、欠陥分類装置３は、他の教師画像について、与えられた情報に基づいて画像相互間の類似度合いを判定し、その結果を可視化してユーザに提示する。ユーザは提示された情報を参考にして、他の画像に対する教示入力や分類カテゴリの設定の再検討などを行うことが可能となり、何ら手がかりが与えられない状態での教示作業に比べ、遥かに少ない作業負担で効率的に教示作業を行うことができる。このための処理内容について、以下に説明する。 Therefore, the pre-learning process of this embodiment is configured such that various support functions are provided to the user in order to reduce the burden on the user and enable efficient teaching work by the user. Specifically, for a user, one type of typical example (typical image) in which the characteristics of the classification category appear strongly for each of the classification categories is identified from a large number of teacher images. The work of selecting one by one is required. When these are designated by the user, the defect classification device 3 determines the degree of similarity between the images based on the given information for the other teacher images, visualizes the result, and presents the result to the user. The user can refer to the presented information and review the teaching input for other images and the setting of the classification category, which is far less than teaching work without any clues. Teaching work can be performed efficiently with work load. The processing content for this will be described below.

最初に、教師画像として用いられる複数の欠陥画像が収集される（ステップＳ３０１）。ここでは欠陥検出装置２から出力される欠陥画像を教師画像としてこれを順次記憶部３３に保存することで収集を行うものとするが、例えば予め用意された画像ライブラリや外部のストレージに収集された教師画像データを取得する態様であってもよい。当然ながら、欠陥画像は、分類すべきカテゴリの数よりも多数が必要である。 First, a plurality of defect images used as teacher images are collected (step S301). Here, the defect images output from the defect detection device 2 are collected by sequentially storing them as teacher images in the storage unit 33. For example, the defect images are collected in an image library prepared in advance or in an external storage. It may be an aspect in which teacher image data is acquired. Of course, more defect images are required than the number of categories to be classified.

次に、これらの教師画像を分類すべき分類カテゴリの種別と、分類カテゴリごとに１枚ずつの典型例とを指定することが、表示部３６への適宜のメッセージ表示によりユーザに求められる。ユーザからの指定入力は入力受付部３５により受け付けられる（ステップＳ３０２）。分類カテゴリおよび典型例の指定は事後的に変更することが可能であり、この時点で厳密な判断がユーザに要求されるものではない。もちろん、早期に的確な指定がなされることで、以降の処理はさらに効率的なものとなる。 Next, the user is required to designate the type of classification category for classifying these teacher images and one typical example for each classification category by displaying an appropriate message on the display unit 36. The designation input from the user is received by the input receiving unit 35 (step S302). The designation of the classification category and typical example can be changed after the fact, and a strict judgment is not required from the user at this point. Of course, the subsequent processing becomes more efficient when accurate designation is made early.

続いて、各教師画像の類似度合いを可視化表示するためのマップの次元が設定される（ステップＳ３０３）。後述するように、この実施形態では、典型例として指定された教師画像に対し、他の教師画像がどの程度類似しているかが各種の表示態様で表示される。例えば教師画像の分布を任意の次元のマップ空間での散布図として表すことができるが、その次元が選択可能である。この次元の設定については、ユーザ指定を受け付けるようにしてもよく、また指定された分類カテゴリの種別数に応じて自動的に設定されるようにしてもよい。 Subsequently, a map dimension for visualizing and displaying the degree of similarity of each teacher image is set (step S303). As will be described later, in this embodiment, how similar another teacher image is to the teacher image designated as a typical example is displayed in various display modes. For example, the distribution of the teacher image can be expressed as a scatter diagram in a map space of an arbitrary dimension, and the dimension can be selected. As for the setting of this dimension, user designation may be accepted, or it may be automatically set according to the number of types of the designated classification category.

原理的には、例えば任意の次元の超立方体をマップ空間とすることが可能である。このうち、ユーザが画像の分布を直感的に把握しやすい態様で可視化することのできるマップ空間は、二次元空間における正方形、三次元空間における立方体などである。その他、正方形または立方体を複数用いて、より高次元でのマップ表現も可能である。 In principle, for example, a hypercube having an arbitrary dimension can be used as the map space. Among these, the map space that can be visualized in such a manner that the user can intuitively understand the image distribution is a square in a two-dimensional space, a cube in a three-dimensional space, or the like. In addition, it is possible to express a map in a higher dimension by using a plurality of squares or cubes.

次に、各教師画像の特徴を表す複数種の特徴量が、特徴量算出部３４により算出される（ステップＳ３０４）。各教師画像は、複数の特徴量を要素とする特徴ベクトルによってその特徴が数値表現される。マップ空間での見やすい可視化という目的のためには、各画像の特徴量は所定の数値範囲、例えば０ないし１の範囲で正規化された形式で表現されてもよい。以下、各教師画像の特徴ベクトルはこのように正規化された特徴量によって表されるものとする。 Next, a plurality of types of feature amounts representing the features of each teacher image are calculated by the feature amount calculation unit 34 (step S304). Each teacher image has its features represented numerically by a feature vector having a plurality of feature amounts as elements. For the purpose of easy visualization in the map space, the feature amount of each image may be expressed in a form normalized in a predetermined numerical range, for example, 0 to 1. Hereinafter, it is assumed that the feature vector of each teacher image is represented by the feature amount normalized in this way.

なお、ステップＳ３０２ないしステップＳ３０４の実行順序は上記に限定されず、任意の順序で実行して同じ結果を得ることが可能である。 Note that the execution order of step S302 to step S304 is not limited to the above, and the same result can be obtained by executing in any order.

多次元特徴空間における画像の分布をより低次元のマップ空間に写像し可視化する手法として、Ｔ．コホネン（コホーネン）により提唱された自己組織化マップ（ＳＯＭ）が有名である。ＳＯＭでは、マップ空間を構成する各ノードの参照ベクトルが、入力ベクトルである教師画像の特徴ベクトルに応じて調整されることによりノードが学習を行う。そして、学習されたノードの参照ベクトルに基づいて未知の画像をマッピングする。具体的には、未知の画像の特徴ベクトルと近い参照ベクトルを有するノードに当該画像を配置することで、マップ空間における当該画像の位置が決定される。 As a technique for mapping and visualizing an image distribution in a multidimensional feature space to a lower dimensional map space, T.K. The self-organizing map (SOM) proposed by Kohonen (Kohonen) is famous. In SOM, a node performs learning by adjusting a reference vector of each node constituting a map space according to a feature vector of a teacher image that is an input vector. Then, an unknown image is mapped based on the learned reference vector of the node. Specifically, the position of the image in the map space is determined by arranging the image at a node having a reference vector close to the feature vector of the unknown image.

この実施形態でも、ＳＯＭの基本原理を用いてマッピングを行うことで、相互に類似する画像をマップ空間上の近接位置に配置することで、画像間の類似度が表される。ただし、各ノードの参照ベクトルの初期値の設定方法が公知のＳＯＭとは異なっており、ユーザにより指定された典型例の情報がマッピングに反映されるようになっている。この目的のために、超立方体として設定されたマップ空間の各頂点に対応するノードに対し、ユーザにより指定された典型例の情報に基づいて決定される参照ベクトルの初期値が配置される（ステップＳ３０５）。 Also in this embodiment, by performing mapping using the basic principle of SOM, the similarity between images is expressed by arranging images similar to each other at close positions in the map space. However, the method of setting the initial value of the reference vector of each node is different from the known SOM, and typical information specified by the user is reflected in the mapping. For this purpose, an initial value of a reference vector determined based on typical information specified by the user is arranged for a node corresponding to each vertex of the map space set as a hypercube (step S305).

図５は参照ベクトルの初期値を説明する図である。より具体的には、図５（ａ）はマップ空間の頂点に対応するノードへの参照ベクトルの配置を示し、図５（ｂ）は初期化終了後の各ノードへの参照ベクトルの配置を示す。ここでは、２種類の分類カテゴリがあり、それぞれ１つずつ典型例が指定されているものとする。また、各教師画像はＸＹ平面上の二次元マップ空間に写像されるものとする。 FIG. 5 is a diagram for explaining the initial value of the reference vector. More specifically, FIG. 5 (a) shows the arrangement of reference vectors to nodes corresponding to the vertices of the map space, and FIG. 5 (b) shows the arrangement of reference vectors to each node after completion of initialization. . Here, there are two types of classification categories, and one typical example is designated for each category. Each teacher image is mapped to a two-dimensional map space on the XY plane.

なお、図５（ａ）および図５（ｂ）では、原理を理解しやすくするために、参照ベクトルＲ＝（ｒ1，ｒ2，…，ｒi，…）を構成する複数の要素のうち、一の特徴量に対応する一の要素ｒiのみが図示されている。すなわち参照ベクトルＷがあたかもスカラー量のように表現されているが、ＳＯＭの原理に則り、実際にはベクトル量としての参照ベクトルＲがマップ空間（この場合はＸＹ平面）に配置される。 5 (a) and 5 (b), in order to facilitate understanding of the principle, one of a plurality of elements constituting the reference vector R = (r1, r2,..., Ri,. Only one element ri corresponding to the feature quantity is shown. That is, the reference vector W is expressed as if it were a scalar quantity. However, in accordance with the SOM principle, the reference vector R as a vector quantity is actually arranged in the map space (in this case, the XY plane).

この実施形態では、図５（ａ）に示すように、マップ空間の各頂点のうち、マップ空間において最も離れた位置にある（この例では正方形の対角に位置する）２つの頂点に対応するノードに、各要素ｒi（ｉ＝１，２，…）の値が全ての教師画像の特徴ベクトルの当該要素ｒiの値の最小値以下である参照ベクトルＲmin、および各要素ｒi（ｉ＝１，２，…）の値が全ての教師画像の特徴ベクトルの当該要素ｒiの値の最大値以上である参照ベクトルＲmaxがそれぞれ配置される。 In this embodiment, as shown in FIG. 5A, among the vertices of the map space, corresponding to the two vertices that are located farthest in the map space (in this example, located at the diagonal of the square). The node includes a reference vector Rmin in which the value of each element ri (i = 1, 2,...) Is less than or equal to the minimum value of the element ri of the feature vectors of all teacher images, and each element ri (i = 1, 1). 2,...) Are respectively arranged with reference vectors Rmax whose value is equal to or greater than the maximum value of the element ri of the feature vectors of all the teacher images.

参照ベクトルＲminとしては、例えばゼロベクトル（０，０，…，０）を用いることができる。また、参照ベクトルＲmaxとしては、例えば正規化された特徴量の最大値１を各要素の値とするベクトル（１，１，…，１）を用いることができる。 For example, a zero vector (0, 0,..., 0) can be used as the reference vector Rmin. Further, as the reference vector Rmax, for example, a vector (1, 1,..., 1) having the normalized maximum feature value 1 as the value of each element can be used.

正方形マップ空間の他の２つの頂点に対応するノードには、ユーザにより指定された２つの分類カテゴリ（例えば「気泡」と「異物」）に対応する２つの典型例の特徴ベクトルと等しい参照ベクトルＲ1、Ｒ2がそれぞれ配置される。このような配置を可能とするために、マップ空間は、分類カテゴリの種類数Ｎ（Ｎは２以上の整数）に対して（Ｎ＋２）以上の頂点を有する超立方体であることが必要である。言い換えれば、分類カテゴリの種類数Ｎが決まればマップ空間の次元を決定することができる。 The nodes corresponding to the other two vertices of the square map space include reference vectors R1 equal to the two typical feature vectors corresponding to the two classification categories (eg, “bubble” and “foreign matter”) specified by the user. , R2 are arranged respectively. In order to enable such an arrangement, the map space needs to be a hypercube having (N + 2) or more vertices with respect to the number N of classification categories (N is an integer of 2 or more). In other words, if the number N of types of classification categories is determined, the dimension of the map space can be determined.

上記のように分類カテゴリ数Ｎが２のとき、マップ空間は二次元以上、例えば二次元平面上の適宜の形状、例えば正方形とすることができる。また、分類カテゴリ数Ｎが３ないし６のとき、マップ空間は三次元以上、例えば立方体とすることができる。また例えば、分類カテゴリ数Ｎが７ないし１４のとき、マップ空間は四次元以上の超立方体とすることができる。 As described above, when the classification category number N is 2, the map space can be two or more dimensions, for example, an appropriate shape on a two-dimensional plane, for example, a square. When the classification category number N is 3 to 6, the map space can be three-dimensional or more, for example, a cube. For example, when the classification category number N is 7 to 14, the map space can be a four-dimensional or higher hypercube.

Ｎ個の典型例に対し、マップ空間の頂点の数が（Ｎ＋２）より大きいとき、典型例が配置されない頂点が生じる。このような頂点には、参照ベクトルＲmin、Ｒmaxおよび典型例の特徴ベクトルとは異なる適宜の参照ベクトル、例えば全教師画像の特徴ベクトルから求めた平均的なベクトルに等しい参照ベクトルを配置することができる。 For N typical examples, when the number of vertices in the map space is greater than (N + 2), there will be vertices where no typical examples are placed. An appropriate reference vector different from the reference vectors Rmin, Rmax and the feature vector of the typical example, for example, a reference vector equal to an average vector obtained from the feature vectors of all the teacher images can be arranged at such vertices. .

このようにして各頂点に対応するノードに配置された参照ベクトルＲ1、Ｒ2、Ｒmin、Ｒmaxから、マップ空間内の他の各ノードの参照ベクトルが初期化される（ステップＳ３０６）。すなわち、図５（ａ）において格子点として表される各ノードの参照ベクトルの初期値が求められる。具体的には、図５（ａ）に示すように配置された参照ベクトルをあたかも空間に配置された電荷のように扱い、これらの参照ベクトルを各位置に置くことでマップ空間にポテンシャル場が形成されると仮定する。 In this way, the reference vectors of other nodes in the map space are initialized from the reference vectors R1, R2, Rmin, Rmax arranged at the nodes corresponding to the vertices (step S306). That is, the initial value of the reference vector of each node represented as a grid point in FIG. Specifically, the reference vectors arranged as shown in FIG. 5 (a) are treated as if they were electric charges arranged in the space, and a potential field is formed in the map space by placing these reference vectors at each position. Assume that

空間内の電位分布は、空間内での電荷配置（あるいは特定の位置における電位）を境界条件とするポアソン方程式の解として求められることが知られている。特に静電場はラプラス方程式によって記述され、空間内各部の電位はその近傍の電位の平均値によって表される。そして、電位分布を数値計算する方法としては、例えば、各部の電位をその近傍の電位の平均値によって記述し、全ての位置で電位の変化がなくなるまでこれを繰り返す方法（ガウス＝ザイデル法）が知られている。このような知見は周知であるため、計算式の導出は省略する。 It is known that the potential distribution in the space can be obtained as a solution of the Poisson equation having the charge arrangement in the space (or the potential at a specific position) as a boundary condition. In particular, the electrostatic field is described by the Laplace equation, and the potential of each part in the space is represented by the average value of the potential in the vicinity. As a method for calculating the potential distribution numerically, for example, there is a method (Gauss = Seidel method) in which the potential of each part is described by an average value of potentials in the vicinity and this is repeated until there is no change in potential at all positions. Are known. Since such knowledge is well known, derivation of the calculation formula is omitted.

この考え方をマップ空間における参照ベクトルに適用すると、
（１）マップ空間の頂点に対応するノードに、典型例の特徴ベクトルに対応する参照ベクトルを配置する、
（２）頂点以外の各ノードにゼロベクトルを配置する、
（３）頂点以外の各ノードの参照ベクトルを、近傍のノードの参照ベクトルの平均に置き換える、
（４）全てのノードの参照ベクトルについて変化がなくなるまで、上記（３）を繰り返す、
という処理手順が得られる。上記手順で各ノードの参照ベクトルを求めることにより、参照ベクトルが初期化される。 Applying this idea to a reference vector in map space,
(1) A reference vector corresponding to a typical feature vector is arranged at a node corresponding to a vertex of the map space.
(2) Place a zero vector at each node other than the vertex.
(3) Replace the reference vector of each node other than the vertex with the average of the reference vectors of neighboring nodes.
(4) Repeat (3) until there is no change in the reference vectors of all nodes.
The processing procedure is obtained. By obtaining the reference vector of each node in the above procedure, the reference vector is initialized.

上記（１）ないし（４）の処理により各ノードの参照ベクトルを初期化することで、各ノードの参照ベクトルの初期値が決定される。このとき、ＸＹ平面において各ノードの参照ベクトルの当該要素ｒiがなすスカラー場（ポテンシャル場）は、図５（ｂ）に示すようにマップ空間において比較的凹凸の少ない滑らかな曲面となる。すなわち、各参照ベクトルがなすベクトル場は滑らかな超曲面となる。 By initializing the reference vector of each node by the above processes (1) to (4), the initial value of the reference vector of each node is determined. At this time, the scalar field (potential field) formed by the element ri of the reference vector of each node on the XY plane is a smooth curved surface with relatively little unevenness in the map space, as shown in FIG. That is, the vector field formed by each reference vector is a smooth hypersurface.

こうして得られた参照ベクトルを初期値として、ＳＯＭと同様の考え方で他の教師画像を用いた学習が行われる。すなわち、典型例以外の各教師画像は、当該教師画像の特徴ベクトルと最も近い参照ベクトルを有するノードに分類される（ステップＳ３０７）。特徴ベクトルと参照ベクトルとの距離については、例えば両ベクトルの内積の大きさによって評価することができ、各ノードのうち、教師画像の特徴ベクトルとの内積が最も大きくなるような参照ベクトルを有するノードが、当該教師画像が分類されるべきノードであるということができる。 Learning using other teacher images is performed using the reference vector thus obtained as an initial value in the same way as in SOM. That is, each teacher image other than the typical example is classified into a node having a reference vector closest to the feature vector of the teacher image (step S307). The distance between the feature vector and the reference vector can be evaluated by, for example, the size of the inner product of both vectors, and among these nodes, the node having a reference vector that maximizes the inner product with the feature vector of the teacher image However, it can be said that the teacher image is a node to be classified.

そして、教師画像が分類されたノードおよびその近傍のノードの参照ベクトルが、当該ノードに分類された教師画像の特徴ベクトルに近づくように更新される（ステップＳ３０８）。学習結果が教師画像の選択順序の影響を受けないようにするために、全ての教師画像の分類が完了してから、各ノードに分類された特徴ベクトルを用いて参照ベクトルが更新される。１つのノードに複数の教師画像が分類されている場合には、それらの特徴ベクトルを平均したベクトルが用いられる。これは、いわゆるバッチ学習型ＳＯＭの考え方と同様である。 Then, the reference vector of the node into which the teacher image is classified and the neighboring nodes are updated so as to approach the feature vector of the teacher image classified into the node (step S308). In order to prevent the learning result from being influenced by the selection order of the teacher images, the reference vectors are updated using the feature vectors classified into the respective nodes after the classification of all the teacher images is completed. When a plurality of teacher images are classified into one node, a vector obtained by averaging the feature vectors is used. This is the same as the concept of so-called batch learning type SOM.

ステップＳ３０７およびＳ３０８の処理は、所定の収束条件、例えば全ての参照ベクトルの変化がなくなるまで繰り返される（ステップＳ３０９）。収束条件が成立すると（ステップＳ３０９においてＹＥＳ）、各ノードの学習が終了し、最終的に得られた教師画像の分類結果が、言わば改良された自己組織化マップとして表示部３６に表示出力され、ユーザに提示される（ステップＳ３１０）。 Steps S307 and S308 are repeated until there is no change in a predetermined convergence condition, for example, all reference vectors (step S309). When the convergence condition is satisfied (YES in step S309), learning of each node is finished, and the classification result of the teacher image finally obtained is displayed and output on the display unit 36 as an improved self-organizing map, Presented to the user (step S310).

図６および図７は分類結果の表示出力例を示す図である。図６（ａ）に示す例では、マップ空間のノードと当該ノードに分類される教師画像の数が５行×５列のマトリクステーブルとして表示されている。すなわち、図６（ａ）の各枠内では、左上にノードを識別するためのノード番号が示され、右下に当該ノードに分類された教師画像の数が示されている。各ノードに対応する枠はマップ空間での位置に応じた配置で表示される。参照ベクトルＲ1に対応する典型例Ｆ1は左上のノード番号００の枠に含まれ、参照ベクトルＲ2に対応する典型例Ｆ2は右下のノード番号４４の枠に含まれる。図において点線で囲んだ箇所は多くの教師画像が集まってクラスタを形成している部分を示す。 6 and 7 are diagrams showing examples of display output of classification results. In the example shown in FIG. 6A, the map space nodes and the number of teacher images classified into the nodes are displayed as a matrix table of 5 rows × 5 columns. That is, in each frame of FIG. 6A, a node number for identifying a node is shown in the upper left, and the number of teacher images classified into the node is shown in the lower right. The frame corresponding to each node is displayed in an arrangement according to the position in the map space. The typical example F1 corresponding to the reference vector R1 is included in the frame of the node number 00 at the upper left, and the typical example F2 corresponding to the reference vector R2 is included in the frame of the node number 44 at the lower right. In the figure, a portion surrounded by a dotted line indicates a portion where many teacher images are gathered to form a cluster.

これらの典型例Ｆ1，Ｆ2を含むノードおよびその近傍のノードには多くの教師画像が含まれるクラスタが形成されており、指定された典型例が概ね適切なものであったことが示される。中でも、典型例Ｆ1を含む左上のノード００は特に多くの教師画像が集まるクラスタの中心となっており、典型例Ｆ1がこの分類カテゴリの典型例としてふさわしいものであることが示される。また、ノード００およびそれに極めて近いノードに分類された教師画像については、典型例Ｆ1と同一の分類カテゴリ（例えば気泡）が付与されるのが適切であることが、この結果からわかる。 A cluster including many teacher images is formed in the nodes including these typical examples F1 and F2 and the nodes in the vicinity thereof, indicating that the specified typical examples are generally appropriate. In particular, the upper left node 00 including the typical example F1 is the center of a cluster in which a large number of teacher images are gathered, indicating that the typical example F1 is suitable as a typical example of this classification category. Further, it can be seen from this result that it is appropriate to assign the same classification category (for example, bubble) to the teacher image classified into the node 00 and a node very close to the node 00.

図６（ａ）に示されたテーブルの表示画面は、例えばユーザがノード００を選択したとき、図６（ｂ）に示されるように、ノード００に分類された画像Ｉｍの一覧を表示する画面ウィンドウＷに切り替わるようにすることができる。選択されたノードがユーザ指定の典型例を含むノードである場合には、典型例Ｉｃと他の画像Ｉｍとをユーザが画面上で対比することができるように、これらを同一画面に並べて表示することがより好ましい。 The display screen of the table shown in FIG. 6A is a screen that displays a list of images Im classified into the node 00 as shown in FIG. 6B when the user selects the node 00, for example. The window W can be switched. When the selected node is a node including a typical example designated by the user, the typical example Ic and another image Im are displayed side by side on the same screen so that the user can compare them on the screen. It is more preferable.

ユーザは、一覧表示された画像を確認して問題がなければ、カーソルＣを操作してそれらを一括して選択し画面左の分類カテゴリフォルダのいずれか（例えば「気泡」）に移動させることで、複数の画像に対し一括して教示を行うことができる。一方、同じカテゴリへ分類することが適当でない画像が含まれていれば選択から除外したり、他の分類カテゴリへ教示したりすることができる。このようにすることで、１つ１つの教師画像を目視判断する従来技術に比べ、ユーザによる教示作業は極めて効率的なものとなり、ユーザの作業負担は軽減される。 If there is no problem in checking the images displayed in the list, the user operates the cursor C to select them all at once and move them to one of the classification category folders (for example, “bubble”) on the left side of the screen. It is possible to teach a plurality of images at once. On the other hand, if an image that is not appropriate to be classified into the same category is included, it can be excluded from selection or taught to another category. By doing in this way, the teaching work by the user becomes extremely efficient as compared with the conventional technique in which each teacher image is visually determined, and the work burden on the user is reduced.

一方、図６（ａ）において、典型例Ｆ2が分類されたノード４４を含む右下に形成されたクラスタでは、当該ノード４４はクラスタの中心から少し外れており、むしろノード３３に多くの教師画像が集まってクラスタの中心となっていると読み取れる。したがって、当該分類カテゴリ（例えば「異物」）の典型例としては、ノード３３に分類された教師画像Ｆ2ａから選ばれるのが好ましいことが示唆される。この場合にも、ノード３３に分類された教師画像を一覧画面に表示させて、より好ましい典型例を探し直すことができる。 On the other hand, in FIG. 6A, in the cluster formed in the lower right including the node 44 in which the typical example F2 is classified, the node 44 is slightly off the center of the cluster. Can be read as the center of the cluster. Therefore, it is suggested that a typical example of the classification category (for example, “foreign matter”) is preferably selected from the teacher image F2a classified in the node 33. Also in this case, the teacher image classified into the node 33 can be displayed on the list screen, and a more preferable typical example can be searched again.

図６（ａ）からはさらに、テーブルの右上近くに、典型例Ｆ1，Ｆ2をそれぞれ含むクラスタとは離隔してもう１つのクラスタが形成されていることが読み取れる。このことは、左上のクラスタが示す「気泡」、右下のクラスタが示す「異物」とは別の欠陥種別を含んだ教師画像が相当数あることを示唆するものと言える。したがって、このクラスタに対応する分類カテゴリ（例えば「傷」）と、当該クラスタから選出した典型例Ｆ3とを新たに指定することができるようにすれば、このような欠陥種別にも事後的に対応して分類を行うことが可能となる。 Further, it can be seen from FIG. 6A that another cluster is formed near the upper right corner of the table apart from the clusters including the typical examples F1 and F2. This suggests that there are a considerable number of teacher images including defect types different from the “bubbles” indicated by the upper left cluster and the “foreign matter” indicated by the lower right cluster. Therefore, if it is possible to newly designate a classification category (for example, “scratch”) corresponding to this cluster and a typical example F3 selected from the cluster, such a defect type can be dealt with later. Thus, classification can be performed.

この他、少数の教師画像がさらに小さなクラスタを形成し、あるいはどのクラスタにも属さず孤立するなど種々の態様で、全ての教師画像がマップ空間上に配置される。こうして、典型例とされた画像に対し他の教師画像がどの程度類似しているか、あるいは乖離しているかがマップ上の分布として明示される。 In addition, all the teacher images are arranged on the map space in various modes such that a small number of teacher images form a smaller cluster or are isolated without belonging to any cluster. In this way, it is clearly shown as a distribution on the map how much the other teacher images are similar to or dissociated from the typical image.

図７（ａ）に示す表示例では、教師画像の分類結果がＸＹ平面上の散布図として表示される。また図７（ｂ）に示す表示例では、ノードごとの教師画像の数がグラフとして示される。なお、図７（ａ）では各ドットが１つの教師画像に対応しておりドット数が画像の数を表すが、各ノードに分類された教師画像の数に応じた大きさの円によって画像の分布が表されてもよい。これらの表示態様によっても、指定された典型例の近傍やそれから離隔した位置に他の教師画像がどのように分布しているかをユーザは容易に読み取ることが可能である。ユーザの目的に応じて、これらの表示態様が適宜切り替えられることができるようにすればなお好ましい。 In the display example shown in FIG. 7A, the classification result of the teacher image is displayed as a scatter diagram on the XY plane. In the display example shown in FIG. 7B, the number of teacher images for each node is shown as a graph. In FIG. 7A, each dot corresponds to one teacher image, and the number of dots represents the number of images, but the image is represented by a circle having a size corresponding to the number of teacher images classified into each node. A distribution may be represented. Also in these display modes, the user can easily read how other teacher images are distributed in the vicinity of the designated typical example or in a position separated from the designated typical example. It is more preferable that these display modes can be appropriately switched according to the purpose of the user.

図８は三次元マップ空間での表示出力例を示す図である。マップ空間の次元が三次元である場合も、ＸＹＺ空間内の立方体の対角線上に当たる２つの頂点に、参照ベクトルＲmin，Ｒmaxがそれぞれ配置される。他の６つの頂点には、最大６種類までの分類カテゴリに対応する典型例がそれぞれ配置される。 FIG. 8 is a diagram showing a display output example in the three-dimensional map space. Even when the dimension of the map space is three-dimensional, the reference vectors Rmin and Rmax are arranged at two vertices corresponding to the diagonal line of the cube in the XYZ space. At the other six vertices, typical examples corresponding to a maximum of six classification categories are respectively arranged.

頂点に配置された典型例に類似した教師画像群はマップ空間において当該頂点の近傍に集まってクラスタを形成する。類似した画像が多数あれば大きなクラスタが形成され、少なければ小さなクラスタが形成される。また、典型例とは異なる特徴において類似する画像群は、頂点とは異なる位置にクラスタを形成する。このようなマップ表示がなされることで、二次元マップの場合と同様、ユーザは、指定した典型例に類似する画像がどの程度存在するか、あるいは典型例として指定しなかったが大きなクラスタを形成するような画像群があるか否か等を直感的に把握することができる。 A group of teacher images similar to the typical example arranged at the vertex gathers in the vicinity of the vertex in the map space to form a cluster. A large cluster is formed if there are many similar images, and a small cluster is formed if there are few similar images. In addition, an image group similar in features different from the typical example forms a cluster at a position different from the vertex. By displaying such a map, as in the case of a two-dimensional map, the user forms how many images similar to the specified typical example exist, or forms a large cluster that was not specified as a typical example. It is possible to intuitively grasp whether or not there is an image group that does.

なお、典型例が配置された頂点以外では、当該頂点に配置された参照ベクトルは人工的に作成されたものであり、必ずしも実在しない仮想的な画像の特徴ベクトルに対応するものとなっている。このため、当該頂点の近傍に集まってくる画像が少なくなることもあり得る。 In addition to the vertices where the typical examples are arranged, the reference vectors arranged at the vertices are artificially created and correspond to feature vectors of virtual images that do not necessarily exist. For this reason, the image gathering in the vicinity of the said vertex may decrease.

また、異なる頂点に配置された複数の典型例を含むようにしてそれぞれ形成されるクラスタが一体となってそれらの境界が不明瞭となる場合もあり得る。このような場合には、例えばそれらの典型例がより離れた頂点に配置されるようにして再学習を行うことで、境界をより明確にすることができる場合もある。 In addition, clusters formed respectively so as to include a plurality of typical examples arranged at different vertices may be integrated to make their boundaries unclear. In such a case, for example, the boundary may be made clearer by performing relearning so that the typical examples are arranged at more distant vertices.

図４に戻って、この実施形態の事前学習処理では、典型例の再指定や分類カテゴリの新設を可能となっている。すなわち、分類カテゴリや典型例の再指定が必要なときには（ステップＳ３１１においてＹＥＳ）、ステップＳ３０２に戻り、分類カテゴリおよび典型例の指定を新たに受け付ける。一方、再指定が必要なければ、ユーザは上記のように表示される分類結果を参考情報としながら、未教示の教師画像について分類されるべき分類カテゴリを指定入力する教示作業を行うことができる。 Returning to FIG. 4, in the pre-learning process of this embodiment, it is possible to re-specify a typical example and newly establish a category. That is, when it is necessary to re-specify the classification category or typical example (YES in step S311), the process returns to step S302, and the designation of the classification category and typical example is newly accepted. On the other hand, if re-designation is not necessary, the user can perform a teaching operation of designating and inputting a classification category to be classified for an unteached teacher image while using the classification result displayed as described above as reference information.

全ての教師画像について教示入力が受け付けられると（ステップＳ３１２）、各教師画像の特徴量と、当該教師画像に付与された分類カテゴリに関する教示情報とに基づき、欠陥分類部３２が適宜の機械学習アルゴリズムを実行して分類器を構成する（ステップＳ３１３）。これにより事前学習処理は完了し、こうして構成された分類器が、未知の欠陥画像の分類に適用される。 When the teaching input is received for all the teacher images (step S312), the defect classification unit 32 selects an appropriate machine learning algorithm based on the feature amount of each teacher image and the teaching information about the classification category assigned to the teacher image. To configure a classifier (step S313). Thus, the pre-learning process is completed, and the classifier thus configured is applied to the classification of the unknown defect image.

以上のように、この実施形態では、教師あり学習により構成された分類器を用いて未知の欠陥画像を分類することで、欠陥種別の典型例との類似度合いに基づく欠陥分類を行うことができる。多数の教師画像に対する教示作業はユーザにとって大きな負担となり得るが、この実施形態では、ユーザが各分類カテゴリに対し１つずつ典型例を指定すれば、他の教師画像の典型例との類似度合いが自動的に判定され、その結果が可視化されて提示される。そのため、ユーザによる教示作業の負担は大きく軽減され、教示作業を効率的に行うことが可能となる。すなわち、この実施形態は、ユーザが行う教示作業に対し極めて効果的な支援を提供することができる。 As described above, in this embodiment, by classifying an unknown defect image using a classifier configured by supervised learning, it is possible to perform defect classification based on the degree of similarity with a typical defect type. . Although teaching work for a large number of teacher images can be a heavy burden on the user, in this embodiment, if the user designates one typical example for each classification category, the degree of similarity with typical examples of other teacher images is increased. It is automatically determined and the result is visualized and presented. Therefore, the burden of the teaching work by the user is greatly reduced, and the teaching work can be performed efficiently. That is, this embodiment can provide extremely effective support for the teaching work performed by the user.

判定結果を可視化する手法としてはＳＯＭの考え方が用いられるが、マップ空間の参照ベクトルの初期値の設定において、本実施形態は従来のＳＯＭとは異なっている。従来のＳＯＭアルゴリズムは教師なし学習を可能とするためのものである。基本的なＳＯＭの手法では、マップ空間の各ノードの参照ベクトルが乱数により初期化される。このため、初期化された参照ベクトルによりマップ空間に形成されるベクトル場は、原理的に多くの凹凸を含むものとなる。 Although the SOM concept is used as a method for visualizing the determination result, the present embodiment is different from the conventional SOM in setting the initial value of the reference vector in the map space. The conventional SOM algorithm is for enabling unsupervised learning. In the basic SOM method, the reference vector of each node in the map space is initialized with a random number. For this reason, the vector field formed in the map space by the initialized reference vector includes many irregularities in principle.

教師画像の特徴ベクトルが入力ベクトルとして与えられると、当該教師画像はその特徴ベクトルと最も近い参照ベクトルを有するノードに分類される。上記のように参照ベクトルがランダムに初期化されている場合、特徴ベクトルの差異が比較的小さい教師画像同士がマップ空間上で大きく離れた位置に配置されてしまうことがあり得る。特に学習の初期段階でこれが顕著である。このため、似通った特徴を有する２つの画像でありながらそれらが互いに異なるクラスタを形成することがあり、ユーザが指定する典型例を基準とした分類には適さない。 When a feature vector of a teacher image is given as an input vector, the teacher image is classified into a node having a reference vector closest to the feature vector. When the reference vectors are initialized at random as described above, teacher images with relatively small feature vector differences may be arranged at positions that are largely separated on the map space. This is particularly noticeable in the early stages of learning. For this reason, although two images having similar characteristics may form different clusters, they are not suitable for classification based on typical examples designated by the user.

なおＳＯＭでは、全入力データの主成分分析結果に基づいて参照ベクトルを整然と初期配置するという手法も採られるが、それにより得られる結果が人間の主観により指定される典型例を中心とした分類と一致する確率は極めて低い。このように、ユーザから何らの教示も受けない教師なし学習では、ユーザの感覚に見合った分類方法を提供することが困難である。 In SOM, a method of orderly placing reference vectors in an orderly manner based on the principal component analysis results of all input data is also used. However, the results obtained thereby are classified based on typical examples specified by human subjectivity. The probability of matching is very low. Thus, in unsupervised learning that does not receive any teaching from the user, it is difficult to provide a classification method that matches the user's sense.

これに対し、この実施形態では、参照ベクトルの初期値を決定する段階で最小限のユーザ入力、すなわち幾つかの分類カテゴリの特定と、各分類カテゴリに対する１つずつの典型例の指定とを受け付ける。そして、これらの情報に基づいてマップ空間での参照ベクトルが初期化される。 On the other hand, in this embodiment, at the stage of determining the initial value of the reference vector, a minimum user input, that is, identification of several classification categories and designation of one typical example for each classification category are accepted. . Based on these pieces of information, a reference vector in the map space is initialized.

具体的には、マップ空間を構成する超立方体において最も遠い２つの頂点に対応するノードに、各要素の値が全ての教師画像の特徴ベクトル以下である参照ベクトルＲminと、各要素の値が全ての教師画像の特徴ベクトル以上である参照ベクトルＲmaxとを配置する。超立方体の対角に当たる２つの頂点にこれらの参照ベクトルを配置することで、各教師画像の特徴ベクトルはマップ空間内の広い範囲に分布することになり、マップ空間のダイナミックレンジを有効に利用することができる。 Specifically, a node corresponding to the two farthest vertices in the hypercube constituting the map space has a reference vector Rmin in which the value of each element is equal to or less than the feature vector of all the teacher images, and the value of each element is all And a reference vector Rmax that is equal to or greater than the feature vector of the teacher image. By arranging these reference vectors at the two vertices corresponding to the diagonal of the hypercube, the feature vectors of each teacher image are distributed over a wide range in the map space, and the dynamic range of the map space is effectively used. be able to.

また、超立方体の他の頂点には、ユーザにより指定された典型例の特徴ベクトルに相当する参照ベクトルが配置される。これにより、典型例がマップ空間で互いに離隔して配置されるため、１つの典型例と他の典型例との間に十分な距離を確保して、それぞれが属するクラスタの境界が曖昧になるのを抑制することができる。典型例についてユーザからの教示を受けない従来のＳＯＭでは、このように典型例同士をマップ上で明確に分離しユーザの感覚に合った分類をすることが、原理的に不可能である。 Further, a reference vector corresponding to a typical feature vector specified by the user is arranged at the other vertex of the hypercube. As a result, the typical examples are arranged apart from each other in the map space, so that a sufficient distance is secured between one typical example and the other typical examples, and the boundaries of the clusters to which each belongs are ambiguous. Can be suppressed. In the conventional SOM which does not receive the instruction from the user about the typical example, it is impossible in principle to clearly separate the typical examples on the map and classify them according to the user's sense.

そして、他の各ノードの参照ベクトルの初期値は、こうしてマップ空間の各頂点に対応するノードに配置された参照ベクトルを境界条件とするポアソン方程式の各ノード位置での解として与えられる。このため、初期化されたマップ空間において参照ベクトルにより形成されるベクトル場は比較的凹凸の少ない滑らかな超曲面となる。したがって、特徴ベクトルが似通った２つの教師画像が場の凹凸によって分断される確率は小さく、これらはマップ空間内で近接した位置に配置され単一のクラスタを形成すると期待される。 Then, the initial value of the reference vector of each other node is given as a solution at each node position of the Poisson equation using the reference vector arranged at the node corresponding to each vertex of the map space as a boundary condition. For this reason, the vector field formed by the reference vectors in the initialized map space is a smooth hypersurface with relatively few irregularities. Therefore, the probability that two teacher images having similar feature vectors are divided by the unevenness of the field is small, and they are expected to be arranged at close positions in the map space to form a single cluster.

似た特徴を有する教師画像同士がマップ空間で複数のクラスタを形成した場合、それらが本当に別のクラスタとすべきものか、あるいは初期化の結果として偶然分断されたものかをユーザが見極める必要が生じる。このことはユーザ負担の増大を招き、教示を支援するという目的に反する。ユーザにより指定された典型例をマップ上に離隔配置し、それらの間を滑らかに接続する場を形成するような参照ベクトルを各ノードの初期値として設定することで、このような問題は回避される。 When teacher images with similar characteristics form multiple clusters in the map space, the user needs to determine whether they should really be separate clusters or have been accidentally fragmented as a result of initialization . This increases the burden on the user and goes against the purpose of supporting teaching. Such problems can be avoided by placing the reference examples specified by the user on the map and setting a reference vector that forms a smooth connection field between them as the initial value of each node. The

そして、こうして参照ベクトルが初期化されたノードを、典型例以外の教師画像により学習させることで、典型例と似た特徴を有する教師画像はマップ空間上で典型例の近傍に集まる一方、似ていない教師画像はより遠い位置に現れる。このように、分類カテゴリごとに１つの典型例が指定されれば、他の教師画像については自動的に典型例との関係がマップ空間上で可視化される。ユーザは提示された結果を参考にして教示作業を行うことが可能であり、教師あり学習のための教示作業におけるユーザの負担が大きく軽減されている。 Then, by learning the node with the reference vector initialized in this way from the teacher image other than the typical example, the teacher images having characteristics similar to the typical example are gathered in the vicinity of the typical example on the map space, but similar to each other. No teacher image appears at a farther position. Thus, if one typical example is designated for each classification category, the relationship with the typical example is automatically visualized on the map space for the other teacher images. The user can perform teaching work with reference to the presented result, and the burden on the user in teaching work for supervised learning is greatly reduced.

教師なし学習に基づく分類では、分類結果が必ずしも人間の感覚に沿ったものとならず、ユーザが分類結果に違和感を覚えることがあり得る。一方、教師あり学習に基づく分類では、分類カテゴリ数を予め決めておいたり、全ての教師画像をユーザが目視判断したりする必要がある。本実施形態の分類方法では、これらの欠点が解消されている。 In classification based on unsupervised learning, the classification result does not necessarily conform to human senses, and the user may feel uncomfortable with the classification result. On the other hand, in classification based on supervised learning, the number of classification categories must be determined in advance, or the user must visually determine all the teacher images. In the classification method of this embodiment, these drawbacks are eliminated.

以上説明したように、この実施形態では、図４に示される事前学習処理に本発明にかかる「教示支援方法」が適用されており、そのうちステップＳ３０２ないしＳ３０６が本発明の「初期化工程」に相当し、ステップＳ３０７ないしＳ３０９が本発明の「学習工程」に相当している。そして、ステップＳ３１０が本発明の「出力工程」に相当している。また、図３に示される欠陥分類処理に本発明の「画像分類方法」が適用されており、そのうちステップＳ２０１が本発明の「取得工程」および「教示工程」に相当している。また、ステップＳ２０４が本発明の「分類工程」に相当している。 As described above, in this embodiment, the “teaching support method” according to the present invention is applied to the pre-learning process shown in FIG. 4, and steps S302 to S306 are included in the “initialization process” of the present invention. Steps S307 to S309 correspond to the “learning step” of the present invention. Step S310 corresponds to the “output step” of the present invention. Further, the “image classification method” of the present invention is applied to the defect classification processing shown in FIG. 3, and step S201 of these corresponds to the “acquisition process” and “teaching process” of the present invention. Step S204 corresponds to the “classification step” of the present invention.

また、上記実施形態では、欠陥分類装置３としての機能が実装されたコンピュータが本発明の「コンピュータ」に相当し、表示部３６が本発明の「表示手段」として機能している。 In the above-described embodiment, the computer on which the function as the defect classification device 3 is mounted corresponds to the “computer” of the present invention, and the display unit 36 functions as the “display unit” of the present invention.

なお、本発明は上記した実施形態に限定されるものではなく、その趣旨を逸脱しない限りにおいて上述したもの以外に種々の変更を行うことが可能である。例えば、上記実施形態では、１つの正方形により二次元マップ空間が、また１つの立方体により三次元マップ空間が可視化表現されている。しかしながら、これらを複数使用してより多次元のマップ空間が表示されるようにしてもよい。 The present invention is not limited to the above-described embodiment, and various modifications other than those described above can be made without departing from the spirit of the present invention. For example, in the above embodiment, a two-dimensional map space is visualized by one square, and a three-dimensional map space is visualized by one cube. However, a multi-dimensional map space may be displayed using a plurality of these.

また例えば、上記実施形態は基板Ｓの製造ラインに設置されたインライン型検査システム１であるが、本発明にかかる教示支援方法および画像分類方法については、製造ラインとは独立して設けられたコンピュータシステムにより実現することも可能である。また、前述のように、本発明にかかる教示支援方法および画像分類方法は、一般的なハードウェア構成を有するパーソナルコンピュータやワークステーションに上記処理内容を記述した制御プログラムを実行させることにより実現可能である。したがって、既存の検査装置に追加的に組み込まれる制御プログラムとして、本発明を提供することも可能である。 Further, for example, the above embodiment is the inline type inspection system 1 installed on the production line of the substrate S. However, the teaching support method and the image classification method according to the present invention are provided independently from the production line. It can also be realized by a system. In addition, as described above, the teaching support method and the image classification method according to the present invention can be realized by causing a personal computer or workstation having a general hardware configuration to execute a control program describing the above processing contents. is there. Therefore, the present invention can be provided as a control program that is additionally incorporated in an existing inspection apparatus.

また例えば、上記実施形態では、参照ベクトルＲminとしてゼロベクトルを、参照ベクトルＲmaxとして正規化された要素の値が全て１であるベクトルを用いているが、これらに限定されない。例えば、全ての教師画像の特徴ベクトルを要素（特徴量）ごとに比較し、各要素における最小値（または最大値）を当該要素の値とするようなベクトルを参照ベクトルＲmin（または参照ベクトルＲmax）として用いてもよい。また、上記実施形態では各特徴量を正規化しているが、これは処理の便宜上のものであり必須の要件ではない。 For example, in the above-described embodiment, a zero vector is used as the reference vector Rmin, and a vector whose normalized element values are all 1 is used as the reference vector Rmax. However, the present invention is not limited to this. For example, the feature vectors of all the teacher images are compared for each element (feature amount), and a vector having the minimum value (or maximum value) of each element as the value of the element is referred to as the reference vector Rmin (or reference vector Rmax). It may be used as In the above embodiment, each feature amount is normalized, but this is for the convenience of processing and is not an essential requirement.

また、上記実施形態は半導体基板の欠陥を検査する検査装置であるが、本発明の適用対象は、半導体基板を検査する装置だけでなく、他の対象物、例えばプリント基板やガラス基板等を検査する装置や、各種材料の表面状態を検査する表面検査装置であってもよい。さらに、分類すべき画像は上記のような基板の欠陥画像に限定されず、例えば病理組織や細胞等の生物試料を撮像した画像、人間の顔認識のための画像などであってもよく、このような画像を分類する目的のために、本発明を適用することも可能である。 Moreover, although the said embodiment is an inspection apparatus which test | inspects the defect of a semiconductor substrate, the application object of this invention test | inspects not only the apparatus which test | inspects a semiconductor substrate but other objects, for example, a printed circuit board, a glass substrate, etc. Or a surface inspection device for inspecting the surface condition of various materials. Furthermore, the image to be classified is not limited to the defect image of the substrate as described above, and may be, for example, an image obtained by imaging a biological sample such as a pathological tissue or a cell, an image for human face recognition, and the like. It is also possible to apply the present invention for the purpose of classifying such images.

以上、具体的な実施形態を例示して説明してきたように、この発明にかかる教示支援方法の学習工程は、画像のノードへの分類と分類結果に基づく参照ベクトルの更新とを、所定の収束条件が満たされるまで交互に繰り返すように構成されてもよい。参照ベクトルの１回の更新で目的に応じた初期値が得られることはまれであり、処理を繰り返して学習結果を収束させるようにすることで、より分類精度を高めることができる。 As described above, the learning process of the teaching support method according to the present invention has been described with reference to the specific convergence of the classification into the nodes of the image and the update of the reference vector based on the classification result as described in the specific embodiment. It may be configured to repeat alternately until the condition is satisfied. It is rare that an initial value according to the purpose is obtained by one update of the reference vector, and the classification accuracy can be further improved by repeating the processing so as to converge the learning result.

また例えば、学習工程は、マップ空間の各頂点に配置された参照ベクトルを更新しないように構成されてもよい。こうすることで、頂点ではユーザにより指定された典型画像に対応する参照ベクトルが保持されるので、典型画像に類似した画像によるクラスタをマップ空間の頂点付近に形成させることができ、他の典型例がなすクラスタとの間を明確に分離することができる。 Further, for example, the learning step may be configured not to update the reference vector arranged at each vertex of the map space. By doing this, since the reference vector corresponding to the typical image specified by the user is held at the vertex, a cluster of images similar to the typical image can be formed near the vertex of the map space. Can be clearly separated.

また、初期化工程において、マップ空間の頂点が（Ｎ＋２）より大きいとき、第１および第２の頂点以外のＮ個の頂点に、Ｎ枚の典型画像の各々に対応する特徴ベクトルを参照ベクトルとして配置し、残りの頂点に、Ｍ枚の画像の特徴ベクトルの平均に相当する参照ベクトルを配置するようにしてもよい。このように、典型例が配置された頂点以外の頂点には平均的な参照ベクトルを配置することで、典型例と類似する特徴を有する教師画像が他の頂点に引き寄せられることが抑制され、典型例に類似した画像を典型例の周囲に効果的に集めることができる。 Further, in the initialization step, when the vertex of the map space is larger than (N + 2), feature vectors corresponding to each of N typical images are used as reference vectors for N vertices other than the first and second vertices. The reference vectors corresponding to the average of the feature vectors of the M images may be arranged at the remaining vertices. In this way, by placing an average reference vector at vertices other than the vertices where the typical example is arranged, it is possible to suppress a teacher image having characteristics similar to the typical example from being drawn to other vertices. Images similar to the example can be effectively collected around the typical example.

また例えば、正方形または立方体をマップ空間として、出力工程では、当該マップ空間における画像の分布を表す散布図を表示手段に表示出力するように構成されてもよい。このような表示態様では、各教師画像が典型例とどの程度類似しているか、あるいは各典型例に類似する他の教師画像がどの程度あるかが一目瞭然となる。 Further, for example, a square or a cube may be used as a map space, and in the output step, a scatter diagram representing an image distribution in the map space may be displayed and output on the display means. In such a display mode, it becomes obvious at a glance how much each teacher image is similar to the typical example or how many other teacher images are similar to each typical example.

また例えば出力工程では、一の典型画像と、当該典型画像の近傍にある少なくとも一つの他の画像とを、表示手段の同一画面に表示出力するようにしてもよい。このような表示態様によれば、ユーザは典型画像に類似していると判定された他の画像を典型画像と比較し評価することが可能となる。 Further, for example, in the output step, one typical image and at least one other image in the vicinity of the typical image may be displayed and output on the same screen of the display means. According to such a display mode, the user can evaluate another image determined to be similar to the typical image by comparing with the typical image.

また、本発明にかかる画像分類方法では、教示工程に先立って、取得工程で取得された画像のうちから典型画像の指定入力を受け付けるようにしてもよい。このような構成では、取得工程においては典型画像であるか否かとは関係なく種々の画像を収集することができ、収集された多くの画像を教師画像とすることで、分類性能の優れた分類器を構成することが可能になる。 Further, in the image classification method according to the present invention, prior to the teaching step, a typical image designation input may be received from the images acquired in the acquisition step. With such a configuration, it is possible to collect various images regardless of whether or not they are typical images in the acquisition process, and by using many collected images as teacher images, classification with excellent classification performance is possible. Can be configured.

本発明は、基板の欠陥検査や生物試料の分析、顔認識など、曖昧さやばらつきを有する画像をその特徴に基づいて分類する技術分野に対し、好適に適用可能である。 The present invention can be suitably applied to a technical field in which images having ambiguity and variation are classified based on their characteristics, such as substrate defect inspection, biological sample analysis, and face recognition.

１検査システム
２欠陥検出装置
３欠陥分類装置（コンピュータ）
３２欠陥分類部
３４特徴量算出部
３５入力受付部
３６表示部（表示手段）
２０１撮像部
Ｓ３０２〜Ｓ３０６初期化工程
Ｓ３０７〜Ｓ３０９学習工程
Ｓ３１０出力工程
Ｓ２０１取得工程、教示工程
Ｓ２０４分類工程 DESCRIPTION OF SYMBOLS 1 Inspection system 2 Defect detection apparatus 3 Defect classification apparatus (computer)
32 Defect classification unit 34 Feature amount calculation unit 35 Input reception unit 36 Display unit (display means)
201 Imaging unit S302 to S306 Initialization process S307 to S309 Learning process S310 Output process S201 Acquisition process, teaching process S204 Classification process

Claims

The computer supports teaching work to assign classification categories to M (M is an integer larger than N) images including N typical images corresponding to each of N (N is an integer of 2 or more) types of classification categories. A teaching support method, wherein the computer includes:
An initialization step of initializing a reference vector of each node in a map space having a dimension according to the number of classification categories;
A learning step of classifying each of the images into nodes having a reference vector closest to the feature vector of the image in the map space, and learning each node by updating the reference vector of each node based on the classification result;
Performing an output step of visualizing and presenting the distribution of the image in the map space obtained in the learning step, and in the initialization step,
A hypercube having (N + 2) or more vertices is defined as the map space, and among the vertices,
At the first vertex, a feature vector in which each of the plurality of feature value values constituting the feature vector is equal to or less than a minimum value among the corresponding feature value values in the feature vectors representing the M images,
At the second vertex farthest from the first vertex, each of a plurality of feature amount values constituting the feature vector is the maximum of the corresponding feature amount values in the feature vector representing each of the M images. Feature vectors that are greater than or equal to the value
A feature vector corresponding to each of the typical images at vertices other than the first and second vertices,
Place each as a reference vector of the node corresponding to the vertex,
A teaching support method in which an initial value of a reference vector of each node other than the vertex is given as a solution of a Poisson equation using the reference vector of each vertex as a boundary condition.

The teaching support method according to claim 1, wherein in the learning step, classification of the image into the node and update of a reference vector based on a classification result are alternately repeated until a predetermined convergence condition is satisfied.

The teaching support method according to claim 1, wherein, in the learning step, a reference vector arranged at each vertex of the map space is not updated.

In the initialization step, when the vertex of the map space is larger than (N + 2), refer to feature vectors corresponding to each of the N typical images to N vertices other than the first and second vertices. 4. The teaching support method according to claim 1, wherein the teaching support method is arranged as a vector, and a reference vector corresponding to an average of feature vectors of the M images is arranged at the remaining vertices.

The teaching support method according to claim 1, wherein a square or a cube is used as the map space, and in the output step, a scatter diagram representing the distribution of the image in the map space is displayed and output on a display unit.

6. The teaching support according to claim 1, wherein in the output step, the one typical image and at least one other image in the vicinity of the typical image are displayed and output on the same screen of the display means. Method.

An acquisition step of acquiring a plurality of images;
A teaching step of presenting a distribution of the plurality of images in the map space by the teaching support method according to any one of claims 1 to 6, and receiving a classification category teaching input for each of the plurality of images;
A classification step of classifying images using a classifier that has learned the plurality of images and the taught classification category as teacher data.

The image classification method according to claim 7, wherein a designation input of the typical image is received from images acquired in the acquisition step prior to the teaching step.