JP7236292B2

JP7236292B2 - Evaluation device, evaluation method, evaluation program, and inspection device

Info

Publication number: JP7236292B2
Application number: JP2019042990A
Authority: JP
Inventors: 公紀黒澤; 和宏渡部; 正浩柏木
Original assignee: Fujikura Ltd
Current assignee: Fujikura Ltd
Priority date: 2018-09-07
Filing date: 2019-03-08
Publication date: 2023-03-09
Anticipated expiration: 2039-03-08
Also published as: JP7213701B2; JP2020042001A; JP2020042754A; JP7161948B2; JP7289658B2; JP2020042755A; JP2020042757A

Description

本発明は、物品の状態を分類する分類器を評価する評価装置、評価方法、及び評価プログラムに関する。また、本発明は、物品の検査を行う検査装置に関する。 The present invention relates to an evaluation device, an evaluation method, and an evaluation program for evaluating a classifier that classifies the states of articles. The present invention also relates to an inspection apparatus for inspecting articles.

物品を被写体として含む画像を参照して該物品の状態を分類する技術が広く用いられている。このような技術を、機械学習により構築された分類器を用いて実現する場合、分類器を完成させる過程で、学習済みの分類器を評価する必要がある。 A technique for classifying the state of an article by referring to an image containing the article as a subject is widely used. When implementing such a technique using a classifier constructed by machine learning, it is necessary to evaluate the learned classifier in the process of completing the classifier.

このような問題の解決に資する可能性のある技術としては、例えば、特許文献１に記載の評価装置が挙げられる。この評価装置は、学習目標を受け付ける学習目標受付部と、少なくとも学習目標に含まれる評価項目について分類器の評価を行い、評価データを生成する評価部と、学習目標と評価データとを用いて、分類器が学習目標を達成したか否か判定する判定部とを含む。 As a technology that may contribute to solving such problems, for example, an evaluation device described in Patent Document 1 can be cited. This evaluation device uses a learning goal reception unit that receives a learning goal, an evaluation unit that evaluates the classifier for at least the evaluation items included in the learning goal and generates evaluation data, and the learning goal and the evaluation data, and a determining unit for determining whether the classifier has achieved its learning goal.

特開２０１８－１８１１８４号公報（２０１８年１１月１５日公開）Japanese Patent Application Laid-Open No. 2018-181184 (published on November 15, 2018)

発明者らは、物品の状態を分類する際に分類器が着目する画像内の領域と、物品の状態を分類する際に人が着目する画像内の領域との比較を行った。その結果、一定の汎化能力を有する分類器（例えば、特許文献１の記載の評価装置によって学習目標を達成したと判定された評価装置）であっても、物品の状態を分類する際に人が着目する領域とは全く異なる領域に着目している分類器が存在することが分かった。このような分類器は、限られたテストデータに関して人による分類の結果を偶発的に再現した分類器に過ぎないと考えられる。したがって、このような分類器は、追加学習による汎化能力の向上を期待することができない、謂わば「筋の悪い」の分類器であり、人による物品の分類を代替する分類器になり得ない。このような観点から分類器を評価する技術は、これまで存在していない。もちろん、特許文献１に記載の評価装置も、このような観点から分類器を評価するものではない。 The inventors compared the area in the image that the classifier pays attention to when classifying the state of the article and the area in the image that the person pays attention to when classifying the state of the article. As a result, even with a classifier having a certain generalization ability (for example, an evaluation device that is determined to have achieved a learning goal by the evaluation device described in Patent Document 1), when classifying the state of an article, human It was found that there are classifiers that focus on a completely different region from the one focused on by . Such classifiers are considered to be nothing more than accidental reproductions of human classification results on limited test data. Therefore, such a classifier cannot be expected to improve generalization ability through additional learning, and is a so-called "bad" classifier, and can be a classifier that replaces the classification of articles by humans. do not have. Techniques for evaluating classifiers from this point of view have not existed so far. Of course, the evaluation device described in Patent Literature 1 does not evaluate the classifier from such a point of view.

本発明は、上記の問題に鑑みてなされたものであり、その目的は、物品の状態を分類する分類器を、着目する領域に関する人との類似性の観点から評価することが可能な評価装置、評価方法、及び評価プログラムを実現することにある。また、そのような評価装置を用いた検査装置を実現することにある。 SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and an object of the present invention is to provide an evaluation apparatus capable of evaluating a classifier for classifying the state of an article from the viewpoint of similarity with a person in a region of interest. , an evaluation method, and an evaluation program. Another object of the present invention is to realize an inspection apparatus using such an evaluation apparatus.

上記の課題を解決するために、本発明の一態様に係る評価装置は、物品を被写体として含む画像を参照して該物品の状態を分類する分類処理を、学習により定められたアルゴリズムに従い実行する複数の分類器の各々を評価する評価装置である。該評価装置は、人が状態を分類した物品を被写体として含む画像をサンプル画像として、各分類器に対応するヒートマップであって、上記分類処理において該分類器が着目する該サンプル画像内の領域を示すヒートマップを作成する作成部と、対応するヒートマップが示すサンプル画像内の領域と、物品の状態を分類するために上記人が着目するサンプル画像内の領域との比較に基づいて、各分類器を評価する評価部と、を備えている。 In order to solve the above problems, an evaluation apparatus according to an aspect of the present invention refers to an image including an article as a subject, and performs classification processing for classifying the state of the article according to an algorithm determined by learning. An evaluation device for evaluating each of a plurality of classifiers. The evaluation device uses an image including, as a subject, an article whose state has been classified by a person as a sample image, and a heat map corresponding to each classifier, which is a region in the sample image to which the classifier pays attention in the classification process. Based on the comparison between the area in the sample image indicated by the corresponding heat map and the area in the sample image that the person focuses on in order to classify the state of the article, each an evaluation unit for evaluating the classifier.

上記の課題を解決するために、本発明の一態様に係る評価方法は、物品を被写体として含む画像を参照して該物品の状態を分類する分類処理を、学習により定められたアルゴリズムに従い実行する複数の分類器の各々を評価する評価方法である。該評価方法は、人が状態を分類した物品を被写体として含む画像をサンプル画像として、各分類器に対応するヒートマップであって、上記分類処理において該分類器が着目する該サンプル画像内の領域を示すヒートマップを作成する作成ステップと、対応するヒートマップが示すサンプル画像内の領域と、物品の状態を分類するために上記人が着目するサンプル画像内の領域との比較に基づいて、各分類器を評価する評価ステップと、を備えている。 In order to solve the above problems, an evaluation method according to an aspect of the present invention refers to an image including an article as a subject, and executes classification processing for classifying the state of the article according to an algorithm determined by learning. It is an evaluation method for evaluating each of a plurality of classifiers. The evaluation method uses an image including, as a subject, an article whose state has been classified by a person as a sample image, a heat map corresponding to each classifier, and a region in the sample image to which the classifier focuses in the classification process. and comparing the area in the sample image indicated by the corresponding heat map with the area in the sample image that the person focuses on to classify the state of the article, each and an evaluation step of evaluating the classifier.

上記の課題を解決するために、本発明の一態様に係る評価プログラムは、コンピュータを上述の評価装置として動作させる評価プログラムであって、上記コンピュータを上記評価装置の各部として機能させる。 In order to solve the above problems, an evaluation program according to one aspect of the present invention is an evaluation program that causes a computer to operate as the evaluation device described above, and causes the computer to function as each part of the evaluation device.

上記の課題を解決するために、本発明の一態様に係る検査装置は、物品の検査を行う検査装置である。該検査装置は、上記複数の分類器と、上述の評価装置と、上記評価装置による評価結果に基づいて、上記複数の分類器から何れかの分類器を選択する選択部と、を含み、上記選択部により選択された分類器を用いて、上記物品の検査を行う。 To solve the above problems, an inspection apparatus according to one aspect of the present invention is an inspection apparatus that inspects articles. The inspection device includes the plurality of classifiers, the evaluation device, and a selection unit that selects one of the plurality of classifiers based on the evaluation result of the evaluation device, The article is inspected using the classifier selected by the selection unit.

上記の構成によれば、物品の状態を分類する分類器を、着目する領域に関する人との類似性の観点から評価することができる。 According to the above configuration, it is possible to evaluate a classifier that classifies the state of an article from the viewpoint of similarity with a person regarding the region of interest.

本発明の一態様に係る評価装置において、上記評価部は、対応するヒートマップが示すサンプル画像内の領域の中に、物品の状態を分類するために上記人が着目するサンプル画像内の領域と重複する領域が含まれている分類器を、含まれていない分類器より高く評価する、ことが好ましい。 In the evaluation device according to an aspect of the present invention, the evaluation unit includes an area in the sample image that the person pays attention to for classifying the state of the article, in the area in the sample image indicated by the corresponding heat map. It is preferable to rate classifiers that include overlapping regions higher than classifiers that do not.

上記の構成によれば、物品の状態を分類するために、人が着目する領域と類似する領域に着目する分類器を高く評価することができる。 According to the above configuration, in order to classify the state of an article, it is possible to highly evaluate a classifier that focuses on a region similar to a region that a person focuses on.

本発明の一態様に係る評価装置において、上記評価部は、対応するヒートマップが示すサンプル画像内の領域の中に、物品の状態を分類するために上記人が着目するサンプル画像内の領域と重複しない領域が含まれている分類器を、含んでいない分類器よりさらに高く評価する、ことが好ましい。 In the evaluation device according to an aspect of the present invention, the evaluation unit includes an area in the sample image that the person pays attention to for classifying the state of the article, in the area in the sample image indicated by the corresponding heat map. It is preferable to rate classifiers that contain non-overlapping regions higher than classifiers that do not.

上記の構成によれば、物品の状態を分類するために、人が着目する領域と類似する領域に着目することに加えて、人が着目する領域と類似しない独自の領域に着目する分類器を、高く評価することができる。 According to the above configuration, in order to classify the state of an article, in addition to focusing on areas similar to the area of human attention, a classifier that focuses on a unique area that is not similar to the area of human attention is provided. , can be appreciated.

本発明の一態様に係る評価装置において、人が状態を分類した物品を被写体として含む複数の画像の各々をサンプル画像として、サンプル画像毎に各分類器に対応するヒートマップを作成し、上記評価部は、サンプル画像毎に各分類器を評価した評価結果に基づいて、各分類器を評価する、ことが好ましい。 In the evaluation apparatus according to one aspect of the present invention, each of a plurality of images containing articles classified by a person as subjects is used as a sample image, and a heat map corresponding to each classifier is created for each sample image, and the above evaluation is performed. Preferably, the unit evaluates each classifier based on evaluation results of evaluating each classifier for each sample image.

上記の構成によれば、物品の状態を分類する分類器を、より高い精度で評価することができる。 According to the above configuration, the classifier that classifies the state of the article can be evaluated with higher accuracy.

本発明の一態様に係る評価装置において、上記評価部は、上記複数の分類器のうち、各サンプル画像に被写体として含まれる物品の状態を分類した結果が、上記人が該物品の状態を分類した結果に一致する割合がより高い分類器を、さらに高く評価する、ことが好ましい。 In the evaluation device according to the aspect of the present invention, the evaluation unit classifies the state of the article included in each sample image as a subject in each of the plurality of classifiers so that the person classifies the state of the article. Preferably, classifiers with a higher percentage of matching results are rated higher.

上記の構成によれば、物品の状態を分類するために、人が着目する領域と類似する領域に着目すると共に、分類精度が高い分類器を高く評価することができる。 According to the above configuration, in order to classify the state of an article, it is possible to focus on a region similar to a region that a person pays attention to, and to highly evaluate a classifier with high classification accuracy.

本発明の一態様によれば、物品の状態を分類する分類器を、着目する領域に関する人との類似性の観点から評価することができる。 According to one aspect of the present invention, a classifier that classifies the state of an article can be evaluated in terms of similarity with a person regarding a region of interest.

本発明の第１の実施形態に係る評価装置の物理的構成を示すブロック図である。1 is a block diagram showing the physical configuration of an evaluation device according to a first embodiment of the present invention; FIG. 図１に示す評価装置の機能的構成を示すブロック図である。2 is a block diagram showing the functional configuration of the evaluation device shown in FIG. 1; FIG. 図１に示す評価装置が実行する評価方法Ｓ１の流れを示すフローチャートである。2 is a flowchart showing the flow of an evaluation method S1 executed by the evaluation device shown in FIG. 1; 図３に示す作成ステップにおいて生成されるヒートマップの一例を示す図である。FIG. 4 is a diagram showing an example of a heat map generated in the creation step shown in FIG. 3; FIG. （ａ）～（ｃ）は、各サンプル画像における、検査者による着目領域の一例を示す模式図である。（ｄ）～（ｅ）は、ある分類器に対応する各サンプル画像のヒートマップが示す、該分類器による着目領域を示す模式図である。（ｆ）～（ｈ）は、他の分類器に対応する各サンプル画像のヒートマップが示す、該分類器による着目領域を示す模式図である。(a) to (c) are schematic diagrams showing an example of a region of interest by an inspector in each sample image. (d) to (e) are schematic diagrams showing regions of interest by a certain classifier indicated by heat maps of sample images corresponding to the classifier. (f) to (h) are schematic diagrams showing the region of interest by the other classifier indicated by the heat map of each sample image corresponding to the classifier. 図３に示す評価ステップにおける各分類器の評価の一例を示す図である。It is a figure which shows an example of evaluation of each classifier in the evaluation step shown in FIG. 本発明の第２の実施形態に係る検査装置の機能的構成を示すブロック図である。It is a block diagram showing a functional configuration of an inspection apparatus according to a second embodiment of the present invention. （ａ）は、図７に示す検査装置の実施例において、検査者による着目領域を示す図である。（ｂ）、（ｃ）は、各分類器による着目領域を示す図である。8A is a diagram showing a region of interest by an inspector in the embodiment of the inspection apparatus shown in FIG. 7; FIG. (b) and (c) are diagrams showing regions of interest by each classifier. 図４に示す検査装置の実施例において、評価が低い分類器について説明する図である。FIG. 5 is a diagram illustrating a classifier with a low evaluation in the embodiment of the inspection apparatus shown in FIG. 4;

〔実施形態１〕
（評価装置の物理的構成）
本発明の第１の実施形態に係る評価装置１の物理的構成について、図１を参照して説明する。図１は、評価装置１の物理的構成を示すブロック図である。 [Embodiment 1]
(Physical configuration of evaluation device)
A physical configuration of the evaluation device 1 according to the first embodiment of the present invention will be described with reference to FIG. FIG. 1 is a block diagram showing the physical configuration of the evaluation device 1. As shown in FIG.

評価装置１は、図１に示すように、バス１０と、主メモリ１１と、プロセッサ１２と、補助メモリ１３と、入出力インターフェース１４と、を備えたコンピュータである。主メモリ１１、プロセッサ１２、補助メモリ１３、及び入出力インターフェース１４は、バス１０を介して互いに接続されている。主メモリ１１としては、例えば、単一又は複数の半導体ＲＡＭ（random access memory）が用いられる。プロセッサ１２としては、例えば、単一又は複数のマイクロプロセッサ、単一又は複数のデジタルシグナルプロセッサ、単一又は複数のマイクロコントローラ、又はこれらの組み合わせが用いられる。補助メモリ１３としては、例えば、単一又は複数のＨＤＤ（Hard Disk Drive）、単一又は複数のＳＳＤ（Solid State Drive）、又はこれらの組み合わせが用いられる。また、補助メモリ１３の一部又は全部は、通信インタフェース（図示せず）を介して接続されたネットワーク上のストレージであってもよい。入出力インターフェース１４としては、例えば、ＵＳＢ（Universal Serial Bus）インターフェース、赤外線やBluetooth（登録商標）等の近距離通信インターフェース、又はこれらの組み合わせが用いられる。 The evaluation device 1 is a computer having a bus 10, a main memory 11, a processor 12, an auxiliary memory 13, and an input/output interface 14, as shown in FIG. Main memory 11 , processor 12 , auxiliary memory 13 and input/output interface 14 are interconnected via bus 10 . As the main memory 11, for example, single or multiple semiconductor RAMs (random access memories) are used. Processor 12 may be, for example, a single or multiple microprocessors, a single or multiple digital signal processors, a single or multiple microcontrollers, or a combination thereof. As the auxiliary memory 13, for example, a single or multiple HDDs (Hard Disk Drives), single or multiple SSDs (Solid State Drives), or a combination thereof is used. Also, part or all of the auxiliary memory 13 may be storage on a network connected via a communication interface (not shown). As the input/output interface 14, for example, a USB (Universal Serial Bus) interface, a short-range communication interface such as infrared rays or Bluetooth (registered trademark), or a combination thereof is used.

入出力インターフェース１４には、例えば、入力装置２０及び出力装置３０が接続される。入力装置２０としては、例えば、キーボード、マウス、タッチパッド、マイク、又はこれらの組み合わせ等が用いられる。出力装置３０としては、例えば、ディスプレイ、プリンタ、スピーカ、又はこれらの組み合わせが用いられる。なお、評価装置１は、ノート型コンピュータのように、入力装置２０として機能するキーボート及びタッチパッド、並びに、出力装置３０として機能するディスプレイを内蔵していてもよい。また、評価装置１は、スマートフォン又はタブレット型コンピュータのように、入力装置２０及び出力装置３０として機能するタッチパネルを内蔵していてもよい。 For example, an input device 20 and an output device 30 are connected to the input/output interface 14 . As the input device 20, for example, a keyboard, mouse, touch pad, microphone, or a combination thereof is used. A display, a printer, a speaker, or a combination thereof is used as the output device 30, for example. Note that the evaluation device 1 may incorporate a keyboard and touch pad functioning as the input device 20 and a display functioning as the output device 30, like a notebook computer. Moreover, the evaluation device 1 may incorporate a touch panel functioning as the input device 20 and the output device 30 like a smart phone or a tablet computer.

補助メモリ１３には、後述する評価処理Ｓ１をプロセッサ１２に実行させるためのプログラムＰが格納されている。プロセッサ１２は、補助メモリ１３に格納されたプログラムＰを主メモリ１１上に展開し、主メモリ１１上に展開されたプログラムＰに含まれる各命令を実行することによって、後述する評価処理Ｓ１に含まれる各ステップを実行する。また、補助メモリ１３には、後述する評価処理Ｓ１を実行するためにプロセッサ１２が参照する各種データが格納されている。 The auxiliary memory 13 stores a program P for causing the processor 12 to execute an evaluation process S1, which will be described later. The processor 12 expands the program P stored in the auxiliary memory 13 onto the main memory 11, and executes each instruction included in the program P expanded onto the main memory 11, thereby executing the instructions included in the evaluation process S1, which will be described later. perform each step that is In addition, the auxiliary memory 13 stores various data referred to by the processor 12 in order to execute the evaluation process S1, which will be described later.

なお、ここでは、内部記憶媒体である補助メモリ１３に格納されているプログラムＰに従ってプロセッサ１２が後述する評価処理Ｓ１を実行する形態について説明したが、これに限定されない。すなわち、外部記録媒体に格納されているプログラムＰに従ってプロセッサ１２が後述する評価処理Ｓ１を実行する形態を採用してもよい。この場合、外部記録媒体としては、コンピュータが読み取り可能な「一時的でない有形の媒体」、例えば、テープ、ディスク、カード、半導体メモリ、又はプログラマブル論理回路などを用いることができる。あるいは、通信インタフェース（図示せず）を介して接続されるネットワーク上から取得したプログラムＰに従ってプロセッサ１２が後述する評価処理Ｓ１を実施する形態を採用してもよい。この場合、ネットワークとしては、例えば、インターネット、有線ＬＡＮ（Local Area Network）、無線ＬＡＮ、又はこれらの少なくとも一部の組み合わせ等などを用いることができる。 Here, a form in which the processor 12 executes the evaluation process S1, which will be described later, according to the program P stored in the auxiliary memory 13, which is an internal storage medium, has been described, but the present invention is not limited to this. That is, it is also possible to employ a form in which the processor 12 executes the evaluation process S1, which will be described later, according to the program P stored in the external recording medium. In this case, as the external recording medium, a computer-readable "non-transitory tangible medium" such as a tape, disk, card, semiconductor memory, or programmable logic circuit can be used. Alternatively, an embodiment may be adopted in which the processor 12 performs the evaluation process S1 described later according to a program P obtained from a network connected via a communication interface (not shown). In this case, the network may be, for example, the Internet, a wired LAN (Local Area Network), a wireless LAN, or a combination of at least some of them.

また、ここでは、単一のコンピュータを用いて評価装置１を実現する形態について説明したが、これに限定されない。すなわち、互いに通信可能に構成された複数のコンピュータを用いて評価装置１を実現する形態を採用してもよい。この場合、後述する評価処理Ｓ１を構成する各ステップを、これらのコンピュータにより並列的に実行することが可能になる。 Moreover, although the form which implement|achieves the evaluation apparatus 1 using a single computer was demonstrated here, it is not limited to this. That is, a form in which the evaluation apparatus 1 is realized using a plurality of computers configured to be able to communicate with each other may be adopted. In this case, each step constituting the evaluation process S1, which will be described later, can be executed in parallel by these computers.

（評価装置の機能的構成）
評価装置１の機能的構成について、図２を参照して説明する。図２は、評価装置１の機能的構成を示すブロック図である。 (Functional configuration of evaluation device)
A functional configuration of the evaluation device 1 will be described with reference to FIG. FIG. 2 is a block diagram showing the functional configuration of the evaluation device 1. As shown in FIG.

評価装置１は、図２に示すように、複数の分類器Ｃ１，Ｃ２，…，Ｃｎと、作成部１０１と、評価部１０２とを備えている。ここで、ｎは、分類器Ｃ１，Ｃ２，…，Ｃｎの個数を表す任意の自然数である。これらのブロックは、上述したプロセッサ１２が上述したプログラムＰの命令を実行することにより実現される機能ブロックである。 The evaluation device 1 includes, as shown in FIG. 2, a plurality of classifiers C1, C2, . Here, n is an arbitrary natural number representing the number of classifiers C1, C2, . . . , Cn. These blocks are functional blocks implemented by executing the instructions of the program P described above by the processor 12 described above.

各分類器Ｃｉ（ｉ＝１，２，…，ｎ）は、物品を被写体として含む画像を参照して該物品の状態を分類する分類処理を、機械学習により定められたアルゴリズム（以下、「モデル」と記載する）を用いて実行する。分類処理に利用可能なモデルとしては、例えば、物品を被写体として含む画像を入力とし、該物品の状態を出力とするニューラルネットワークが挙げられる。 Each classifier Ci (i=1, 2, . . . , n) refers to an image containing an article as a subject and performs classification processing for classifying the state of the article using an algorithm determined by machine learning (hereinafter referred to as "model ”). A model that can be used for the classification process is, for example, a neural network that receives an image including an article as an object and outputs the state of the article.

各分類器Ｃｉが用いるモデルは、評価装置１による評価が開始する前に、事前の機械学習により構築される。事前の機械学習においては、（１）モデルの選択、（２）モデルの調整（ネットワーク構造及びハイパーパラメータの調整）、及び、（３）モデルの機械学習が行われる。 The model used by each classifier Ci is constructed by prior machine learning before the evaluation by the evaluation device 1 starts. In prior machine learning, (1) model selection, (2) model tuning (network structure and hyperparameter tuning), and (3) model machine learning are performed.

作成部１０１は、検査者（特許請求の範囲における「人」の一例）が状態を分類した物品を被写体として含む画像Ｉ１，Ｉ２，…，Ｉｍをサンプル画像として、各分類器Ｃｉ（ｉ＝１，２，…，ｎ）に対応するヒートマップＭｉｊ（ｊ＝１，２，…，ｍ）を作成するブロックである。ここで、ヒートマップＭｉとは、分類処理において分類器Ｃｉが着目するサンプル画像Ｉｊ内の領域を示す画像のことを指す。 The creation unit 101 uses images I1, I2, . , 2, . . . , n) corresponding to the heat map Mij (j=1, 2, . Here, the heat map Mi refers to an image showing an area in the sample image Ij focused on by the classifier Ci in the classification process.

例えば、ヒートマップＭｉｊは、特定のクラス分類の確からしさ、又は、関連の深さを、色の濃さで表した画像である。この場合、例えば、ヒートマップＭｉｊにおいて、相対的に色の濃い領域が、サンプル画像Ｉｊにおいて、物品の状態を分類するために分類器Ｃｉが着目した領域を表す。或いは、ヒートマップＭｉｊは、分類器Ｃｉの出力に与える影響の大きさに応じて各画素の画素値が設定された画像である。この場合、例えば、ヒートマップＭｉｊにおいて、相対的に高い画素値を有する領域が、サンプル画像Ｉｊにおいて、物品の状態を分類するために分類器Ｃｉが着目した領域を表す。以下、各サンプル画像Ｉｊにおいて、物品の状態を分類するために分類器Ｃｉが着目する領域を、分類器Ｃｉによる「着目領域」とも記載する。分類器Ｃｉによる着目領域は、１箇所の場合もあるし、複数箇所の場合もある。ヒートマップＭｉｊの作成には、例えば、公知のアルゴリズムであるＧｒａｄ－ＣＡＭ（Gradient-weighted Class Activation Mapping）を用いることができる。 For example, the heat map Mij is an image that expresses the certainty of a specific class classification or the depth of association with color intensity. In this case, for example, in the heat map Mij, relatively dark areas represent areas in the sample image Ij focused on by the classifier Ci to classify the state of the article. Alternatively, the heat map Mij is an image in which the pixel value of each pixel is set according to the magnitude of the effect on the output of the classifier Ci. In this case, for example, areas with relatively high pixel values in the heat map Mij represent areas in the sample image Ij that the classifier Ci focused on to classify the state of the article. Hereinafter, in each sample image Ij, the area that the classifier Ci pays attention to for classifying the state of the article is also referred to as the "area of interest" by the classifier Ci. There may be one region of interest by the classifier Ci, or there may be multiple regions. For creating the heat map Mij, for example, Grad-CAM (Gradient-weighted Class Activation Mapping), which is a well-known algorithm, can be used.

なお、作成部１０１は、各分類器Ｃｉに対応するヒートマップＭｉｊの生成を、分類器Ｃｉによる分類の結果が正解情報（人による分類の結果を示す）に一致したサンプル画像Ｉｊについてのみ行う（分類器Ｃｉによる分類の結果が正解情報に一致しないサンプル画像Ｉｌについては行わない）ように構成されていてもよい。 Note that the creating unit 101 generates the heat map Mij corresponding to each classifier Ci only for the sample images Ij whose classification result by the classifier Ci matches correct information (indicating the result of classification by a person) ( The sample image Il for which the result of classification by the classifier Ci does not match the correct information may not be classified).

評価部１０２は、ヒートマップＭｉｊが示す分類器Ｃｉによる着目領域（すなわち、物品の状態を分類するために分類器Ｃｉが着目するサンプル画像Ｉｊ内の領域）と、物品の状態を分類するために検査者が着目するサンプル画像Ｉｊ内の領域との比較に基づいて、各分類器Ｃｉを評価するブロックである。以降、検査者が着目するサンプル画像Ｉｊ内の領域を、検査者による着目領域とも記載する。例えば、検査者による着目領域は、入力装置２０を介して入力される。具体的には、評価部１０２は、サンプル画像Ｉｊをディスプレイに表示し、タッチパッド又はマウス等の操作により指定された領域を、検査者による着目領域として取得してもよい。検査者による着目領域は、１箇所であってもよいし、複数箇所であってもよい。 The evaluation unit 102 divides the region of interest by the classifier Ci indicated by the heat map Mij (that is, the region in the sample image Ij to which the classifier Ci focuses in order to classify the state of the article) and the A block that evaluates each classifier Ci based on comparison with regions in the sample image Ij of interest to the inspector. Hereinafter, the region in the sample image Ij that the inspector pays attention to is also referred to as the region of interest of the inspector. For example, the region of interest by the inspector is input via the input device 20 . Specifically, the evaluation unit 102 may display the sample image Ij on the display, and acquire a region specified by operating a touch pad, mouse, or the like as a region of interest for the inspector. The region of interest by the inspector may be at one point or at a plurality of points.

検査者による着目領域と分類器Ｃｉによる着目領域との類似度が高い場合、その分類器Ｃｉは、汎化能力が高い（或いは、今後の追加学習により汎化能力の向上が期待できる）分類器であると考えられる。逆に、検査者による着目領域と分類器Ｃｉによる着目領域との類似度が低い場合、その分類器Ｃｉは、汎化能力が低い（或いは、今後の追加学習により汎化能力の向上が期待しにくい）分類器であると考えられる。評価部１０２は、このような観点から各分類器Ｃｉを評価する。各分類器Ｃｉに対する評価結果は、良又は否等の２段階で表されていてもよいし、３段階以上で表されていてもよいし、所定範囲の数値として表されていてもよい。 When the region of interest by the inspector and the region of interest by the classifier Ci are highly similar, the classifier Ci is a classifier with high generalization ability (or an improvement in generalization ability can be expected through additional learning in the future). It is considered to be Conversely, when the similarity between the region of interest by the inspector and the region of interest by the classifier Ci is low, the classifier Ci has low generalization ability (or it is expected that the generalization ability will be improved by additional learning in the future). difficult) classifier. The evaluation unit 102 evaluates each classifier Ci from this point of view. The evaluation result for each classifier Ci may be expressed in two stages such as good or bad, may be expressed in three stages or more, or may be expressed as a numerical value within a predetermined range.

例えば、評価部１０２は、分類器Ｃｉによる着目領域の中に、検査者による着目領域と重複する着目領域が含まれている分類器Ｃｉを、そうでない分類器Ｃｋ（ｋ≠ｉ）よりも高く評価する。ここで、２つの領域が重複するとは、これら２つの領域の共通部分が存在することを意味する。以下、分類器Ｃｉによる着目領域のうち、検査者による着目領域と重複する着目領域のことを、重複領域とも記載する。 For example, the evaluation unit 102 ranks a classifier Ci that includes a region of interest that overlaps with the region of interest of the inspector higher than the classifier Ck (k≠i) that does not, among the regions of interest of the classifier Ci. evaluate. Here, two regions overlap means that there is a common portion of these two regions. Hereinafter, among the regions of interest by the classifier Ci, a region of interest that overlaps the region of interest by the inspector is also referred to as an overlapping region.

また、評価部１０２は、分類器Ｃｉによる着目領域の中に、検査者による着目領域と重複する領域と、検査者による着目領域と重複しない領域との両方が含まれている分類器Ｃｉを、そうでない分類器Ｃｋよりも高くする評価する。以下、分類器Ｃｉによる着目領域のうち、検査者による着目領域と重複しない着目領域のことを、独自領域とも記載する。独自領域は、分類器Ｃｉによる着目領域の中で、検査者が着目することのない着目領域、すなわち、分類器Ｃｉが独自に着目する着目領域である。なお、評価部１０２は、分類器Ｃｉによる着目領域のうち、検査者による着目領域と重複しない着目領域を独自領域とみなすか否かを、検査者による入力情報に基づいて決定してもよい。これにより、評価部１０２は、そのような分類器Ｃｉの独自の着目領域のうち、検査者にとって確かにその着目領域もあり得ると考えられる着目領域を、独自領域とみなして動作する。 In addition, the evaluation unit 102 selects a classifier Ci that includes both a region that overlaps with the region of interest of the inspector and a region that does not overlap with the region of interest of the inspector, in the region of interest by the classifier Ci. Evaluate it higher than the classifier Ck that is not. Hereinafter, among the regions of interest by the classifier Ci, a region of interest that does not overlap with the region of interest by the inspector is also referred to as a unique region. The unique region is a region of interest that the inspector does not focus on among the regions of interest by the classifier Ci, that is, a region of interest that the classifier Ci uniquely focuses on. Note that the evaluation unit 102 may determine, based on input information from the inspector, whether or not to consider a region of interest that does not overlap with the region of interest of the inspector, among the regions of interest by the classifier Ci, as a unique region. As a result, the evaluation unit 102 operates by regarding a region of interest, which the examiner believes to be a possible region of interest, among the regions of interest unique to such a classifier Ci as a unique region.

また、評価部１０２は、各サンプル画像Ｉｊに被写体として含まれる物品の状態を分類した結果が、検査者が該物品の状態を分類した結果に一致する割合（以下、「正解率」とも記載する）がより高い分類器Ｃｉを、そうでない分類器Ｃｋよりも高く評価する。この場合、評価部１０２は、各分類器Ｃｉについて、分類器Ｃｉによる着目領域と検査者による着目領域との比較結果と、分類器Ｃｉの正解率とに基づく評価を行うことになる。 In addition, the evaluation unit 102 determines the rate at which the result of classifying the state of the article included as the subject in each sample image Ij matches the result of classifying the state of the article by the inspector (hereinafter, also referred to as the “accuracy rate”). ) higher than classifiers Ck that do not. In this case, the evaluation unit 102 evaluates each classifier Ci based on the result of comparison between the region of interest by the classifier Ci and the region of interest by the inspector, and the accuracy rate of the classifier Ci.

具体的には、評価部１０２には、検査者が該物品の状態を分類した結果が正解情報として、入力装置２０を介して入力される。また、評価部１０２には、各分類器Ｃｉから出力される分類結果が入力される。そして、評価部１０２は、各分類器Ｃｉについて、当該分類器Ｃｉから出力される各サンプル画像Ｉｊの分類結果が正解情報に一致する割合を、正解率として算出する。例えば、評価部１０２は、各分類器Ｃｉについて、該分類器Ｃｉによる着目領域と検査者による着目領域との比較に基づく評価結果に対して、正解率が高いほど評価結果がさらに高くなるような重み付けを行う。 Specifically, the result of classifying the state of the article by the inspector is input to the evaluation unit 102 as correct information through the input device 20 . The evaluation unit 102 also receives the classification result output from each classifier Ci. Then, for each classifier Ci, the evaluation unit 102 calculates, as the correct answer rate, the rate at which the classification result of each sample image Ij output from the classifier Ci matches the correct information. For example, for each classifier Ci, the evaluation unit 102 sets the evaluation result based on the comparison between the region of interest by the classifier Ci and the region of interest by the inspector such that the higher the accuracy rate, the higher the evaluation result. weighting.

なお、評価部１０２は、各分類器Ｃｉについて、該分類器Ｃｉによる着目領域と検査者による着目領域との比較を、当該分類器Ｃｉから出力された分類結果が正解情報に一致したサンプル画像Ｉｊについてのみ行う（一致しないサンプル画像Ｉｌについては比較を行わない）構成を採用してもよい。 For each classifier Ci, the evaluation unit 102 compares the region of interest by the classifier Ci with the region of interest by the inspector, and compares the sample image Ij in which the classification result output from the classifier Ci matches the correct information. (no comparison is performed for non-matching sample images Il) may be employed.

（評価方法）
評価装置１が実行する評価方法Ｓ１ついて、図３～図６を参照して説明する。 (Evaluation method)
The evaluation method S1 executed by the evaluation device 1 will be described with reference to FIGS. 3 to 6. FIG.

図３は、評価方法Ｓ１の流れを示すフローチャートである。評価方法Ｓ１は、図３に示すように、作成ステップＳ１０１と、評価ステップＳ１０２と、を含んでいる。 FIG. 3 is a flow chart showing the flow of the evaluation method S1. The evaluation method S1 includes a creation step S101 and an evaluation step S102, as shown in FIG.

作成ステップＳ１０１は、検査者が状態を分類した物品を被写体として含む画像をサンプル画像Ｉｊとして、各分類器Ｃｉに対応するヒートマップＭｉｊを作成するステップである。上述したように、本実施形態では、各分類器Ｃｉに複数のサンプル画像Ｉ１，Ｉ２，…，Ｉｍが入力され、複数のヒートマップＭｉ１，Ｍｉ２，…，Ｍｉｍが作成される。ここでは、作成部１０１は、分類器Ｃｉによる分類結果が正解情報に一致したサンプル画像Ｉｊについてヒートマップを作成し、一致しなかったサンプル画像Ｉｌ（ｌ≠ｊ）については、ヒートマップを作成しない。 The creation step S101 is a step of creating a heat map Mij corresponding to each classifier Ci by using an image including, as a subject, an article whose state has been classified by an inspector as a sample image Ij. As described above, in the present embodiment, a plurality of sample images I1, I2, . . . , Im are input to each classifier Ci, and a plurality of heat maps Mi1, Mi2, . Here, the creating unit 101 creates a heat map for the sample image Ij whose classification result by the classifier Ci matches the correct information, and does not create a heat map for the sample image Il (l≠j) that does not match. .

評価ステップＳ１０２は、対応するヒートマップＭｉｊが示すサンプル画像Ｉｊ内の領域と、物品の状態を分類するために検査者が着目するサンプル画像Ｉｊ内の領域との比較に基づいて、各分類器Ｃｉを評価するステップである。上述したように、本実施形態では、複数のサンプル画像Ｉ１，Ｉ２，…，Ｉｍの各々について、分類器Ｃｉによる着目領域と検査者による着目領域との比較が行われる。また、各分類器Ｃｉについて、複数のサンプル画像Ｉ１，Ｉ２，…，Ｉｍの分類結果が正解情報と一致した割合である正解率が算出される。そして、各分類器Ｃｉについて、比較結果と正解率とに基づいた評価が実行され、評価結果が出力される。 The evaluation step S102 evaluates each classifier Ci is the step of evaluating As described above, in the present embodiment, the region of interest by the classifier Ci and the region of interest by the inspector are compared for each of the plurality of sample images I1, I2, . . . , Im. Also, for each classifier Ci, a correct answer rate is calculated, which is a rate at which the classification results of the plurality of sample images I1, I2, . Then, each classifier Ci is evaluated based on the comparison result and the accuracy rate, and the evaluation result is output.

図４は、作成ステップＳ１０１において生成されるヒートマップの一例を示す図である。ここでは、各分類器Ｃｉには、サンプル画像Ｉ１，Ｉ２，Ｉ３が入力される。サンプル画像Ｉ１，Ｉ２，Ｉ３には、それぞれ、物品Ｏ１，Ｏ２，Ｏ３が被写体として含まれている。ここで、サンプル画像Ｉ１，Ｉ２，Ｉ３の正解情報は、それぞれ、クラスＡ、クラスＢ、クラスＣである。 FIG. 4 is a diagram showing an example of the heat map generated in the creation step S101. Here, sample images I1, I2, and I3 are input to each classifier Ci. Sample images I1, I2, and I3 include articles O1, O2, and O3 as subjects, respectively. Here, the correct information of the sample images I1, I2, and I3 are class A, class B, and class C, respectively.

分類器Ｃ１，Ｃ２によるサンプル画像Ｉ１の分類結果は、それぞれクラスＡで正解であり、分類器Ｃ１に対応するヒートマップＭ１１と、分類器Ｃ２に対応するヒートマップＭ２１とが作成される。 The classification results of the sample image I1 by the classifiers C1 and C2 are correct in class A, respectively, and a heat map M11 corresponding to the classifier C1 and a heat map M21 corresponding to the classifier C2 are created.

分類器Ｃ１，Ｃ２によるサンプル画像Ｉ２の分類結果は、それぞれクラスＢで正解であり、分類器Ｃ１に対応するヒートマップＭ１２と、分類器Ｃ２に対応するヒートマップＭ２２とが作成される。 The classification results of the sample image I2 by the classifiers C1 and C2 are correct in class B, respectively, and a heat map M12 corresponding to the classifier C1 and a heat map M22 corresponding to the classifier C2 are created.

分類器Ｃ１によるサンプル画像Ｉ３の分類結果は、クラスＢで不正解であり、分類器Ｃ１に対応するヒートマップは作成されない。分類器Ｃ２によるサンプル画像Ｉ３の分類結果は、クラスＣで正解であり、分類器Ｃ２に対応するヒートマップＭ２３が作成される。 The classification result of the sample image I3 by the classifier C1 is an incorrect answer in class B, and a heat map corresponding to the classifier C1 is not created. The classification result of the sample image I3 by the classifier C2 is correct in class C, and a heat map M23 corresponding to the classifier C2 is created.

評価ステップＳ１０２における比較処理の一例を、図５～図６を用いて説明する。 An example of comparison processing in the evaluation step S102 will be described with reference to FIGS. 5 and 6. FIG.

図５（ａ），（ｂ），（ｃ）は、サンプル画像Ｉ１，Ｉ２，Ｉ３における、検査者による着目領域Ｒ１，Ｒ２，Ｒ３を示す模式図である。 FIGS. 5A, 5B, and 5C are schematic diagrams showing regions of interest R1, R2, and R3 of the inspector in the sample images I1, I2, and I3.

図５（ｄ）は、上述したヒートマップＭ１１が示す着目領域を示す。ヒートマップＭ１１は、２つの着目領域Ｐ１１，Ｐ１２を含んでいる。分類器Ｃ１による着目領域Ｐ１１，Ｐ１２の中で、着目領域Ｐ１１は、検査者による着目領域Ｒ１と重複する重複領域である。また、分類器Ｃ１による着目領域Ｐ１１，Ｐ１２の中で、着目領域Ｐ１２は、検査者による着目領域Ｒ１と重複していない独自領域である。 FIG. 5(d) shows the region of interest indicated by the heat map M11 described above. The heat map M11 includes two regions of interest P11 and P12. Among the regions of interest P11 and P12 by the classifier C1, the region of interest P11 is an overlapping region that overlaps the region of interest R1 by the inspector. Among the regions of interest P11 and P12 by the classifier C1, the region of interest P12 is a unique region that does not overlap with the region of interest R1 by the inspector.

図５（ｅ）は、上述したヒートマップＭ１２が示す着目領域を示す。ヒートマップＭ１２は、２つの着目領域Ｐ２１，Ｐ２２を含んでいる。分類器Ｃ１による着目領域Ｐ２１，Ｐ２２の中で、着目領域Ｐ２１は、検査者による着目領域Ｒ２と重複する重複領域である。また、分類器Ｃ１による着目領域Ｐ２１，Ｐ２２の中で、着目領域Ｐ２２は、検査者による着目領域Ｒ２と重複していない独自領域である。 FIG. 5(e) shows the region of interest indicated by the heat map M12 described above. The heat map M12 includes two regions of interest P21 and P22. Among the regions of interest P21 and P22 by the classifier C1, the region of interest P21 is an overlapping region that overlaps the region of interest R2 by the inspector. Among the regions of interest P21 and P22 by the classifier C1, the region of interest P22 is a unique region that does not overlap with the region of interest R2 by the inspector.

図５（ｆ）は、上述したヒートマップＭ２１が示す着目領域を示す。ヒートマップＭ２１は、１つの着目領域Ｑ１を含んでいる。分類器Ｃ２による着目領域Ｑ１は、検査者による着目領域Ｒ１と重複していない。 FIG. 5(f) shows the region of interest indicated by the heat map M21 described above. The heat map M21 includes one region of interest Q1. The region of interest Q1 by the classifier C2 does not overlap with the region of interest R1 by the inspector.

図５（ｇ）は、上述したヒートマップＭ２２が示す着目領域を示す。ヒートマップＭ２２は、１つの着目領域Ｑ２を含んでいる。分類器Ｃ２による着目領域Ｑ２は、検査者による着目領域Ｒ２と重複していない。 FIG. 5(g) shows the region of interest indicated by the heat map M22 described above. The heat map M22 includes one region of interest Q2. The region of interest Q2 by the classifier C2 does not overlap with the region of interest R2 by the inspector.

図５（ｈ）は、上述したヒートマップＭ２３が示す着目領域を示す。ヒートマップＭ２３は、１つの着目領域Ｑ３を含んでいる。分類器Ｃ２による着目領域Ｑ３は、検査者による着目領域Ｒ３と重複する重複領域である。 FIG. 5(h) shows the region of interest indicated by the heat map M23 described above. The heat map M23 includes one region of interest Q3. The region of interest Q3 by the classifier C2 is an overlapping region that overlaps the region of interest R3 by the inspector.

図６は、各分類器Ｃｉについて、正解／不正解、重複領域の有無、及び独自領域の有無を、サンプル画像Ｉｊ毎に整理した表であり、各分類器Ｃｉの評価結果を示している。なお、ヒートマップＭｉｊが示す着目領域の中に重複領域が無い場合には、独自領域の有無については記載していない。 FIG. 6 is a table in which correct/incorrect answers, presence/absence of overlapping regions, and presence/absence of unique regions for each classifier Ci are arranged for each sample image Ij, and shows evaluation results of each classifier Ci. Note that if there is no overlapping area in the focused area indicated by the heat map Mij, the presence or absence of the unique area is not described.

図６の例では、評価部１０２は、ｎ個の分類器Ｃｉを、正解率、重複領域率、独自領域率に基づいて比較した結果に基づいて、各分類器Ｃｉの評価結果を生成している。ここで、重複領域率とは、サンプル画像Ｉｊの総数ｍに対して、分類器Ｃｉが重複領域を検出したサンプル画像Ｉｊの数の割合である。また、独自領域率とは、サンプル画像Ｉｊの総数に対して、分類器Ｃｉが独自領域を検出したサンプル画像Ｉｊの数の割合である。また、この例では、「良」または「否」の２段階の評価結果が生成されている。 In the example of FIG. 6, the evaluation unit 102 generates the evaluation result of each classifier Ci based on the result of comparing the n classifiers Ci based on the accuracy rate, overlapping area rate, and unique area rate. there is Here, the overlapping area rate is the ratio of the number of sample images Ij for which the classifier Ci has detected overlapping areas to the total number m of sample images Ij. The unique region rate is the ratio of the number of sample images Ij in which the classifier Ci has detected unique regions to the total number of sample images Ij. Also, in this example, a two-level evaluation result of "good" or "bad" is generated.

また、評価部１０２は、重複領域率を、正解率および独自領域率よりも重視した評価処理を行う。例えば、評価部１０２は、ｎ個の中で重複領域率が高いとの条件（例えば、最も高い）を満たす分類器Ｃｉのうち、正解率が高いとの条件（例えば、閾値以上）を満たす分類器Ｃｉの評価結果を「良」とする。そして、評価部１０２は、当該評価結果を「良」とした分類器Ｃｉとの比較により他の分類器Ｃｋの評価を行う。 In addition, the evaluation unit 102 performs evaluation processing that emphasizes the overlapping area rate more than the correct answer rate and the unique area rate. For example, the evaluation unit 102 selects a classifier Ci that satisfies a condition of a high accuracy rate (e.g., a threshold value or more) among the classifiers Ci that satisfy a condition of a high overlapping area rate (e.g., the highest rate) among n classifiers. Assume that the evaluation result of the device Ci is "good". Then, the evaluation unit 102 evaluates the other classifiers Ck by comparing with the classifier Ci for which the evaluation result is “good”.

例えば、分類器Ｃ１の重複領域率６０％は、ｎ個の中で最も高い。また、分類器Ｃ１の正解率７０％は、閾値（例えば、７０％）以上である。そこで、評価部１０２は、分類器Ｃ１の評価結果を「良」とする。 For example, the overlapping area rate of 60% for the classifier C1 is the highest among n. Also, the accuracy rate of 70% of the classifier C1 is equal to or higher than the threshold (for example, 70%). Therefore, the evaluation unit 102 sets the evaluation result of the classifier C1 as "good".

分類器Ｃ２は、分類器Ｃ１と比較して、重複領域率６０％が同じであるが、正解率８０％が高い一方で、独自領域率１０％が低い。この場合、分類器Ｃ２は、分類器Ｃ１と比較して優れているとも劣っているともいえず、同等であると判定される。したがって、評価部１０２は、分類器Ｃ２の評価結果を「良」とする。 The classifier C2 has the same overlapping area rate of 60% as the classifier C1, but has a higher accuracy rate of 80% and a lower unique area rate of 10%. In this case, classifier C2 is neither superior nor inferior to classifier C1, and is determined to be equivalent. Therefore, the evaluation unit 102 sets the evaluation result of the classifier C2 as "good".

また、分類器Ｃ３は、分類器Ｃ１と比較して、重複領域率６０％が同じであるが、正解率６０％および独自領域率０％ともに低い。この場合、分類器Ｃ３は、分類器Ｃ１と比較して劣っていると判定される。したがって、評価部１０２は、分類器Ｃ３の評価結果を「否」とする。 Also, the classifier C3 has the same overlapping area rate of 60% as the classifier C1, but both the accuracy rate of 60% and the unique area rate of 0% are lower. In this case, classifier C3 is determined to be inferior compared to classifier C1. Therefore, the evaluation unit 102 sets the evaluation result of the classifier C3 as "no".

また、分類器Ｃ４は、分類器Ｃ１と比較して、重複領域率６０％および正解率７０％が同じであるが、独自領域率２０％が低い。この場合、分類器Ｃ４は、分類器Ｃ１と比較して劣っていると判定される。したがって、評価部１０２は、分類器Ｃ４の評価結果を「否」とする。 Also, the classifier C4 has the same overlapping area rate of 60% and correct answer rate of 70% as the classifier C1, but has a lower unique area rate of 20%. In this case, classifier C4 is determined to be inferior compared to classifier C1. Therefore, the evaluation unit 102 sets the evaluation result of the classifier C4 as "no".

また、分類器Ｃ５は、分類器Ｃ１と比較して、重複領域率６０％および独自領域率３０％が同じであるが、正解率６０％が低い。この場合、分類器Ｃ５は、分類器Ｃ１と比較して劣っていると判定される。したがって、評価部１０２は、分類器Ｃ５の評価結果を「否」とする。 Also, the classifier C5 has the same overlapping area rate of 60% and unique area rate of 30% as the classifier C1, but has a lower accuracy rate of 60%. In this case, classifier C5 is determined to be inferior compared to classifier C1. Therefore, the evaluation unit 102 sets the evaluation result of the classifier C5 as "no".

また、分類器Ｃ６は、分類器Ｃ１と比較して、重複領域率５０％が低く、正解率７０％が同じであり、独自領域率４０％が高い。この場合、重複領域率が低いことを、独自領域率が高いことより重視するため、分類器Ｃ６は、分類器Ｃ１と比較して劣っていると判定される。したがって、評価部１０２は、分類器Ｃ６の評価結果を「否」とする。 Further, the classifier C6 has a lower overlapping area rate of 50%, the same correct answer rate of 70%, and a higher unique area rate of 40% than the classifier C1. In this case, since a low overlapping area rate is more important than a high unique area rate, the classifier C6 is determined to be inferior to the classifier C1. Therefore, the evaluation unit 102 sets the evaluation result of the classifier C6 as "no".

また、分類器Ｃｎは、重複領域率３０％が極端に低いとの条件（例えば、閾値以下）を満たす。この場合、評価部１０２は、分類器Ｃｎを他の分類器Ｃｉと比較することなく、分類器Ｃｎの評価結果を「否」とする。 Also, the classifier Cn satisfies the condition that the overlapping area rate of 30% is extremely low (for example, less than or equal to the threshold). In this case, the evaluation unit 102 evaluates the classifier Cn as "no" without comparing the classifier Cn with other classifiers Ci.

（評価装置１の効果）
本実施形態に係る評価装置１は、分類器Ｃｉが着目する領域の中に、検査者が着目する領域と重複する重複領域が含まれる該分類器Ｃｉを、そうでない分類器Ｃｋより高く評価する。また、本実施形態に係る評価装置１は、分類器Ｃｉが着目する領域の中に、検査者が着目する領域と重複しない独自領域がさらに含まれる該分類器Ｃｉを、そうでない分類器Ｃｋよりさらに高く評価する。したがって、本実施形態に係る評価装置１は、各分類器Ｃｉを、当該分類器Ｃｉが着目する領域に関する検査者が着目する領域との類似性の観点から、評価することができる。 (Effect of evaluation device 1)
The evaluation apparatus 1 according to the present embodiment evaluates a classifier Ci that includes an overlap region that overlaps the region focused by the inspector higher than the classifier Ck that does not, among the regions targeted by the classifier Ci. . In addition, the evaluation apparatus 1 according to the present embodiment classifies the classifier Ci, which further includes a unique region that does not overlap with the region focused by the inspector, from the classifier Ck that does not, in the region focused by the classifier Ci. appreciate it even more. Therefore, the evaluation apparatus 1 according to the present embodiment can evaluate each classifier Ci from the viewpoint of the similarity between the region targeted by the classifier Ci and the region targeted by the inspector.

（変形例１）
なお、本実施形態に係る評価方法Ｓ１では、評価部１０２が、検査者による着目領域を入力として、分類器による着目領域及び検査者による着目領域を比較するものとして説明した。これに限らず、評価方法Ｓ１では、分類器による着目領域及び検査者による着目領域の比較を、検査者が行ってもよい。この場合、評価部１０２は、作成部１０１によって生成されたヒートマップが示す、分類器による着目領域をディスプレイ等に表示する。また、評価部１０２は、ディスプレイに表示された着目領域を視認した検査者による上述した比較結果を表す情報を、入力装置３０を介して取得する。 (Modification 1)
In the evaluation method S1 according to the present embodiment, the evaluation unit 102 receives the region of interest by the inspector as input, and compares the region of interest by the classifier and the region of interest by the inspector. Not limited to this, in the evaluation method S1, the inspector may compare the region of interest by the classifier and the region of interest by the inspector. In this case, the evaluation unit 102 displays, on a display or the like, the region of interest by the classifier indicated by the heat map generated by the generation unit 101 . In addition, the evaluation unit 102 acquires, via the input device 30, information representing the above-described comparison result by the inspector who visually recognized the region of interest displayed on the display.

（変形例２）
また、本実施形態においては、複数の分類器Ｃ１，Ｃ２，…，Ｃｎが、評価装置１の内部に含まれている（評価装置１と同じコンピュータで実行されている）構成を採用しているが、本発明は、これに限定されない。例えば、複数の分類器Ｃ１，Ｃ２，…，Ｃｎの一部又は全部が評価装置１の内部に含まれていない（評価装置１と異なるコンピュータで実行される）構成を採用しても構わない。すなわち、複数の分類器Ｃ１，Ｃ２，…，Ｃｎは、評価装置１の必須の構成要素ではない。 (Modification 2)
Further, in this embodiment, a configuration is adopted in which a plurality of classifiers C1, C2, . However, the invention is not so limited. For example, a configuration in which some or all of the plurality of classifiers C1, C2, . That is, the plurality of classifiers C1, C2, . . . , Cn are not essential components of the evaluation device 1.

〔実施形態２〕
本実施形態では、実施形態１に係る評価装置１を用いて、物品の検査を行う検査装置２を構成する実施形態について説明する。 [Embodiment 2]
In this embodiment, the evaluation apparatus 1 according to the first embodiment is used to configure an inspection apparatus 2 that inspects articles.

図７は、検査装置２の機能的構成を示すブロック図である。なお、検査装置２の物理的構成については、図１を参照して説明した評価装置１と同様であるため、詳細な説明を省略する。 FIG. 7 is a block diagram showing the functional configuration of the inspection device 2. As shown in FIG. Since the physical configuration of the inspection device 2 is the same as that of the evaluation device 1 described with reference to FIG. 1, detailed description thereof will be omitted.

図７に示すように、検査装置２は、評価装置１と、選択部２０３とを含む。検査装置２は、選択部２０３により選択された分類器Ｃｉを用いて、物品の検査を行う。 As shown in FIG. 7 , the inspection device 2 includes an evaluation device 1 and a selection section 203 . The inspection device 2 uses the classifier Ci selected by the selection unit 203 to inspect the article.

選択部２０３は、評価装置１による評価結果に基づいて、複数の分類器Ｃ１，Ｃ２，…，Ｃｎから何れかの分類器Ｃｉを選択するブロックである。例えば、選択部２０３は、各分類器Ｃｉのうち、評価結果が所定条件を満たす（例えば、評価結果が「良」である）ものを１つ以上選択してもよい。 The selection unit 203 is a block that selects one of the classifiers Ci from the plurality of classifiers C1, C2, . For example, the selection unit 203 may select one or more classifiers Ci whose evaluation results satisfy a predetermined condition (for example, the evaluation results are "good").

具体的には、選択部２０３は、検査装置２に対して入力される画像が、選択した分類器Ｃｉに入力され、選択した分類器Ｃｉからの出力が、検査結果として外部に出力されるよう、切り替え処理を行う。選択部２０３による切り替え処理は、評価装置１による評価が完了した後、検査装置２によって物品の検査を行う運用が開始される前に行われる。また、検査装置２の運用が開始する前に、選択された１つ以上の分類器Ｃｉを追加学習させる処理が、さらに実行されてもよい。 Specifically, the selection unit 203 is configured so that the image input to the inspection apparatus 2 is input to the selected classifier Ci, and the output from the selected classifier Ci is output to the outside as the inspection result. , switching processing is performed. The switching process by the selection unit 203 is performed after the evaluation by the evaluation device 1 is completed and before the operation of inspecting the article by the inspection device 2 is started. Further, a process of additionally learning the selected one or more classifiers Ci may be further executed before the operation of the inspection apparatus 2 is started.

〔実施例１〕
検査装置２を用いた実施例１について、図９を参照して説明する。実施例１では、物品としての圧着端子を被写体として含むサンプル画像Ｉ１を用いて、評価装置１による評価を行った。 [Example 1]
Example 1 using the inspection device 2 will be described with reference to FIG. In Example 1, evaluation was performed by the evaluation device 1 using a sample image I1 including a crimp terminal as an article as a subject.

各分類器Ｃｉは、圧着端子を被写体として含む画像に基づいて、該圧着端子の状態を次の８状態の何れかに分類するよう、事前の機械学習により構築されている。 Each classifier Ci is constructed by prior machine learning so as to classify the state of the crimp terminal into one of the following eight states based on the image including the crimp terminal as an object.

状態１：良品
状態２：浅打ち
状態３：深打ち
状態４：バレルめくれ
状態５：トランジション部へのはみだし
状態６：インス側へのはみ出し
状態７：芯線切れ
状態８：ベルマウス不良
図８（ａ）は、サンプル画像Ｉ１における検査者による着目領域を示す図である。この実施例では、検査者は、圧着端子のバレルを含む領域に着目している。 Condition 1: Good condition Condition 2: Shallow stroke Condition 3: Deep stroke Condition 4: Barrel curled Condition 5: Protrusion to the transition part Condition 6: Protrusion to the inset side Condition 7: Wire breakage Condition 8: Bad bell mouth Fig. 8 (a) ) is a diagram showing a region of interest by an inspector in the sample image I1. In this example, the inspector focuses on the area containing the barrel of the crimp terminal.

本実施形態において、分類器Ｃ１及びＣ２は、何れも、上述したサンプル画像Ｉ１に被写体として含まれる圧着端子を、「状態４：バレルめくれ」として分類し、検査者が状態を分類した分類結果に一致する分類結果を出力した。 In the present embodiment, the classifiers C1 and C2 both classify the crimp terminal included as a subject in the sample image I1 described above as "state 4: barrel turn-up", and the classification result of the state classified by the inspector is Outputs matching classification results.

図８（ｂ）は、分類器Ｃ１による２つの着目領域を示す図である。ここでは、分類器Ｃ１による一方の着目領域は、検査者による着目領域と重複する重複領域である。また、他方の着目領域は、検査者による着目領域と重複しない独自領域である。 FIG. 8(b) is a diagram showing two regions of interest by the classifier C1. Here, one region of interest by the classifier C1 is an overlapping region that overlaps the region of interest by the inspector. The other region of interest is a unique region that does not overlap with the region of interest of the inspector.

図８（ｃ）は、分類器Ｃ２による１つの着目領域を示す図である。ここでは、分類器Ｃ２による着目領域は、検査者による着目領域と重複していない。 FIG. 8(c) is a diagram showing one region of interest by the classifier C2. Here, the region of interest by the classifier C2 does not overlap with the region of interest by the inspector.

図９は、分類器Ｃ２の評価について説明する図である。図９に示すように、評価装置１は、分類器Ｃ２による分類結果が検査者による分類結果と一致したものの、分類器Ｃ２による着目領域の中に重複領域が含まれないので、含まれる分類器Ｃ１より評価を低くする。このような分類器Ｃ２は、検査者による基準との関係が薄い画像箇所に基づいて、偶然に正しい分類結果を得た可能性があり、追加学習を行っても正解率が向上しない可能性がある。検査装置２は、そのような分類器Ｃ２を選択することなく、追加学習に適した分類器Ｃ１を用いて、物品の検査を行うことができる。 FIG. 9 is a diagram explaining the evaluation of the classifier C2. As shown in FIG. 9, although the classification result by the classifier C2 matches the classification result by the inspector, the evaluation apparatus 1 does not include an overlapping region in the region of interest by the classifier C2. Lower the evaluation than C1. Such a classifier C2 may have obtained a correct classification result by chance based on an image portion having a weak relationship with the examiner's criteria, and there is a possibility that the accuracy rate will not improve even if additional learning is performed. be. The inspection device 2 can inspect the article using the classifier C1 suitable for additional learning without selecting such a classifier C2.

〔付記事項〕
本発明は上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 [Additional notes]
The present invention is not limited to the above-described embodiments, but can be modified in various ways within the scope of the claims, and can be obtained by appropriately combining technical means disclosed in different embodiments. is also included in the technical scope of the present invention.

１評価装置
２検査装置
１０バス
１１主メモリ
１２プロセッサ
１３補助メモリ
１４入出力インターフェース
２０入力装置
３０出力装置
Ｃ１，Ｃ２，…，Ｃｎ，Ｃｉ分類器
１０１作成部
１０２評価部
２０３選択部 1 evaluation device 2 inspection device 10 bus 11 main memory 12 processor 13 auxiliary memory 14 input/output interface 20 input device 30 output device C1, C2, .

Claims

An evaluation device for evaluating each of a plurality of classifiers that refer to an image containing an article as a subject and perform classification processing for classifying the state of the article according to an algorithm determined by learning,
a creation unit that creates a heat map corresponding to each classifier, which indicates an area that the classifier pays attention to in the classification process in a sample image including, as a subject, an article whose state has been classified by a person;
an evaluation unit that evaluates each classifier based on a comparison of the area in the sample image indicated by the corresponding heat map and the area in the sample image that the person focuses on to classify the state of the article. ing,
An evaluation device characterized by:

The evaluation unit selects a classifier that includes, among the regions in the sample image indicated by the corresponding heat map, a region that overlaps with the region in the sample image that the person focuses on in order to classify the state of the article. , rate higher than the classifier not included,
The evaluation device according to claim 1, characterized by:

The evaluation unit selects a classifier that includes an area that does not overlap with the area in the sample image that the person focuses on in order to classify the state of the article, in the area in the sample image indicated by the corresponding heat map. , evaluates more highly than classifiers that do not contain
3. The evaluation device according to claim 2, characterized in that:

The creation unit creates a heat map corresponding to each classifier for each sample image, using each of a plurality of images including articles classified by a person as a subject as a sample image,
The evaluation unit evaluates each classifier based on the evaluation result of evaluating each classifier for each sample image,
4. The evaluation device according to any one of claims 1 to 3, characterized in that:

The evaluation unit selects, from among the plurality of classifiers, a classifier having a higher rate that the result of classifying the state of the article included as the subject in each sample image matches the result of the classification of the state of the article by the person. further appreciate the
5. The evaluation device according to claim 4, characterized in that:

An evaluation method for evaluating each of a plurality of classifiers that refer to an image containing an article as a subject and perform classification processing for classifying the state of the article according to an algorithm determined by learning,
a creation step of creating a heat map corresponding to each classifier, the heat map indicating an area of interest to the classifier in the classification process in a sample image including, as a subject, an article whose state has been classified by a person;
an evaluation step of evaluating each classifier based on a comparison of the area in the sample image indicated by the corresponding heat map to the area in the sample image that the person focuses on to classify the condition of the article. ing,
An evaluation method characterized by

6. An evaluation program that causes a computer to operate as the evaluation apparatus according to any one of claims 1 to 5, wherein the computer functions as each part of the evaluation apparatus.

An inspection device for inspecting an article,
the plurality of classifiers;
an evaluation device according to any one of claims 1 to 5;
a selection unit that selects one of the plurality of classifiers based on the evaluation result of the evaluation device;
An inspection device that inspects the article using the classifier selected by the selection unit.