JP6489005B2

JP6489005B2 - Information processing system, information processing method, and program

Info

Publication number: JP6489005B2
Application number: JP2015247373A
Authority: JP
Inventors: 克繁中田
Original assignee: Canon Marketing Japan Inc; Canon IT Solutions Inc
Current assignee: Canon Marketing Japan Inc; Canon IT Solutions Inc
Priority date: 2015-12-18
Filing date: 2015-12-18
Publication date: 2019-03-27
Anticipated expiration: 2035-12-18
Also published as: JP2017111731A

Description

本発明は、情報処理システム、情報処理方法、プログラムに関する。 The present invention relates to an information processing system, an information processing method, and a program.

ここ数年、教師あり学習による画像認識技術として、ディープラーニングの研究が盛んにおこなわれている。ディープラーニングは、従来からあるニューラルネットワークを多層に重ねた機械学習の一つであり、学習用画像を用いて事前に学習させることで、画像分類などの問題を解決する。 In recent years, research on deep learning has been actively conducted as an image recognition technique based on supervised learning. Deep learning is one type of machine learning in which conventional neural networks are stacked in multiple layers, and solves problems such as image classification by learning in advance using learning images.

ディープラーニングによる画像認識で精度を高めるためには、学習用画像の確保、選定が重要になる。学習用画像は、事前学習や認識精度向上のための追加学習として使用するために、画像とそのコンテンツ情報をデータセットとして大量に用意しなければならない。例えば、「猫」というデータセットであれば、猫が写る画像とそのコンテンツ情報（品種名など。ラベル。）を対として、数百セットもしくはそれ以上用意する必要がある。 In order to improve accuracy by image recognition by deep learning, it is important to secure and select learning images. In order to use learning images as additional learning for advance learning or recognition accuracy improvement, a large amount of images and their content information must be prepared as a data set. For example, in the case of a data set of “cat”, it is necessary to prepare several hundred sets or more of a pair of an image showing a cat and its content information (such as breed name, label).

しかし、上記のようなデータセットを用意するには、非常に作業コストがかかることが多い。その理由としては、大量の画像を用意すること、それらのコンテンツ情報を対応付ける必要があること、そしてそのコンテンツ情報の対応付けは基本的に手動でおこなわなければならないことが挙げられる。 However, it is often very expensive to prepare such a data set. The reason for this is that a large amount of images are prepared, the content information needs to be associated, and the association of the content information must be performed manually.

以上のことから、ディープラーニングの事前・追加学習用画像の確保、選定が大きな課題の一つとなっており、それらの作成コストの軽減が強く望まれている。 From the above, securing and selecting images for pre-learning and additional learning for deep learning is one of the major issues, and reduction of their creation costs is strongly desired.

特許文献１には、画像・映像類似検索に使用するための学習辞書の質を向上させるために、追加画像の候補を選定する技術について記載がされている。 Patent Document 1 describes a technique for selecting additional image candidates in order to improve the quality of a learning dictionary used for image / video similarity search.

特開２０１３−２５０８８１号公報JP 2013-250881 A

特許文献１の技術では、学習辞書である画像群の平均特徴量と、候補画像の特徴量から類似度を算出し、その類似度が閾値よりも大きい場合に追加画像として選定する。 In the technique of Patent Document 1, the similarity is calculated from the average feature amount of the image group that is a learning dictionary and the feature amount of the candidate image, and is selected as an additional image when the similarity is larger than a threshold value.

この方法によると、学習辞書が本来持っていた特徴量の傾向を変化させることなく、学習辞書の画像数を増やすことができるが、平均特徴量と傾向が異なる画像を候補として選定することはできない。 According to this method, the number of images in the learning dictionary can be increased without changing the tendency of the feature amount originally possessed by the learning dictionary, but an image having a tendency different from the average feature amount cannot be selected as a candidate. .

そのため、ディープラーニング等を用いた画像認識をおこなった際の、特徴量の傾向が異なることに起因する誤検出結果へのフィードバックをおこなうようなシステムには適用できない。 Therefore, it cannot be applied to a system that performs feedback to a false detection result caused by a difference in feature amount tendency when image recognition using deep learning or the like is performed.

そこで、本発明では、画像認識における誤検出結果へのフィードバックを目的とした、追加学習の候補となる画像を選定できる仕組みを提供することを目的とする。 Therefore, an object of the present invention is to provide a mechanism that can select an image as a candidate for additional learning for the purpose of feedback to an erroneous detection result in image recognition.

本発明の情報処理システムは、教師あり画像分類器により検証用画像の分類をする情報処理システムであって、検証用画像の入力をする画像入力手段と、前記画像入力手段により入力された検証用画像と類似する画像を抽出する類似画像抽出手段と、前記類似画像抽出手段により抽出した画像にラベルを付与することで教師データを作成する教師データ作成手段と、前記教師データ作成手段により作成された教師データを用いて、前記教師あり画像分類器を学習させる学習手段と、を備えることを特徴とする。 An information processing system according to the present invention is an information processing system for classifying verification images by a supervised image classifier, and includes an image input means for inputting a verification image, and a verification input input by the image input means. Similar image extraction means for extracting an image similar to an image, teacher data creation means for creating teacher data by giving a label to the image extracted by the similar image extraction means, and created by the teacher data creation means Learning means for learning the supervised image classifier using teacher data.

また、本発明の情報処理方法は教師あり画像分類器により検証用画像の分類をする情報処理システムにおける情報処理方法であって、検証用画像の入力をする画像入力工程と、前記画像入力工程により入力された検証用画像と類似する画像を抽出する類似画像抽出工程と、前記類似画像抽出工程により抽出した画像にラベルを付与することで教師データを作成する教師データ作成工程と、前記教師データ作成工程により作成された教師データを用いて、前記教師あり画像分類器を学習させる学習工程と、を備えることを特徴とする。 The information processing method of the present invention is an information processing method in an information processing system for classifying verification images by a supervised image classifier, and includes an image input step for inputting verification images, and the image input step. A similar image extraction step for extracting an image similar to the input verification image, a teacher data creation step for creating teacher data by giving a label to the image extracted by the similar image extraction step, and the teacher data creation And a learning step of learning the supervised image classifier using the teacher data created in the step.

また、本発明のプログラムは、教師あり画像分類器により検証用画像の分類をする情報処理システムにおいて実行可能なプログラムであって、前記情報処理システムを、検証用画像の入力をする画像入力手段と、前記画像入力手段により入力された検証用画像と類似する画像を抽出する類似画像抽出手段と、前記類似画像抽出手段により抽出した画像にラベルを付与することで教師データを作成する教師データ作成手段と、前記教師データ作成手段により作成された教師データを用いて、前記教師あり画像分類器を学習させる学習手段として機能させることを特徴とする。 The program of the present invention is a program that can be executed in an information processing system that classifies verification images by a supervised image classifier, and the information processing system includes image input means that inputs verification images. Similar image extraction means for extracting an image similar to the verification image input by the image input means, and teacher data creation means for creating teacher data by assigning a label to the image extracted by the similar image extraction means And using the teacher data created by the teacher data creating means to function as a learning means for learning the supervised image classifier.

本発明によれば、画像認識における誤検出結果へのフィードバックを目的とした、追加学習の候補となる画像を選定することが可能となる。 According to the present invention, it is possible to select an image as a candidate for additional learning for the purpose of feedback to an erroneous detection result in image recognition.

本発明の情報処理システムの構成を示す図The figure which shows the structure of the information processing system of this invention 図１に示した操作端末ＰＣ１０１、アプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師なし画像分類サーバ１０４、画像データベース１０５に適用可能な情報処理装置のハードウェア構成を示すブロック図1 is a block diagram showing the hardware configuration of an information processing apparatus applicable to the operation terminal PC 101, application server 102, supervised image classification server 103, unsupervised image classification server 104, and image database 105 shown in FIG. 操作端末ＰＣ１０１、アプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師なし画像分類サーバ１０４、画像データベース１０５の機能を説明するための機能ブロック図Functional block diagram for explaining the functions of the operation terminal PC 101, application server 102, supervised image classification server 103, unsupervised image classification server 104, and image database 105 教師なし画像分類器を用いた教師あり画像分類器への学習用画像の選定処理を示すフローチャートFlowchart showing selection of learning image to supervised image classifier using unsupervised image classifier 類似画像取得処理（ステップＳ４０９）を示すフローチャートFlowchart showing similar image acquisition processing (step S409) 画像データの管理レコード情報テーブルのデータ構成例Data structure example of image data management record information table ステップＳ４１１において操作端末ＰＣ１０１に表示される画面の一例An example of a screen displayed on operation terminal PC101 in step S411 ステップＳ４０５において操作端末ＰＣ１０１に表示される画面の一例An example of a screen displayed on operation terminal PC101 in step S405

以下、図面を参照して、本発明の実施形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明の追加学習用画像の選定支援機能を適用した、情報処理システムの構成の一例を示すシステム構成図である。 FIG. 1 is a system configuration diagram showing an example of the configuration of an information processing system to which the additional learning image selection support function of the present invention is applied.

情報処理システムは、１または複数の操作端末ＰＣ１０１、１または複数のアプリケーションサーバ１０２、１または複数の教師あり画像分類サーバ１０３、１または複数の教師なし画像分類サーバ１０４、１または複数の画像データベース１０５が、ネットワークを介して接続される構成となっている。 The information processing system includes one or more operation terminals PC101, one or more application servers 102, one or more supervised image classification servers 103, one or more unsupervised image classification servers 104, and one or more image databases 105. Are connected via a network.

なお、本実施形態においてはアプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師あり画像分類サーバ１０４、データベース１０５をそれぞれ別の筐体として説明しているが、同じ筐体にこれらのサーバ、データベースの機能を備えるように構成しても良い。 In this embodiment, the application server 102, the supervised image classification server 103, the supervised image classification server 104, and the database 105 are described as separate housings. However, these servers and databases are stored in the same housing. You may comprise so that a function may be provided.

操作端末ＰＣ１０１は、画像分類、画像分類器学習、候補画像登録の各操作をおこなうための端末であり、ネットワーク上に存在する。画像の分類と学習の操作にはウェブ操作画面（ウェブブラウザ上で操作）を使用し、ネットワークを介してアプリケーションサーバ１０２に接続する。 The operation terminal PC101 is a terminal for performing each operation of image classification, image classifier learning, and candidate image registration, and exists on the network. A web operation screen (operated on a web browser) is used for image classification and learning operations, and connected to the application server 102 via a network.

アプリケーションサーバ１０２は、画像分類、画像分類器学習、候補画像登録の制御をおこなうためのサーバであり、ネットワーク上に構築されている。アプリケーションサーバ１０２へは、操作端末ＰＣ１０１が接続する。アプリケーションサーバ１０２は、操作端末ＰＣ１０１から画像分類操作が行われた場合は教師あり画像分類サーバ１０３へ画像を送信し、分類結果を受信する。操作端末ＰＣ１０１から分類器の学習操作が行われた場合は、教師なし画像分類サーバ１０４へ画像を送信して学習候補となる類似画像クラスを受信し、そのクラス情報をもとに画像データベース１０５から学習に使用する画像データを取得し、取得した画像データを教師あり画像分類サーバ１０３へ送信する。操作端末ＰＣ１０１から候補画像登録操作が行われた場合は画像データベース１０５へ画像データを送信する。 The application server 102 is a server for controlling image classification, image classifier learning, and candidate image registration, and is constructed on a network. The operation terminal PC 101 is connected to the application server 102. When an image classification operation is performed from the operation terminal PC 101, the application server 102 transmits an image to the supervised image classification server 103 and receives a classification result. When a classifier learning operation is performed from the operation terminal PC 101, an image is transmitted to the unsupervised image classification server 104, a similar image class as a learning candidate is received, and from the image database 105 based on the class information. Image data used for learning is acquired, and the acquired image data is transmitted to the supervised image classification server 103. When a candidate image registration operation is performed from the operation terminal PC 101, the image data is transmitted to the image database 105.

教師あり画像分類サーバ１０３は、画像分類、画像分類器の学習をおこなうためのサーバであり、ネットワーク上に構築されている。教師あり画像分類サーバ１０３は、操作端末ＰＣ１０１から分類対象の画像データを受信した場合、その分類結果を送信する。操作端末ＰＣ１０１から分類器学習のための画像データを受信した場合、受信した画像データを用いて分類器を学習する。 The supervised image classification server 103 is a server for performing image classification and learning of an image classifier, and is constructed on a network. When the supervised image classification server 103 receives image data to be classified from the operation terminal PC 101, the supervised image classification server 103 transmits the classification result. When image data for classifier learning is received from the operation terminal PC101, the classifier is learned using the received image data.

教師なし画像分類サーバ１０４は、教師あり画像分類サーバ１０３にある画像分類器を学習するための候補画像の選定をおこなうためのサーバであり、ネットワーク上に構築されている。教師なし画像分類サーバ１０４は、操作端末ＰＣ１０１から学習対象の画像データを受信した場合、その画像データと類似した類似画像クラスを送信する。 The unsupervised image classification server 104 is a server for selecting candidate images for learning the image classifier in the supervised image classification server 103, and is constructed on a network. When the unsupervised image classification server 104 receives image data to be learned from the operation terminal PC 101, the unsupervised image classification server 104 transmits a similar image class similar to the image data.

画像データベース１０５は、アプリケーションサーバ１０２の操作にもとづいて画像データを記憶、管理するサーバであり、ネットワーク上に構築されている。画像データベース１０５は、画像データのほかに、画像を管理レコード情報も記憶する。 The image database 105 is a server that stores and manages image data based on the operation of the application server 102, and is constructed on a network. The image database 105 stores image management record information in addition to image data.

以下、図２を用いて図１に示した操作端末ＰＣ１０１、アプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師なし画像分類サーバ１０４、画像データベース１０５に適用可能な情報処理装置のハードウェア構成について説明する。 Hereinafter, the hardware configuration of the information processing apparatus applicable to the operation terminal PC 101, the application server 102, the supervised image classification server 103, the unsupervised image classification server 104, and the image database 105 shown in FIG. To do.

図２は、図１に示した操作端末ＰＣ１０１、アプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師なし画像分類サーバ１０４、画像データベース１０５に適用可能な情報処理装置のハードウェア構成を示すブロック図である。 FIG. 2 is a block diagram showing a hardware configuration of an information processing apparatus applicable to the operation terminal PC 101, application server 102, supervised image classification server 103, unsupervised image classification server 104, and image database 105 shown in FIG. is there.

図２に示すように、情報処理装置では、システムバス２００を介してＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２０１、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２０２、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２０３、記憶装置２０４、入力コントローラ２０５、音声入力コントローラ２０６、ビデオコントローラ２０７、メモリコントローラ２０８、よび通信Ｉ／Ｆコントローラ２０９が接続される。 As shown in FIG. 2, in the information processing apparatus, a CPU (Central Processing Unit) 201, a ROM (Read Only Memory) 202, a RAM (Random Access Memory) 203, a storage device 204, an input controller 205, An audio input controller 206, a video controller 207, a memory controller 208, and a communication I / F controller 209 are connected.

ＣＰＵ２０１は、システムバス２００に接続される各デバイスやコントローラを統括的に制御する。 The CPU 201 comprehensively controls each device and controller connected to the system bus 200.

ＲＯＭ２０２あるいは記憶装置２０４は、ＣＰＵ２０１が実行する制御プログラムであるＢＩＯＳ（ＢａｓｉｃＩｎｐｕｔ／ＯｕｔｐｕｔＳｙｓｔｅｍ）やＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）や、本情報処理方法を実現するためのコンピュータ読み取り実行可能なプログラムおよび必要な各種データ（データテーブルを含む）を保持している。 The ROM 202 or the storage device 204 is a control program executed by the CPU 201 such as a basic input / output system (BIOS) or an operating system (OS), a computer-readable program for realizing this information processing method, and various necessary programs. Holds data (including data table).

ＲＡＭ２０３は、ＣＰＵ２０１の主メモリ、ワークエリア等として機能する。ＣＰＵ２０１は、処理の実行に際して必要なプログラム等をＲＯＭ２０２あるいは記憶装置２０４からＲＡＭ２０３にロードし、ロードしたプログラムを実行することで各種動作を実現する。 The RAM 203 functions as a main memory, work area, and the like for the CPU 201. The CPU 201 implements various operations by loading a program or the like necessary for executing the processing from the ROM 202 or the storage device 204 to the RAM 203 and executing the loaded program.

入力コントローラ２０５は、キーボード／タッチパネル２１０などの入力装置からの入力を制御する。入力装置はこれに限ったものでなく、マウスやマルチタッチスクリーンなどの、複数の指でタッチされた位置を検出することが可能なタッチパネルであってもよい。 The input controller 205 controls input from an input device such as a keyboard / touch panel 210. The input device is not limited to this, and may be a touch panel capable of detecting a position touched by a plurality of fingers, such as a mouse or a multi-touch screen.

ユーザがタッチパネルに表示されたアイコンやカーソルやボタンに合わせて押下（指等でタッチ）することにより、各種の指示を行うことができる。 Various instructions can be performed when the user presses (touches with a finger or the like) in accordance with an icon, a cursor, or a button displayed on the touch panel.

この入力装置を用いて各種通信装置で利用可能な通信宛先に対する宛先を入力するようになっている。 This input device is used to input a destination for a communication destination that can be used by various communication devices.

音声入力コントローラ２０６は、マイク２１１からの入力を制御する。マイク２１１から入力された音声を音声認識することが可能となっている。 The voice input controller 206 controls input from the microphone 211. Voice input from the microphone 211 can be recognized.

ビデオコントローラ２０７は、ディスプレイ２１２などの外部出力装置への表示を制御する。ディスプレイは本体と一体になったノート型パソコンのディスプレイも含まれるものとする。なお、外部出力装置はディスプレイに限ったものははく、例えばプロジェクタであってもよい。また、前述のタッチ操作により受け付け可能な装置については、キーボード／タッチパネル２１０からの入力を受け付けることも可能となる。 The video controller 207 controls display on an external output device such as the display 212. The display includes a display of a notebook computer integrated with the main body. The external output device is not limited to a display, and may be a projector, for example. Further, for devices that can be received by the touch operation described above, it is also possible to receive input from the keyboard / touch panel 210.

なおビデオコントローラ２０７は、表示制御を行うためのビデオメモリ（ＶＲＡＭ）を制御することが可能で、ビデオメモリ領域としてＲＡＭ２０３の一部を利用することもできるし、別途専用のビデオメモリを設けることも可能である。 Note that the video controller 207 can control a video memory (VRAM) for display control, and a part of the RAM 203 can be used as a video memory area, or a dedicated video memory can be provided separately. Is possible.

本発明では、ユーザが情報処理装置を通常する場合の表示に用いられる第１のビデオメモリ領域と、所定の画面が表示される場合に、第１のビデオメモリ領域の表示内容に重ねての表示に用いられる第２のビデオメモリ領域を有している。ビデオメモリ領域は２つに限ったものではなく、情報処理装置の資源が許す限り複数有することが可能なものとする。 In the present invention, the first video memory area used for display when the user normally uses the information processing apparatus and the display overlaid on the display content of the first video memory area when a predetermined screen is displayed. A second video memory area used in The number of video memory areas is not limited to two, and a plurality of video memory areas can be provided as long as the resources of the information processing apparatus permit.

メモリコントローラ２０８は、外部メモリ２１３へのアクセスを制御する。外部メモリとしては、ブートプログラム、各種アプリケーション、フォントデータ、ユーザファイル、編集ファイル、および各種データ等を記憶する外部記憶装置（ハードディスク）、フレキシブルディスク（ＦＤ）、或いはＰＣＭＣＩＡカードスロットにアダプタを介して接続されるコンパクトフラッシュ（登録商標）メモリ等を利用可能である。 The memory controller 208 controls access to the external memory 213. The external memory is connected via an adapter to an external storage device (hard disk), flexible disk (FD), or PCMCIA card slot that stores boot programs, various applications, font data, user files, editing files, and various data. A compact flash (registered trademark) memory or the like can be used.

通信Ｉ／Ｆコントローラ２０９、ネットワーク２１４を介して外部機器と接続・通信するものであり、ネットワークでの通信制御処理を実行する。例えば、ＴＣＰ／ＩＰを用いた通信やＩＳＤＮなどの電話回線、および携帯電話の３Ｇ回線を用いた通信が可能である。 The communication I / F controller 209 is connected to and communicates with an external device via the network 214, and executes communication control processing in the network. For example, communication using TCP / IP, telephone lines such as ISDN, and communication using 3G lines of mobile phones are possible.

なお、記憶装置２０４は情報を永続的に記憶するための媒体であって、その形態をハードディスク等の記憶装置に限定するものではない。例えば、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）などの媒体であってもよい。 The storage device 204 is a medium for permanently storing information, and the form is not limited to a storage device such as a hard disk. For example, a medium such as SSD (Solid State Drive) may be used.

また本実施形態における通信端末で行われる各種処理時の一時的なメモリエリアとしても利用可能である。 It can also be used as a temporary memory area during various processes performed by the communication terminal in the present embodiment.

図３は、図１に示した操作端末ＰＣ１０１、アプリケーションサーバ１０２、教師あり画像分類サーバ１０３、教師なし画像分類サーバ１０４、画像データベース１０５の機能を説明するための機能ブロック図である。 FIG. 3 is a functional block diagram for explaining the functions of the operation terminal PC 101, application server 102, supervised image classification server 103, unsupervised image classification server 104, and image database 105 shown in FIG.

情報処理システムにおいて、主に、操作端末ＰＣ１０１では画像分類や分類器学習の操作および処理結果の表示、アプリケーションサーバ１０２では画像分類処理と分類器学習処理の制御、教師あり画像分類サーバ１０３では画像分類および分類器学習の実行、教師なし画像分類サーバ１０４では分類器学習に用いる画像候補選定の実行、画像データベース１０５は分類器学習に用いる画像データの管理をおこなう。
操作端末ＰＣ１０１は、操作結果表示部３０１を持つ。 In the information processing system, operation terminal PC101 mainly displays image classification and classifier learning operations and processing result display, application server 102 controls image classification processing and classifier learning processing, and supervised image classification server 103 performs image classification. The classifier learning is performed, the unsupervised image classification server 104 executes image candidate selection used for classifier learning, and the image database 105 manages image data used for classifier learning.
The operation terminal PC 101 has an operation result display unit 301.

操作結果表示部３０１は、ウェブブラウザから専用ＵＲＬにアクセスするウェブ管理画面により実現され、アプリケーションサーバ１０２に操作で入力した画像の分類表示要求を送信することにより、表示データを受信し、画像分類情報を表示する。また、ウェブ管理画面では、画像分類情報のほか、分類器学習操作、分類器学習に用いる候補画像の確認などについて確認できる。 The operation result display unit 301 is realized by a web management screen that accesses a dedicated URL from a web browser. The operation result display unit 301 receives display data by transmitting an operation classification display request input to the application server 102, and receives image classification information. Is displayed. In addition, on the web management screen, in addition to image classification information, it is possible to confirm classifier learning operation, confirmation of candidate images used for classifier learning, and the like.

アプリケーションサーバ１０２は、操作制御部３１１、画像分類制御部３１２、分類器学習制御部３１３から構成される。 The application server 102 includes an operation control unit 311, an image classification control unit 312, and a classifier learning control unit 313.

アプリケーションサーバ１０２は操作端末ＰＣ１０１から各操作の表示要求を受信すると、操作制御部３１１が操作内容を解析する。操作制御部３１１は受信した要求から、画像の分類表示の操作、画像分類器の学習操作、分類学習に用いる候補画像の確認操作であるかを判断する。 When the application server 102 receives a display request for each operation from the operation terminal PC 101, the operation control unit 311 analyzes the operation content. From the received request, the operation control unit 311 determines whether the operation is a classification display operation of an image, a learning operation of an image classifier, or a confirmation operation of a candidate image used for classification learning.

画像分類制御部３１２は、操作制御部３１１で画像の分類表示の操作がおこなわれたと判断した場合、画像分類対象の画像データを教師あり画像分類サーバ１０３に送信し、その結果となる分類情報を受信する。受信した分類情報は、操作制御部３１１を介して操作端末ＰＣ１０１に送信する。 If the image classification control unit 312 determines that the operation of the image classification display has been performed by the operation control unit 311, the image classification control unit 311 transmits the image data to be classified to the supervised image classification server 103, and the resulting classification information is displayed. Receive. The received classification information is transmitted to the operation terminal PC 101 via the operation control unit 311.

分類器学習制御部３１３は、操作制御部３１１で分類学習に用いる候補画像の確認操作がおこなわれたと判断した場合、教師なし画像分類サーバ１０４へ候補画像を選定する際の基となる画像データを送信して画像分類処理を実行させ、類似画像クラスを受信する。受信した類似画像クラスは、操作制御部３１１を介して操作端末ＰＣ１０１に送信する。操作制御部３１１で教師あり画像分類サーバ１０３への学習操作がおこなわれたと判断した場合、類似画像クラスを基に画像データベース１０５から該当する画像データを取得し、取得した画像データと類似画像クラスを分類器学習用データとして、教師あり画像分類サーバ１０３へ送信し、分類器の学習処理を実行させる。また、教師なし画像分類サーバ１０４への学習操作もおこなうことが可能であり、その場合は操作端末ＰＣ１０１から操作制御部３１１を介して受信した画像データを学習用データとして、教師なし画像分類サーバ１０４へ送信する。 If the classifier learning control unit 313 determines that the operation control unit 311 has confirmed the candidate image used for classification learning, the classifier learning control unit 313 uses the unsupervised image classification server 104 to select image data as a basis for selecting a candidate image. The image classification process is performed by transmission, and a similar image class is received. The received similar image class is transmitted to the operation terminal PC 101 via the operation control unit 311. When the operation control unit 311 determines that a learning operation to the supervised image classification server 103 has been performed, the corresponding image data is acquired from the image database 105 based on the similar image class, and the acquired image data and the similar image class are determined. The data for classifier learning is transmitted to the supervised image classification server 103, and the classifier learning process is executed. It is also possible to perform a learning operation on the unsupervised image classification server 104. In this case, image data received from the operation terminal PC 101 via the operation control unit 311 is used as learning data, and the unsupervised image classification server 104 is used. Send to.

教師あり画像分類サーバ１０３は、データ送受信制御部３２１、画像分類解析部３２２、分類器制御部３２３、教師あり学習制御部３２４から構成される。 The supervised image classification server 103 includes a data transmission / reception control unit 321, an image classification analysis unit 322, a classifier control unit 323, and a supervised learning control unit 324.

教師あり画像分類サーバ１０３は、アプリケーションサーバ１０２から画像分類対象の画像データ、または分類器学習用データを受信する。データ送受信制御部３２１は、受信したデータが画像分類対象の画像データであるか、分類器学習用データであるかを判断する。 The supervised image classification server 103 receives image data subject to image classification or classifier learning data from the application server 102. The data transmission / reception control unit 321 determines whether the received data is the image classification target image data or the classifier learning data.

画像分類解析部３２２は、データ送受信制御部３２１で画像分類対象の画像データを受信したと判断した場合に、画像分類解析をおこなう。画像分類解析では、分類器制御部３２３に分類対象の画像データを入力し、その分類結果を取得する。取得した結果は、データ送受信制御部３２１を介して、アプリケーションサーバ１０２に送信する。 When the data transmission / reception control unit 321 determines that the image classification target image data has been received, the image classification analysis unit 322 performs image classification analysis. In the image classification analysis, image data to be classified is input to the classifier control unit 323, and the classification result is acquired. The acquired result is transmitted to the application server 102 via the data transmission / reception control unit 321.

分類器制御部３２３は、画像分類解析部３２２から画像データを入力された場合は分類結果を出力する。教師あり学習制御部３２４から分類器学習用データを入力された場合は分類器の教師あり学習をおこなう。 The classifier control unit 323 outputs a classification result when image data is input from the image classification analysis unit 322. When classifier learning data is input from the supervised learning control unit 324, supervised learning of the classifier is performed.

教師あり学習制御部３２４は、データ送受信制御部３２１で分類器学習用データを受信したと判断した場合、分類器の教師あり学習をおこなう。分類器の教師あり学習では、分類器制御部３２３に学習用データを入力し、分類器を学習させる。 If the supervised learning control unit 324 determines that the data transmission / reception control unit 321 has received classifier learning data, the supervised learning control unit 324 performs supervised learning of the classifier. In supervised learning of the classifier, learning data is input to the classifier control unit 323 to learn the classifier.

教師なし画像分類サーバ１０４は、データ送受信制御部３３１、画像選定制御部３３２、分類器制御部３３３、教師なし学習制御部３３４から構成される。 The unsupervised image classification server 104 includes a data transmission / reception control unit 331, an image selection control unit 332, a classifier control unit 333, and an unsupervised learning control unit 334.

教師なし画像分類サーバ１０４は、アプリケーションサーバ１０２から教師あり学習データの候補画像選定の基となる画像データ、分類器学習用データを受信する。データ送受信部３３１は、受信したデータが候補画像選定の基となる画像データであるか、分類器学習用データであるかを判断する。 The unsupervised image classification server 104 receives, from the application server 102, image data and classifier learning data as a basis for selecting candidate images for supervised learning data. The data transmitting / receiving unit 331 determines whether the received data is image data that is a basis for selecting a candidate image or classifier learning data.

画像選定制御部３３２は、データ送受信制御部３３１で教師あり学習データの候補画像選定の基となる画像データを受信したと判断した場合に、候補画像の選定処理をおこなう。候補画像の選定では、分類制御部３３３に選定基の画像データを入力し、その結果である類似画像クラスを取得する。取得したクラス情報は、データ送受信制御部３３１を介して、アプリケーションサーバ１０２に送信する。 The image selection control unit 332 performs candidate image selection processing when the data transmission / reception control unit 331 determines that it has received image data as a basis for selecting candidate images for supervised learning data. In selecting a candidate image, image data of the selection base is input to the classification control unit 333, and a similar image class as a result is acquired. The acquired class information is transmitted to the application server 102 via the data transmission / reception control unit 331.

分類器制御部３３３は、画像選定制御部３３２から画像データを入力された場合はその画像の類似画像クラスを出力する。教師なし学習制御部３３４から分類器学習用データを入力された場合は、分類器の教師なし学習をおこなう。 When image data is input from the image selection control unit 332, the classifier control unit 333 outputs a similar image class of the image. When the classifier learning data is input from the unsupervised learning control unit 334, the classifier performs unsupervised learning.

教師なし学習制御部３３４は、データ送受信制御部３２１で分類器学習用データを受信したと判断した場合、分類器の教師なし学習をおこなう。分類器の教師なし学習では、分類器制御部３３３に学習用データを入力し、分類器を学習させる。 When the data transmission / reception control unit 321 determines that the classifier learning data has been received, the unsupervised learning control unit 334 performs unsupervised learning of the classifier. In unsupervised learning of the classifier, learning data is input to the classifier control unit 333, and the classifier is trained.

画像データベース１０５は、データ送受信制御部３４１、レコート管理制御部３４２、画像データ用記憶領域３４３、管理レコード用記憶領域３４４から構成される。 The image database 105 includes a data transmission / reception control unit 341, a record management control unit 342, an image data storage area 343, and a management record storage area 344.

画像データベース１０５は、アプリケーションサーバ１０２から画像データの登録または取得、管理情報更新の要求を受信する。データ送受信部３４１は、受信したデータが画像データ登録または取得要求、管理情報更新要求であるかを判断する。 The image database 105 receives a request for registration or acquisition of image data and management information update from the application server 102. The data transmission / reception unit 341 determines whether the received data is an image data registration or acquisition request or a management information update request.

レコード管理制御部３４２は、データ送受信部３４１の判断内容により、画像データの登録、取得、管理情報の更新をおこなう。画像データの登録と判断した場合、画像データ用記憶領域３４３に画像データを保存し、管理レコード用記憶領域３４４に新規管理レコード情報を作成する。画像データの取得と判断した場合、管理レコード用記憶領域３４４から条件に合う画像データを検索し、その検索結果を基に画像データ用記憶領域３４３から画像データを取得し、データ送受信制御部３４１を介してアプリケーションサーバ１０２に取得した画像データを送信する。管理情報の更新と判断した場合、管理レコード用記憶領域３４４から該当する管理レコードの更新をおこなう。 The record management control unit 342 performs registration / acquisition of image data and update of management information according to the determination contents of the data transmission / reception unit 341. If it is determined that the image data is registered, the image data is stored in the image data storage area 343 and new management record information is created in the management record storage area 344. If it is determined that the image data is to be acquired, the image data that meets the conditions is searched from the management record storage area 344, the image data is acquired from the image data storage area 343 based on the search result, and the data transmission / reception control unit 341 is executed. The acquired image data is transmitted to the application server 102. When it is determined that the management information is updated, the corresponding management record is updated from the management record storage area 344.

次に図４に示すフローチャートを用いて、教師なし画像分類器を用いた教師あり画像分類器への学習用画像の選定処理について説明する。 Next, the learning image selection processing for the supervised image classifier using the unsupervised image classifier will be described using the flowchart shown in FIG.

なお、図４においてはアプリケーションサーバ１０２は図示していないが、操作端末ＰＣ１０１と教師あり画像分類サーバ１０３との間の通信、操作端末ＰＣ１０１と教師なし画像分類サーバ１０４との間の通信については、アプリケーションサーバ１０２が介在するものである。 Although the application server 102 is not shown in FIG. 4, the communication between the operation terminal PC101 and the supervised image classification server 103 and the communication between the operation terminal PC101 and the unsupervised image classification server 104 are as follows. The application server 102 is interposed.

ステップＳ４０１では、操作端末ＰＣ１０１のＣＰＵ２０１は、ユーザからの指示により検証用画像を受け付け、当該画像を教師あり画像分類サーバ１０３に送信する。 In step S <b> 401, the CPU 201 of the operation terminal PC 101 receives a verification image according to an instruction from the user, and transmits the image to the supervised image classification server 103.

検証用画像は、画像分類の精度検証を行うための画像データである。 The verification image is image data for verifying the accuracy of image classification.

ステップＳ４０２では、教師あり画像分類サーバ１０３のＣＰＵ２０１は、ステップＳ４０１で操作端末ＰＣ１０１から送信された検証用画像を取得する。 In step S402, the CPU 201 of the supervised image classification server 103 acquires the verification image transmitted from the operation terminal PC 101 in step S401.

ステップＳ４０３では、教師あり画像分類サーバ１０３のＣＰＵ２０１は、ステップＳ４０２で取得した画像に対して、分類推定を行う。 In step S403, the CPU 201 of the supervised image classification server 103 performs classification estimation on the image acquired in step S402.

分類推定は、教師あり画像分類器を使用する。教師あり画像分類器は、ＣＮＮ（ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）などの公知の技術を利用するものとする。 Classification estimation uses a supervised image classifier. The supervised image classifier uses a known technique such as CNN (Convolutional Neural Network).

ステップＳ４０４では、教師あり画像分類サーバ１０３のＣＰＵ２０１は、ステップＳ４０３の処理の結果を操作端末ＰＣ１０１に送信する。 In step S404, the CPU 201 of the supervised image classification server 103 transmits the result of the process in step S403 to the operation terminal PC101.

ステップＳ４０５では、操作端末ＰＣ１０１のＣＰＵ２０１は、教師あり画像分類サーバ１０３から送信された結果を受信し、表示部に表示する。 In step S405, the CPU 201 of the operation terminal PC101 receives the result transmitted from the supervised image classification server 103 and displays it on the display unit.

ステップＳ４０６では、操作端末ＰＣ１０１のＣＰＵ２０１は、ユーザからステップＳ４０５で受信した結果に対する評価を受け付ける。具体的には、図８に示す画面を介して、分類推定結果が適切である否かの選択を受け付ける。 In step S406, the CPU 201 of the operation terminal PC101 receives an evaluation on the result received in step S405 from the user. Specifically, selection of whether or not the classification estimation result is appropriate is accepted via the screen shown in FIG.

ステップＳ４０７であ、操作端末ＰＣ１０１のＣＰＵ２０１は、ステップＳ４０６において受け付けた評価が、分類推定結果が適切である旨の評価であったかを判定する。 In step S407, the CPU 201 of the operation terminal PC 101 determines whether the evaluation received in step S406 is an evaluation that the classification estimation result is appropriate.

分類推定結果が適切である旨の評価の場合（ステップＳ４０７：ＹＥＳ）は、本フローチャートの処理を終了する。 In the case of evaluation that the classification estimation result is appropriate (step S407: YES), the process of this flowchart is terminated.

分類推定結果が適切ではない旨の評価の場合（ステップＳ４０７：ＮＯ）は、処理をステップＳ４０８に移行する。 In the case of evaluation that the classification estimation result is not appropriate (step S407: NO), the process proceeds to step S408.

ステップＳ４０８では、操作端末ＰＣ１０１のＣＰＵ２０１は、教師なし画像分類サーバ１０４に対して、教師あり画像分類器の学習に用いる画像を抽出する旨の指示を送信する。 In step S <b> 408, the CPU 201 of the operation terminal PC 101 transmits an instruction for extracting an image used for learning of the supervised image classifier to the unsupervised image classification server 104.

ステップＳ４０９では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、教師あり画像分類器の学習に用いる画像の候補を、画像データベース１０５から抽出する。候補画像の抽出は、教師なし画像分類器を使用し、検証用画像と類似する画像を抽出する。類似画像として判断するのに使用する特徴は、色やテクスチャ、形状などのほか、類似性を表す他の特徴であってもよい。それらの特徴から画像を自動抽出できるよう、教師なし画像分類器を事前に学習しておくものとする。なお、教師なし画像分類器は、ＳＯＭ（ＳｅｌｆＯｒｇａｎｉｚｉｎｇＭａｐ）などの公知の技術を用いるものとする。 In step S409, the CPU 201 of the unsupervised image classification server 104 extracts image candidates used for learning of the supervised image classifier from the image database 105. The candidate images are extracted using an unsupervised image classifier and an image similar to the verification image is extracted. The feature used for determining as a similar image may be other features representing similarity in addition to color, texture, shape, and the like. Assume that an unsupervised image classifier is learned in advance so that images can be automatically extracted from these features. The unsupervised image classifier uses a known technique such as SOM (Self Organizing Map).

ステップＳ４０９の処理の詳細については、図５を用いて後述する。 Details of the processing in step S409 will be described later with reference to FIG.

ステップＳ４１０では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、ステップＳ４０９で抽出した候補画像のデータを操作端末ＰＣ１０１に対して送信する。 In step S410, the CPU 201 of the unsupervised image classification server 104 transmits the candidate image data extracted in step S409 to the operation terminal PC101.

ステップＳ４１１では、操作端末ＰＣ１０１のＣＰＵ２０１は、ステップＳ４１０で教師なし画像分類サーバ１０４から送信された画像を受信し、表示部に表示する。本ステップで表示される画面の一例を図７に示す。 In step S411, the CPU 201 of the operation terminal PC101 receives the image transmitted from the unsupervised image classification server 104 in step S410 and displays it on the display unit. An example of the screen displayed in this step is shown in FIG.

ステップＳ４１２では、操作端末ＰＣ１０１のＣＰＵ２０１は、ユーザの指示に従い、教師データを作成する。具体的な処理は、図７を用いて後述する。 In step S412, the CPU 201 of the operation terminal PC101 creates teacher data in accordance with a user instruction. Specific processing will be described later with reference to FIG.

ステップＳ４１３では、操作端末ＰＣ１０１のＣＰＵ２０１は、ステップＳ４１２で作成した教師データを教師あり画像分類サーバ１０３に送信する。 In step S413, the CPU 201 of the operation terminal PC101 transmits the teacher data created in step S412 to the supervised image classification server 103.

ステップＳ４１４では、教師あり画像分類サーバ１０３のＣＰＵ２０１は、ステップＳ４１３で操作端末ＰＣ１０１から送信された教師データを受信し、当該教師データを教師あり分類器に入力して学習させる。 In step S414, the CPU 201 of the supervised image classification server 103 receives the teacher data transmitted from the operation terminal PC 101 in step S413, and inputs the supervised data to the supervised classifier for learning.

ステップＳ４１５では、教師あり画像分類サーバ１０３のＣＰＵ２０１は、ステップＳ４１４で学習に使用した画像データが、学習用画像の候補から取り除かれるよう、画像データベース１０５の管理レコードに記録する。学習に使用したことを記録することで、次回以降の学習の候補画像の抽出対象から除外されるようにし、誤って学習に再利用されることを防ぐ。 In step S415, the CPU 201 of the supervised image classification server 103 records the image data used for learning in step S414 in the management record of the image database 105 so as to be removed from the learning image candidates. By recording what has been used for learning, it is excluded from the extraction target of candidate images for subsequent learning, and is prevented from being reused by mistake for learning.

そして、処理をステップＳ４０３に戻す。ここで処理をステップＳ４０３に戻すのは、ステップＳ４１４における学習の効果を検証するためである。そのため、次のステップＳ４０３の処理では、再度同じ検証用画像を用いて分類推定処理を実行する。 Then, the process returns to step S403. The reason why the process returns to step S403 is to verify the effect of learning in step S414. Therefore, in the process of the next step S403, the classification estimation process is executed again using the same verification image.

次に、図５を参照して、本実施形態の類似画像取得処理（ステップＳ４０９）について説明する。 Next, with reference to FIG. 5, the similar image acquisition process (step S409) of the present embodiment will be described.

ステップＳ５０１では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、検証用画像の特徴量を算出する。算出する特徴量は、色やテクスチャ、形状などのほか、画像の類似性を表す他の特徴であってもよい。これらはユーザにより設定されるものとする。 In step S501, the CPU 201 of the unsupervised image classification server 104 calculates the feature amount of the verification image. The feature amount to be calculated may be a color, a texture, a shape, or the like, or another feature representing image similarity. These are set by the user.

例えば、色特徴に重み付けをして検証用画像の特徴量を算出し、当該特徴量を用いて類似する画像を抽出することで、教師あり画像分類器に対して色特徴による分類を学習させることが可能となる。このように、教師あり画像分類器に対してどのような学習をさせたいかにより、どの特徴を用いるかを決定する。 For example, the feature amount of the verification image is calculated by weighting the color feature, and a similar image is extracted using the feature amount, so that the supervised image classifier learns the classification based on the color feature. Is possible. In this way, which feature is used is determined depending on what kind of learning is desired for the supervised image classifier.

ステップＳ５０２では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、ステップＳ５０１で算出した特徴量を基に、教師なし画像学習器を用いて、検証用画像を類似画像クラスに分類する。類似画像クラスは、画像が持つ特徴量により、類似しているものを取りまとめるための情報であり、類似していると判断された画像は同一のクラスに所属する。この情報は、画像データベース１０５に記憶されている各画像データの管理レコードに類似画像クラス情報として、事前に分類、記録されているものとする。管理レコードの記載例は、図６の説明で後述する。 In step S502, the CPU 201 of the unsupervised image classification server 104 classifies the verification image into a similar image class using an unsupervised image learner based on the feature amount calculated in step S501. The similar image class is information for collecting similar images according to the feature amounts of the images, and images determined to be similar belong to the same class. This information is assumed to be classified and recorded in advance as similar image class information in the management record of each image data stored in the image database 105. A description example of the management record will be described later with reference to FIG.

類似画像クラスの分類に使用する教師なし画像分類器は、ＳＯＭなど、教師なし学習による分類器であればいずれでもよい。 The unsupervised image classifier used for classification of similar image classes may be any classifier based on unsupervised learning such as SOM.

ステップＳ５０３では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、ステップＳ５０２で分類した類似画像クラスを基に、画像データベース１０５から同一クラスに所属する画像データを取得し、候補画像リストを作成する。候補画像リストは、各画像データと管理レコードを対にしたリストで構成される。 In step S503, the CPU 201 of the unsupervised image classification server 104 acquires image data belonging to the same class from the image database 105 based on the similar image class classified in step S502, and creates a candidate image list. The candidate image list is composed of a list in which each image data and management record are paired.

また、同一クラスに所属する画像データだけでなく、閾値より類似度の高いクラスの画像データも候補画像としてリストにする。この処理により、図７において、同一の類似画像クラスに所属する画像のみを表示したり、類似度が閾値より近い類似画像クラスも含めて候補画像を表示したりすることが可能となる。 In addition to image data belonging to the same class, image data of a class having a higher degree of similarity than the threshold is listed as a candidate image. By this process, in FIG. 7, it is possible to display only images belonging to the same similar image class or display candidate images including similar image classes whose similarity is closer than the threshold.

また、本実施形態では、候補画像を類似画像クラス情報を基にして抽出したが、その他の情報（例えば、画像カテゴリ情報など）を各画像にあらかじめ付与しておき、それらを組み合わせた条件を基に抽出してもよい。 In this embodiment, the candidate images are extracted based on the similar image class information. However, other information (for example, image category information) is given to each image in advance, and a condition based on a combination thereof is used. May be extracted.

ステップＳ５０４では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、ステップＳ５０３で作成した候補画像リストに登録されている各画像データについて、既に教師あり分類器の学習に使用されているかを判断する。 In step S504, the CPU 201 of the unsupervised image classification server 104 determines whether each image data registered in the candidate image list created in step S503 is already used for learning of the supervised classifier.

学習に使用済みであるかの情報は、各画像データの管理レコードに記録されているものとする（図６）。 It is assumed that information indicating whether or not it has been used for learning is recorded in a management record of each image data (FIG. 6).

学習に使用済みである場合（ステップＳ５０４：ＹＥＳ）は、処理をステップＳ５０５に移行する。 If it is already used for learning (step S504: YES), the process proceeds to step S505.

学習にしようしていない場合（ステップＳ５０４：ＮＯ）は、本フローチャートの処理を終了し、ステップＳ４１０の処理に移行する。 If learning is not being performed (step S504: NO), the process of this flowchart is terminated, and the process proceeds to step S410.

ステップＳ５０５では、教師なし画像分類サーバ１０４のＣＰＵ２０１は、ステップＳ５０４で学習に使用済みと判断された各画像データを、候補画像リストから削除する。
そして、ステップＳ４１０の処理に移行する。 In step S505, the CPU 201 of the unsupervised image classification server 104 deletes each image data determined to be used for learning in step S504 from the candidate image list.
Then, the process proceeds to step S410.

以上説明したように、本発明では、教師あり画像分類器の分類精度を向上させるための学習において、検証用画像に類似する画像を教師なし画像分類器により抽出する。 As described above, in the present invention, in learning for improving the classification accuracy of the supervised image classifier, an image similar to the verification image is extracted by the unsupervised image classifier.

このような処理により、大量の候補画像の中から教師あり画像分類器の分類精度向上に適した画像を抽出することが可能となる。 By such processing, it is possible to extract an image suitable for improving the classification accuracy of the supervised image classifier from a large number of candidate images.

具体的には、従来は認識精度の悪かった形態の「自動車」について、当該自動車と類似する形態の自動車の画像を学習させることで、当該形態の自動車について認識精度を高めることが可能となる。その結果、様々な形態の自動車を認識できるようになり、画像分類の精度を高めることが可能となる。 Specifically, it is possible to increase the recognition accuracy of an automobile of the form by learning an image of an automobile having a form similar to the automobile for an “automobile” having a form with poor recognition accuracy. As a result, various forms of automobiles can be recognized, and the accuracy of image classification can be improved.

また、ユーザは抽出された画像の中から学習用画像を選定することになるため、学習用画像を選定する作業の効率化を図ることが可能となる。 In addition, since the user selects a learning image from the extracted images, it is possible to improve the efficiency of the work of selecting the learning image.

次に図６を用いて、画像データの管理レコード情報テーブルのデータ構成例について説明する。 Next, a data configuration example of the management record information table for image data will be described with reference to FIG.

図６は、画像データベース１０５に記録している画像データを管理するための、管理レコードのデータ構成例である。 FIG. 6 is a data configuration example of a management record for managing the image data recorded in the image database 105.

図６（Ａ）は、画像データの所属する類似画像クラスを表現するデータ構成例である。 FIG. 6A is a data configuration example that represents a similar image class to which image data belongs.

画像データを識別する情報である画像ＩＤと、当該画像データが属する類似画像クラス（ノード）を識別するノードＩＤとが対応付けて登録されている。 An image ID, which is information for identifying image data, and a node ID for identifying a similar image class (node) to which the image data belongs are registered in association with each other.

画像データは、類似画像クラス情報により、類似する画像同士を関連付けて管理される。同じ類似画像クラスに所属していれば、それらの画像は類似する物として取り扱われる。 Image data is managed by associating similar images with each other based on similar image class information. If they belong to the same similar image class, those images are handled as similar objects.

図６（Ｂ）は、画像データの教師あり画像分類器学習への使用状況を表現するデータ構成例である。 FIG. 6B is an example of a data configuration that represents a use situation of supervised image classifier learning of image data.

画像ＩＤと学習済みであるかを示す情報が対応付けて登録されている。図６（Ｂ）の例では、ｔｒｕｅが登録されていれば学習済みであることを、ｆａｌｓｅが登録されていれば学習済みではないことを意味している。 Information indicating whether or not the image ID has been learned is registered in association with each other. In the example of FIG. 6B, if true is registered, it means that learning has been completed, and if false is registered, it means that learning has not been completed.

図６（Ｃ）は、画像データの類似画像クラス情報を表現するデータ構成例である。 FIG. 6C is a data configuration example that represents similar image class information of image data.

類似画像クラスを識別する情報であるノードＩＤと当該類似画像クラスに所属する画像データの特徴を示す情報（特徴Ａ〜特徴Ｚ）とが対応付けて登録されている。 A node ID, which is information for identifying a similar image class, and information (features A to Z) indicating features of image data belonging to the similar image class are registered in association with each other.

図６（Ｃ）における特徴は、教師なし画像分類器で分類する際に指標となる特徴量である。特徴量の数は任意である。また、特徴量の値は、同じ類似画像クラスに所属する画像データの平均値や、その他の指標から算出した値のいずれであってもよい。この特徴量は、教師なし画像分類器の種類により、分類辞書としても使用してもよい。 The feature in FIG. 6C is a feature amount that serves as an index when classification is performed by an unsupervised image classifier. The number of feature quantities is arbitrary. The feature value may be either an average value of image data belonging to the same similar image class or a value calculated from another index. This feature quantity may be used as a classification dictionary depending on the type of unsupervised image classifier.

次に、図７を用いて、ステップＳ４１１において操作端末ＰＣ１０１に表示される画面について説明する。 Next, the screen displayed on the operation terminal PC 101 in step S411 will be described with reference to FIG.

図７に示す画面は、ユーザからの教師データ作成の指示を受け付ける画面である。図７の画面において入力された情報に従い、教師あり画像分類器の学習に使用するための教師データの作成が行われる。 The screen shown in FIG. 7 is a screen that accepts an instruction to create teacher data from the user. Teacher data for use in learning of the supervised image classifier is created according to the information input on the screen of FIG.

図７に示す画面は、ステップＳ４０１で入力を受け付けた検証用画像を表示する領域７０１、当該画像に対する分類推定結果を表示する領域７０２、当該検証用画像に対して付与されるラベルの選択を受け付ける領域７０３、画像表示範囲の指定を受け付ける領域７０４、ステップＳ４０９の処理により抽出された画像を表示する領域７０５等から構成される。 The screen shown in FIG. 7 accepts an area 701 for displaying the verification image received in step S401, an area 702 for displaying the classification estimation result for the image, and a selection of a label given to the verification image. An area 703, an area 704 for accepting designation of an image display range, an area 705 for displaying an image extracted by the processing in step S409, and the like.

ラベルの選択を受け付ける領域７０３では、検証用画像の正しい分類結果の選択を受け付ける。例えば、図７の例では、「自動車」、「乗用車」のラベルが選択されていることを示している。指定できるラベルは、教師あり分類器が分類できるものとする。また指定する分類ラベルは、単一指定のほか、複数指定も可能である。 In a region 703 for receiving a label selection, selection of a correct classification result of the verification image is received. For example, in the example of FIG. 7, it is shown that the labels “automobile” and “passenger car” are selected. Labels that can be specified can be classified by a supervised classifier. In addition to the single designation, a plurality of designations can be designated.

また、画像表示領域に表示されチェックがなされた画像に対しても、領域７０３で選択されたラベルの画像として学習される。 Also, an image displayed in the image display area and checked is learned as an image of the label selected in the area 703.

画像表示範囲の指定領域７０４は、画像表示領域７０５に表示される画像の範囲を指定できる。 An image display range designation area 704 can designate a range of an image displayed in the image display area 705.

候補画像は、検証用画像を教師なし画像分類した結果である類似画像クラス情報を基に選定される。 Candidate images are selected based on similar image class information that is the result of unsupervised image classification of verification images.

その際、範囲の指定領域７０４において指定された条件に従い、例えば、同一の類似画像クラスに所属する画像のみを表示したり、類似度が閾値より近い類似画像クラスも含めて候補画像を表示したりする。 At that time, according to the conditions specified in the range specification area 704, for example, only images belonging to the same similar image class are displayed, or candidate images including similar image classes whose similarity is closer than the threshold are displayed. To do.

画像表示領域７０５は、表示範囲の指定７０４で指定された内容に従い、教師あり画像分類器の学習に使用する候補画像の一覧を表示する。学習に使用する画像は、画像を選択されることで（チェックをされることで）指定される。一括選択ボタン７０６が押下されると、表示された全ての画像が選択されたことになる。指定後、学習実行ボタン７０７が押下されると、当該選択された画像に対して、領域７０３において選択されたラベルが付与され、教師あり画像分類器の学習が実行される。 The image display area 705 displays a list of candidate images used for learning of the supervised image classifier according to the content specified in the display range specification 704. An image to be used for learning is designated by selecting an image (by being checked). When the collective selection button 706 is pressed, all displayed images are selected. When the learning execution button 707 is pressed after the designation, the label selected in the region 703 is assigned to the selected image, and learning of the supervised image classifier is executed.

図８は、ステップＳ４０５において操作端末ＰＣ１０１に表示される画面の一例である。 FIG. 8 is an example of a screen displayed on the operation terminal PC 101 in step S405.

図８に示す画面を介して、ユーザから分類推定結果が適切であったか否か、学習が必要であるかの指示を受け付ける（Ｓ４０６）。 Through the screen shown in FIG. 8, an instruction is received from the user as to whether or not the classification estimation result is appropriate and whether learning is necessary (S406).

図８に示す通り、ステップＳ４０１で入力を受け付けた検証用画像８０１と、その分類推定結果８０２とが表示されている。 As shown in FIG. 8, the verification image 801 that has received the input in step S401 and the classification estimation result 802 are displayed.

適切ボタン８０３が押下されることで、分類推定結果が適切であった旨の結果を受け付ける（ステップＳ４０７：ＹＥＳ）。学習ボタン８０４が押下されることで、分類推定結果が適切ではない旨の結果を受け付ける（ステップＳ４０７：ＮＯ）。 When the appropriate button 803 is pressed, a result indicating that the classification estimation result is appropriate is accepted (step S407: YES). When the learning button 804 is pressed, a result indicating that the classification estimation result is not appropriate is received (step S407: NO).

また、本発明におけるプログラムは、図４、図５の処理をコンピュータに実行させるプログラムである。なお、本発明におけるプログラムは、図４、図５の各処理ごとのプログラムであってもよい。 The program in the present invention is a program that causes a computer to execute the processes of FIGS. The program in the present invention may be a program for each process in FIGS. 4 and 5.

以上のように、前述した実施形態の機能を実現するプログラムを記録した記録媒体を、システムあるいは装置に供給し、そのシステムあるいは装置のコンピュータ（またはＣＰＵやＭＰＵ）が記録媒体に格納されたプログラムを読み出し、実行することによっても本発明の目的が達成されることは言うまでもない。 As described above, a recording medium that records a program that implements the functions of the above-described embodiments is supplied to a system or apparatus, and a computer (or CPU or MPU) of the system or apparatus stores the program stored in the recording medium. It goes without saying that the object of the present invention can also be achieved by reading and executing.

この場合、記録媒体から読み出されたプログラム自体が本発明の新規な機能を実現することになり、そのプログラムを記録した記録媒体は本発明を構成することになる。 In this case, the program itself read from the recording medium realizes the novel function of the present invention, and the recording medium recording the program constitutes the present invention.

プログラムを供給するための記録媒体としては、例えば、フレキシブルディスク、ハードディスク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、ＣＤ−Ｒ、ＤＶＤ−ＲＯＭ、磁気テープ、不揮発性のメモリカード、ＲＯＭ、ＥＥＰＲＯＭ、シリコンディスク等を用いることが出来る。 As a recording medium for supplying the program, for example, a flexible disk, hard disk, optical disk, magneto-optical disk, CD-ROM, CD-R, DVD-ROM, magnetic tape, nonvolatile memory card, ROM, EEPROM, silicon A disk or the like can be used.

また、コンピュータが読み出したプログラムを実行することにより、前述した実施形態の機能が実現されるだけでなく、そのプログラムの指示に基づき、コンピュータ上で稼働しているＯＳ（オペレーティングシステム）等が実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Further, by executing the program read by the computer, not only the functions of the above-described embodiments are realized, but also an OS (operating system) operating on the computer based on an instruction of the program is actually It goes without saying that a case where the function of the above-described embodiment is realized by performing part or all of the processing and the processing is included.

さらに、記録媒体から読み出されたプログラムが、コンピュータに挿入された機能拡張ボードやコンピュータに接続された機能拡張ユニットに備わるメモリに書き込まれた後、そのプログラムコードの指示に基づき、その機能拡張ボードや機能拡張ユニットに備わるＣＰＵ等が実際の処理の一部または全部を行い、その処理によって前述した実施形態の機能が実現される場合も含まれることは言うまでもない。 Furthermore, after the program read from the recording medium is written to the memory provided in the function expansion board inserted into the computer or the function expansion unit connected to the computer, the function expansion board is based on the instructions of the program code. It goes without saying that the case where the CPU or the like provided in the function expansion unit performs part or all of the actual processing and the functions of the above-described embodiments are realized by the processing.

また、本発明は、複数の機器から構成されるシステムに適用しても、ひとつの機器から成る装置に適用しても良い。また、本発明は、システムあるいは装置にプログラムを供給することによって達成される場合にも適応できることは言うまでもない。この場合、本発明を達成するためのプログラムを格納した記録媒体を該システムあるいは装置に読み出すことによって、そのシステムあるいは装置が、本発明の効果を享受することが可能となる。 The present invention may be applied to a system constituted by a plurality of devices or an apparatus constituted by a single device. Needless to say, the present invention can be applied to a case where the present invention is achieved by supplying a program to a system or apparatus. In this case, by reading a recording medium storing a program for achieving the present invention into the system or apparatus, the system or apparatus can enjoy the effects of the present invention.

さらに、本発明を達成するためのプログラムをネットワーク上のサーバ、データベース等から通信プログラムによりダウンロードして読み出すことによって、そのシステムあるいは装置が、本発明の効果を享受することが可能となる。なお、上述した各実施形態およびその変形例を組み合わせた構成も全て本発明に含まれるものである。 Furthermore, by downloading and reading a program for achieving the present invention from a server, database, etc. on a network using a communication program, the system or apparatus can enjoy the effects of the present invention. In addition, all the structures which combined each embodiment mentioned above and its modification are also included in this invention.

１０１操作端末ＰＣ
１０２アプリケーションサーバ
１０３教師あり画像分類サーバ
１０４教師なし画像分類サーバ
１０５画像データベース 101 Operation terminal PC
102 application server 103 supervised image classification server 104 unsupervised image classification server 105 image database

Claims

Image input means for inputting a classification target image to be classified by the first classifier ;
Similar image acquisition means for acquiring an image similar to the classification target image classified by the first classifier ;
And the teacher data generating means for generating teaching data by accepting the label against the image obtained by the similar image acquisition unit,
Learning means for learning the first classifier using the teacher data created by the teacher data creating means;
An information processing system comprising:

An evaluation receiving means for receiving an evaluation result for the classification result by the first classifier;
The similar image acquisition means acquires an image similar to the classification target image classified by the first classifier when the evaluation result received by the evaluation reception means is a result indicating that learning is necessary. The information processing system according to claim 1.

Of the images acquired by the similar image acquisition means, further comprising a selection reception means for receiving selection of an image used for learning by the learning means,
The information processing system according to claim 1, wherein the teacher data creating unit creates the teacher data by receiving a label for the image selected by the selection receiving unit.

Display control means for displaying a list of images acquired by the similar image acquisition means;
The information processing system according to claim 3, wherein the selection receiving unit receives a selection of an image used for learning by receiving a selection operation on the image displayed by the display control unit.

5. The display control unit further displays the classification target image classified by the first classifier on a screen on which a list of images acquired by the similar image acquisition unit is displayed. Information processing system described in 1.

The image acquired by the similar image acquisition unit is an image similar to the classification target image, and is an image extracted by an unsupervised image classifier. Information processing system described in 1.

An image input step in which the image input means of the information processing system inputs a classification target image to be classified by the first classifier ;
Similar image acquisition unit of the information processing system, a similar image acquisition step of acquiring an image similar to the classified grouping target image by the first classifier,
Teaching data creation means of the information processing system, and the teacher data creating step of creating teacher data by accepting the label against the image obtained by the similar image acquisition step,
A learning step in which learning means of the information processing system learns the first classifier using the teacher data created by the teacher data creation step;
An information processing method comprising:

Information processing device
Image input means for inputting a classification target image to be classified by the first classifier ;
Similar image acquisition means for acquiring an image similar to the classification target image classified by the first classifier ;
And the teacher data generating means for generating teaching data by accepting the label against the image obtained by the similar image acquisition unit,
A program for causing a teacher to learn the first classifier using the teacher data created by the teacher data creating means.