JP6647632B2

JP6647632B2 - Generating training data for machine learning

Info

Publication number: JP6647632B2
Application number: JP2017169856A
Authority: JP
Inventors: 誓旅渡邊
Original assignee: Individual
Current assignee: Individual
Priority date: 2017-09-04
Filing date: 2017-09-04
Publication date: 2020-02-14
Anticipated expiration: 2037-09-04
Also published as: JP2019046269A

Description

本発明は、機械学習用訓練データの生成に関するものである。 The present invention relates to generation of training data for machine learning.

ある画像処理方法では、ニューラルネットワークを使用して、生物学的組織サンプルの画像内の生物学的構成を自動検出している（例えば特許文献１参照）。 One image processing method uses a neural network to automatically detect a biological configuration in an image of a biological tissue sample (for example, see Patent Document 1).

一般的に、このような、画像が入力されるニューラルネットワークの訓練データとしては、サンプル画像を入手して、そのサンプル画像に対するラベルを手作業で付したものが使用される。 Generally, training data of a neural network to which an image is input is obtained by obtaining a sample image and manually attaching a label to the sample image.

特表２０１７−５１６９９２号公報JP-T-2017-516992

しかしながら、ディープニューラルネットワークのような機械学習システムでは多くの訓練データセット（つまり、多くの入力と出力との対）が必要になるため、適切な訓練データを作成するのに長い時間がかかってしまう。 However, machine learning systems such as deep neural networks require many training data sets (that is, many input and output pairs), and it takes a long time to generate appropriate training data. .

原訓練データからの擬似的な訓練データセットを生成する方法としては、データ拡張がある。データ拡張では、原訓練データセットの原サンプル画像（機械学習用の入力画像）に対して、平行移動、鏡像反転、回転、所定の幾何学的変形、若干の色変更、ノイズ付加などを行うことで、新たな訓練データセットを生成している。通常、データ拡張では、原サンプル画像に対して付されているラベルが、そのまま、新たな訓練データセットでも使用される。ただし、機械学習システムの出力が画像（ラベル画像）である場合には、データ拡張では、原訓練データセットの原ラベル画像（機械画像用の出力画像）に、原サンプル画像の平行移動、鏡像反転、回転、および所定の幾何学的変形に合わせて、同様の平行移動、鏡像反転、回転、および所定の幾何学的変形を施した画像が、新たな訓練データセットのラベル画像として使用される。なお、若干の色変更、およびノイズ付加によるデータ拡張の場合、原ラベル画像と同一の画像が新たな訓練データセットのラベル画像として使用される。 As a method of generating a pseudo training data set from the original training data, there is a data extension. In data expansion, translation, mirror image inversion, rotation, predetermined geometric deformation, slight color change, noise addition, etc. are performed on the original sample image (input image for machine learning) of the original training data set Generates a new training dataset. Normally, in data extension, the label attached to the original sample image is used as it is in a new training data set. However, when the output of the machine learning system is an image (label image), in the data extension, the original label image (output image for machine image) of the original training data set is translated and mirror-inverted of the original sample image. , Rotation, and a predetermined geometric transformation, the image subjected to the similar translation, mirror image inversion, rotation, and the predetermined geometric transformation is used as a label image of a new training data set. In addition, in the case of data expansion by slight color change and noise addition, the same image as the original label image is used as a label image of a new training data set.

しかし、データ拡張を使用した場合、原画像の訓練データセットのみに比べて過学習が発生しにくいものの、データ拡張で生成した新たな訓練データセットのサンプル画像およびラベル画像は、上述のような比較的単純な方法で加工されたものであり、実際に取得された原サンプル画像および原ラベル画像の特徴を多く残しているため、データ拡張で多くの訓練データセットを新たに生成したとしても、原サンプル画像および原ラベル画像の少なさに起因して、多くの訓練データセットで機械学習が行われるディープニューラルネットワークのような機械学習システムでは依然として過学習が発生しやすい。 However, when data extension is used, overtraining is less likely to occur than in the case of using only the training data set of the original image, but the sample image and label image of the new training data set generated by data extension are compared as described above. It is processed by a simple method, and retains many features of the original sample images and original label images actually acquired, so even if many training data sets are newly generated by data expansion, the original Due to the small number of sample images and original label images, overlearning is still likely to occur in machine learning systems such as deep neural networks where machine learning is performed on many training data sets.

本発明は、上記の問題に鑑みてなされたもので、短時間で多くの適切な訓練データセットを自動的に生成する訓練データ生成システム、訓練データ生成方法、および訓練データ生成プログラムを得ることを目的とする。 The present invention has been made in view of the above problems, and provides a training data generating system, a training data generating method, and a training data generating program that automatically generate many appropriate training data sets in a short time. Aim.

本発明に係る訓練データ生成システムは、入力画像から出力画像を生成する機械学習システム用の訓練データを自動生成する訓練データ生成システムであり、原サンプル画像に対応する原ラベル画像の類似画像を、擬似ラベル画像として生成する類似画像生成部と、原ラベル画像から原サンプル画像への変換特性に従って擬似ラベル画像から類推される画像を、擬似ラベル画像に対応する擬似サンプル画像として生成する類推画像生成部とを備える。ここで、擬似サンプル画像は、訓練データにおける機械学習システムの入力画像であり、擬似ラベル画像は、訓練データにおける機械学習システムの出力画像である。 The training data generation system according to the present invention is a training data generation system that automatically generates training data for a machine learning system that generates an output image from an input image, and a similar image of an original label image corresponding to an original sample image, A similar image generation unit that generates a pseudo-label image, and an analogy image generation unit that generates an image analogized from the pseudo-label image according to the conversion characteristics from the original label image to the original sample image as a pseudo-sample image corresponding to the pseudo-label image And Here, the pseudo sample image is an input image of the machine learning system in the training data, and the pseudo label image is an output image of the machine learning system in the training data.

本発明に係る訓練データ生成方法は、入力画像から出力画像を生成する機械学習システム用の訓練データを自動生成する訓練データ生成方法であり、原サンプル画像に対応する原ラベル画像の類似画像を、擬似ラベル画像として生成する類似画像生成ステップと、原ラベル画像から原サンプル画像への変換特性に従って擬似ラベル画像から類推される画像を、擬似ラベル画像に対応する擬似サンプル画像として生成する類推画像生成ステップとを備える。ここで、擬似サンプル画像は、訓練データにおける機械学習システムの入力画像であり、擬似ラベル画像は、訓練データにおける機械学習システムの出力画像である。 The training data generation method according to the present invention is a training data generation method for automatically generating training data for a machine learning system that generates an output image from an input image, and a similar image of an original label image corresponding to an original sample image, A similar image generating step of generating a pseudo-label image, and an analogy image generating step of generating an image analogized from the pseudo-label image according to a conversion characteristic from the original label image to the original sample image as a pseudo-sample image corresponding to the pseudo-label image And Here, the pseudo sample image is an input image of the machine learning system in the training data, and the pseudo label image is an output image of the machine learning system in the training data.

本発明に係る訓練データ生成プログラムは、コンピュータに、入力画像から出力画像を生成する機械学習システム用の訓練データを自動生成させる訓練データ生成プログラムであり、コンピュータに、原サンプル画像に対応する原ラベル画像の類似画像を、擬似ラベル画像として生成する類似画像生成ステップと、原ラベル画像から原サンプル画像への変換特性に従って擬似ラベル画像から類推される画像を、擬似ラベル画像に対応する擬似サンプル画像として生成する類推画像生成ステップとを実行させる。ここで、擬似サンプル画像は、訓練データにおける機械学習システムの入力画像であり、擬似ラベル画像は、訓練データにおける機械学習システムの出力画像である。 The training data generation program according to the present invention is a training data generation program that causes a computer to automatically generate training data for a machine learning system that generates an output image from an input image. A similar image generation step of generating a similar image of the image as a pseudo-label image, and an image analogized from the pseudo-label image according to the conversion characteristic from the original label image to the original sample image as a pseudo-sample image corresponding to the pseudo-label image And an analogy image generation step to be generated. Here, the pseudo sample image is an input image of the machine learning system in the training data, and the pseudo label image is an output image of the machine learning system in the training data.

本発明によれば、短時間で多くの適切な訓練データセットが自動的に生成される。 According to the present invention, many suitable training data sets are automatically generated in a short time.

本発明の上記又は他の目的、特徴および優位性は、添付の図面とともに以下の詳細な説明から更に明らかになる。 The above and other objects, features and advantages of the present invention will become more apparent from the following detailed description in conjunction with the accompanying drawings.

図１は、本発明の実施の形態に係る訓練データ生成システムの構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a training data generation system according to an embodiment of the present invention. 図２は、入力画像から出力画像を生成する機械学習システムについて説明する図である。FIG. 2 is a diagram illustrating a machine learning system that generates an output image from an input image. 図３は、図１に示すシステムにおける訓練データ生成方法について説明するフローチャートである。FIG. 3 is a flowchart illustrating a training data generation method in the system shown in FIG. 図４は、図１に示すシステムにおける原訓練データセットと擬似訓練データセットとの対応関係を説明する図である。FIG. 4 is a diagram for explaining the correspondence between the original training data set and the pseudo training data set in the system shown in FIG. 図５は、原サンプル画像および原ラベル画像の具体例を説明する図である。FIG. 5 is a diagram illustrating a specific example of an original sample image and an original label image. 図６は、擬似ラベル画像の生成の具体例を説明する図である。FIG. 6 is a diagram illustrating a specific example of generating a pseudo label image. 図７は、擬似サンプル画像の生成の具体例を説明する図である。FIG. 7 is a diagram illustrating a specific example of generating a pseudo sample image. 図８は、機械学習後の画像生成エンジンの出力画像の具体例を説明する図である。FIG. 8 is a diagram illustrating a specific example of an output image of the image generation engine after machine learning.

以下、図に基づいて本発明の実施の形態を説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the drawings.

図１は、本発明の実施の形態に係る訓練データ生成システムの構成を示すブロック図である。図１に示す訓練データ生成システムは、１台の情報処理装置で構成されているが、後述の各処理部を、互いにデータ通信可能な複数の情報処理装置に分散させてもよい。 FIG. 1 is a block diagram illustrating a configuration of a training data generation system according to an embodiment of the present invention. Although the training data generation system illustrated in FIG. 1 includes one information processing device, each processing unit described below may be distributed to a plurality of information processing devices that can perform data communication with each other.

この実施の形態に係る訓練データ生成システムは、ディープニューラルネットワークなどの画像生成エンジンの学習に使用する訓練データセットに使用可能な複数（多数）の擬似サンプル画像および擬似ラベル画像を、１または複数の原訓練データセット（原サンプル画像と原ラベル画像との対）から自動的に生成する。 The training data generation system according to this embodiment includes a plurality (a large number) of pseudo sample images and a plurality of pseudo label images that can be used for a training data set used for learning of an image generation engine such as a deep neural network. Automatically generated from the original training data set (pair of original sample images and original label images).

図２は、入力画像から出力画像を生成する機械学習システムについて説明する図である。図２に示すように、この機械学習システムは、例えばディープニューラルネットワークなどの画像生成エンジンを使用して、入力画像から、その入力画像に対応する出力画像を自動的に生成するためのシステムである。 FIG. 2 is a diagram illustrating a machine learning system that generates an output image from an input image. As shown in FIG. 2, the machine learning system is a system for automatically generating an output image corresponding to the input image from an input image using an image generation engine such as a deep neural network. .

このようなシステムでは、ユーザーの用意する未知の入力画像から所望の出力画像が自動的に生成されるように、使用開始前に、あるいは、随時、既知の入力画像とその既知の入力画像に対応する出力画像との対である訓練データセットに基づいて、ディープニューラルネットワークなどの画像生成エンジンの学習が実行される。 In such a system, a known input image and its known input image are corresponded before starting use or at any time so that a desired output image is automatically generated from an unknown input image prepared by a user. Learning of an image generation engine such as a deep neural network is performed based on a training data set that is a pair with an output image to be executed.

この実施の形態に係る訓練データ生成システムを使用することで、原訓練データセット（原サンプル画像と原ラベル画像との対）から多数の擬似訓練データセット（擬似サンプル画像と擬似ラベル画像との対）が得られ、それらの原訓練データセットおよび擬似訓練データセットを使用した画像生成エンジンの学習が適切に実行される。 By using the training data generation system according to this embodiment, a large number of pseudo training data sets (pairs of pseudo sample images and pseudo label images) can be converted from original training data sets (pairs of original sample images and original label images). ) Is obtained, and the training of the image generation engine using the original training data set and the pseudo training data set is appropriately performed.

図１に示す訓練データ生成システムは、記憶装置１、通信装置２、画像読取装置３、および演算処理装置４を備える。 The training data generation system shown in FIG. 1 includes a storage device 1, a communication device 2, an image reading device 3, and an arithmetic processing device 4.

記憶装置１は、フラッシュメモリー、ハードディスクなどの不揮発性の記憶装置であって、各種データやプログラムを格納する。 The storage device 1 is a nonvolatile storage device such as a flash memory and a hard disk, and stores various data and programs.

ここでは、記憶装置１には、訓練データ生成プログラム１１が格納されており、また、生成された訓練データが必要に応じて格納される。なお、訓練データ生成プログラム１１は、ＣＤ（Compact Disk）などの可搬性のあるコンピュータ読み取り可能な記録媒体に格納されていてもよい。その場合、例えば、その記録媒体から記憶装置１へ訓練データ生成プログラム１１がインストールされる。また、訓練データ生成プログラム１１は、１つのプログラムでも、複数のプログラムの集合体でもよい。 Here, the storage device 1 stores the training data generation program 11, and the generated training data is stored as needed. The training data generation program 11 may be stored in a portable computer-readable recording medium such as a CD (Compact Disk). In that case, for example, the training data generation program 11 is installed in the storage device 1 from the recording medium. The training data generation program 11 may be a single program or an aggregate of a plurality of programs.

通信装置２は、ネットワークインターフェイス、周辺機器インターフェイス、モデムなどのデータ通信可能な装置であって、必要に応じて、他の装置とデータ通信を行う。画像読取装置３は、入力画像としての原画像を含む写真などの物理媒体の原稿の画像を光学的に読み取り、原画像の画像データを生成する。通信装置２および画像読取装置３は、原訓練データセットの取得に使用される。 The communication device 2 is a device capable of data communication such as a network interface, a peripheral device interface, and a modem, and performs data communication with another device as necessary. The image reading device 3 optically reads an image of a document on a physical medium such as a photograph including an original image as an input image, and generates image data of the original image. The communication device 2 and the image reading device 3 are used for acquiring an original training data set.

演算処理装置４は、ＣＰＵ（Central Processing Unit）、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）などを備えるコンピュータであって、プログラムを、ＲＯＭ、記憶装置１などからＲＡＭにロードしＣＰＵで実行することで、各種処理部として動作する。 The arithmetic processing device 4 is a computer including a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and the like. By executing, it operates as various processing units.

ここで、訓練データ生成プログラム１１を実行することで、演算処理装置４は、原データ取得部２１、類似画像生成部２２、および類推画像生成部２３として動作する。 Here, by executing the training data generation program 11, the arithmetic processing device 4 operates as an original data acquisition unit 21, a similar image generation unit 22, and an analogy image generation unit 23.

原データ取得部２１は、１または複数の原訓練データセットを取得する。１つの原訓練データセットは、入力画像から出力画像を生成する機械学習システム用の訓練データとして使用可能な、１つの原サンプル画像と、その原サンプル画像に対応する原ラベル画像とを含む。原サンプル画像は、機械学習時に入力画像として使用可能な、実際に得られた写真などの画像であり、原ラベル画像は、機械学習時に出力画像として使用可能な、原サンプル画像に対して手作業などで生成された適正なラベル画像である。 The original data acquisition unit 21 acquires one or a plurality of original training data sets. One original training data set includes one original sample image that can be used as training data for a machine learning system that generates an output image from an input image, and an original label image corresponding to the original sample image. The original sample image is an image such as an actually obtained photograph that can be used as an input image during machine learning, and the original label image is a manual image that can be used as an output image during machine learning. This is an appropriate label image generated by the above method.

原データ取得部２１は、記憶装置１に格納されている原訓練データセットを読み出したり、通信装置２を使用して、外部の装置から原訓練データセットを受信したり、画像読取装置３を使用して、画像読取装置３により読み取られた画像を原訓練データセットとして取得したりする。なお、通信装置２および画像読取装置３を使用して取得された原訓練データセットは、記憶装置１に格納されるようにしてもよい。 The original data acquisition unit 21 reads the original training data set stored in the storage device 1, receives the original training data set from an external device using the communication device 2, and uses the image reading device 3. Then, the image read by the image reading device 3 is acquired as an original training data set. The original training data set acquired using the communication device 2 and the image reading device 3 may be stored in the storage device 1.

類似画像生成部２２は、取得された原ラベル画像の類似画像を、擬似ラベル画像として生成する。ここで、擬似ラベル画像は、機械学習時に機械学習システムの出力画像として使用されるものである。 The similar image generation unit 22 generates a similar image of the acquired original label image as a pseudo label image. Here, the pseudo label image is used as an output image of the machine learning system during machine learning.

この実施の形態では、類似画像生成部２２は、所定範囲（例えば−１〜＋１）の値を有する１または複数の乱数値（乱数ベクトル）を入力値とし入力値に対応する画像を生成する画像生成エンジンを備え、複数の原ラベル画像を使用してその画像生成エンジンの機械学習を行い、その画像生成エンジンに別の乱数値を入力したときの出力画像を類似画像（つまり、擬似ラベル画像）とする。例えば、類似画像生成部２２は、ＤＣＧＡＮＳ（Deep Convolutional Generative Adversarial Networks）などのＧＡＮ（Generative Adversarial Network）に基づいて、原ラベル画像の類似画像を、擬似ラベル画像として生成する。この類似画像の生成方法は、通常のデータ拡張（平行移動、鏡像反転、回転変動、幾何学的変形、濃度変更、色の変動、ランダムノイズの付加など）とは異なるものである。つまり、通常のデータ拡張は、何らかの規則に従って原画像を変換したものであるか、原画像を多少不規則に変化させたものであり、データ拡張で得られる画像には、１枚の原画像に固有の規則性がある程度残されるが、この生成方法では、複数枚の原画像（原ラベル画像）の規則性が機械学習で抽出され、類似画像に反映されるため、個々の原画像に固有の規則性が、生成された類似画像に残りにくくなっている。 In this embodiment, the similar image generation unit 22 receives one or a plurality of random numbers (random number vectors) having values within a predetermined range (for example, −1 to +1) as input values and generates an image corresponding to the input value. It has a generation engine, performs machine learning of the image generation engine using a plurality of original label images, and outputs an output image when another random value is input to the image generation engine as a similar image (that is, a pseudo-label image). And For example, the similar image generation unit 22 generates a similar image of the original label image as a pseudo label image based on a GAN (Generative Adversarial Network) such as DCGANS (Deep Convolutional Generative Adversarial Networks). This method of generating a similar image is different from normal data expansion (translation, mirror image inversion, rotation fluctuation, geometric deformation, density change, color fluctuation, addition of random noise, etc.). In other words, the normal data extension is obtained by converting the original image according to some rule or changing the original image somewhat irregularly. The image obtained by the data extension includes one original image. Although a certain degree of unique regularity remains, in this generation method, the regularity of a plurality of original images (original label images) is extracted by machine learning and reflected on similar images. Regularity is less likely to remain in the generated similar image.

類似画像生成部２２は、ＧＡＮで類似ラベル画像を生成する場合、ジェネレータおよびディスクリミネータを備え、ディスクリミネータおよびジェネレータの機械学習を行った後、ジェネレータで、乱数値ベクトルから擬似ラベル画像を生成する。ディスクリミネータは、原ラベル画像とジェネレータにより生成された画像とを互いに区別するニューラルネットワークであり、ジェネレータは、乱数ベクトルから画像を生成するニューラルネットワークである。ディスクリミネータおよびジェネレータの機械学習では、（ａ）原ラベル画像に基づいて、原ラベル画像が原ラベル画像であると識別されるように、ディスクリミネータが機械学習され、（ｂ）任意にサンプリングされた乱数ベクトルに対してジェネレータにより生成された画像について、当該画像が原ラベル画像であると識別されるようにジェネレータが機械学習されるとともに、当該画像がジェネレータにより生成された画像であると識別されるようにジェネレータが機械学習される。 When generating a similar label image by GAN, the similar image generating unit 22 includes a generator and a discriminator. After performing machine learning of the discriminator and the generator, the generator generates a pseudo label image from the random number value vector. I do. The discriminator is a neural network that distinguishes an original label image from an image generated by a generator, and the generator is a neural network that generates an image from a random number vector. In the machine learning of the discriminator and the generator, (a) the discriminator is machine-learned based on the original label image so that the original label image is identified as the original label image, and (b) arbitrarily sampled. The generator is machine-learned such that the image is identified as the original label image for the image generated by the generator for the random number vector, and the image is identified as the image generated by the generator. The generator is machine-learned as is done.

なお、類似画像生成部２２は、原ラベル画像で機械学習される別の画像生成エンジンで、原ラベル画像の類似画像を擬似ラベル画像として生成するようにしてもよい。 Note that the similar image generation unit 22 may generate a similar image of the original label image as a pseudo label image by another image generation engine that performs machine learning on the original label image.

また、類似画像生成部２２は、生成した擬似ラベル画像に対して、ノイズ除去処理を行うようにしてもよい。そのノイズ除去処理は、例えば、Non-local Means Denoising algorithmなどに従って実行される。 Further, the similar image generation unit 22 may perform a noise removal process on the generated pseudo label image. The noise removal processing is executed according to, for example, a Non-local Means Denoising algorithm.

さらに、必要に応じて、類似画像生成部２２は、生成した擬似ラベル画像に対して、通常のデータ拡張（平行移動、鏡像反転、回転変動、幾何学的変形、濃度変更、色の変動、ランダムノイズの付加など）を行って、擬似ラベル画像の数を増加させるようにしてもよい。 Further, if necessary, the similar image generation unit 22 performs normal data expansion (parallel movement, mirror image inversion, rotation fluctuation, geometric deformation, density change, color fluctuation, random fluctuation) on the generated pseudo label image. Noise, etc.) to increase the number of pseudo-label images.

類推画像生成部２３は、原ラベル画像から原サンプル画像への変換特性に従って擬似ラベル画像から類推される画像（類推画像）を、その擬似ラベル画像に対応する擬似サンプル画像として生成する。ここで、擬似サンプル画像は、機械学習時に機械学習システムの入力画像として使用されるものである。 The analogy image generation unit 23 generates an image (analogous image) analogized from the pseudo label image according to the conversion characteristic from the original label image to the original sample image as a pseudo sample image corresponding to the pseudo label image. Here, the pseudo sample image is used as an input image of the machine learning system during machine learning.

例えば、類推画像生成部２３は、擬似ラベル画像の各画素を注目画素として、注目画素の周辺領域（注目画素を含む所定サイズのウィンドウ）の画素値分布に最も近い画素値分布を周辺領域に有する画素を原ラベル画像内で探索し、発見した画素の位置と同一の位置の原サンプル画像の画素値を、擬似サンプル画像において同一位置の画素にセットすることで、擬似サンプル画像を生成する。 For example, the analogy image generation unit 23 sets each pixel of the pseudo-label image as a target pixel, and has, in the peripheral region, a pixel value distribution closest to a pixel value distribution of a peripheral region of the target pixel (a window of a predetermined size including the target pixel). A pseudo sample image is generated by searching for a pixel in the original label image, and setting the pixel value of the original sample image at the same position as the position of the found pixel to the pixel at the same position in the pseudo sample image.

例えば、類推画像生成部２３は、イメージアナロジー法（Image analogies）に基づいて、原ラベル画像から原サンプル画像への変換特性に従って、擬似ラベル画像から類推される画像を、擬似サンプル画像として生成する。 For example, based on an image analogy method (Image analogies), the analogy image generation unit 23 generates an image analogized from a pseudo label image as a pseudo sample image according to a conversion characteristic from an original label image to an original sample image.

原訓練データセットが複数ある場合、類推画像生成部２３は、複数の原訓練データセットから、１つの原訓練データセットを選択し、選択した原訓練データセットにおける原ラベル画像から原サンプル画像への変換特性に従って、擬似ラベル画像から類推される画像を、擬似サンプル画像として生成する。その際、原訓練データセットは、例えばランダムに、あるいは、所定の規則に従って選択される。 When there are a plurality of original training data sets, the analogy image generation unit 23 selects one original training data set from the plurality of original training data sets, and converts the original label image to the original sample image in the selected original training data set. An image inferred from the pseudo label image is generated as a pseudo sample image according to the conversion characteristics. At this time, the original training data set is selected, for example, at random or according to a predetermined rule.

また、原訓練データセットが複数ある場合、類推画像生成部２３は、複数の原訓練データセットのそれぞれについて、選択した原訓練データセットにおける原ラベル画像から原サンプル画像への変換特性に従って、擬似ラベル画像から類推される画像を、擬似サンプル画像候補として生成し、複数の擬似サンプル画像候補から１つの擬似サンプル画像を選択したり、複数の擬似サンプル画像候補に対して所定の画像処理を行って１つの擬似サンプル画像を生成するようにしてもよい。複数の擬似サンプル画像候補から１つの擬似サンプル画像を選択する場合、例えばランダムに、あるいは、所定の規則に従って擬似サンプル画像が選択される。 When there are a plurality of original training data sets, the analogy image generation unit 23 generates a pseudo label for each of the plurality of original training data sets according to the conversion characteristic from the original label image to the original sample image in the selected original training data set. An image inferred from the image is generated as a pseudo sample image candidate, and one pseudo sample image is selected from the plurality of pseudo sample image candidates, or a predetermined image process is performed on the plurality of pseudo sample image candidates to perform one image processing. One pseudo sample image may be generated. When one pseudo sample image is selected from a plurality of pseudo sample image candidates, the pseudo sample image is selected, for example, randomly or according to a predetermined rule.

次に、上記システムの動作について説明する。図３は、図１に示すシステムにおける訓練データ生成方法について説明するフローチャートである。図４は、図１に示すシステムにおける原訓練データセットと擬似訓練データセットとの対応関係を説明する図である。 Next, the operation of the above system will be described. FIG. 3 is a flowchart illustrating a training data generation method in the system shown in FIG. FIG. 4 is a diagram for explaining the correspondence between the original training data set and the pseudo training data set in the system shown in FIG.

まず、原データ取得部２１が、上述のようにして１または複数の原訓練データセット（互いに対応する原サンプル画像と原ラベル画像との対）を取得する（ステップＳ１）。 First, the original data acquisition unit 21 acquires one or a plurality of original training data sets (pairs of original sample images and original label images corresponding to each other) as described above (step S1).

その後、類似画像生成部２２は、取得された原訓練データセットにおける原ラベル画像から、上述のようにして、複数（例えば数十万〜数千万）の擬似ラベル画像を生成する（ステップＳ２）。例えば、類似画像生成部２２は、ＧＡＮを使用する場合、互いに異なる複数の乱数ベクトルを生成し、その乱数ベクトルから、乱数ベクトルと同数の擬似ラベル画像を得る。 Thereafter, the similar image generation unit 22 generates a plurality of (for example, hundreds of thousands to tens of millions) pseudo-label images from the original label images in the acquired original training data set as described above (step S2). . For example, when GAN is used, the similar image generation unit 22 generates a plurality of different random number vectors and obtains the same number of pseudo-label images as the random number vectors from the random number vectors.

そして、類推画像生成部２３は、取得された原訓練データセットにおける原サンプル画像および原ラベル画像の対から、上述のようにして、各擬似ラベル画像に対応する擬似サンプル画像を生成する（ステップＳ３）。平行移動などの従来のデータ拡張では、原サンプル画像を加工して、擬似的なサンプル画像を取得しているが、本実施の形態においては、原サンプル画像および原ラベル画像の両方の関係性に基づいて、擬似ラベル画像から、擬似サンプル画像が生成される。 Then, the analogy image generation unit 23 generates a pseudo sample image corresponding to each pseudo label image from the pair of the original sample image and the original label image in the acquired original training data set as described above (step S3). ). In conventional data expansion such as translation, an original sample image is processed to obtain a pseudo sample image, but in the present embodiment, the relationship between both the original sample image and the original label image is Based on the pseudo label image, a pseudo sample image is generated.

これにより、互いに対応する擬似サンプル画像および擬似ラベル画像の対（つまり、擬似訓練データセット）が多数生成され、その後、例えば、記憶装置１に格納されたり、通信装置２で送信されたりする。そして、擬似訓練データセット（あるいは、原訓練データセットおよび擬似訓練データセット）に基づいて、多数の訓練データセットに基づく、ディープニューラルネットワークなどの画像生成エンジンの機械学習が行われる。 Thereby, a large number of pairs of the pseudo sample image and the pseudo label image (that is, the pseudo training data set) corresponding to each other are generated, and thereafter, for example, stored in the storage device 1 or transmitted by the communication device 2. Then, based on the pseudo training data set (or the original training data set and the pseudo training data set), machine learning of an image generation engine such as a deep neural network based on a large number of training data sets is performed.

ここで、原サンプル画像を細胞組織画像とし、原ラベル画像を細胞組織画像内の細胞の境界線を示す画像とした場合の具体例について説明する。 Here, a specific example will be described in which the original sample image is a cell tissue image and the original label image is an image showing a boundary line of cells in the cell tissue image.

図５は、原サンプル画像および原ラベル画像の具体例を説明する図である。 FIG. 5 is a diagram illustrating a specific example of an original sample image and an original label image.

図５（Ａ）は、１２枚の原サンプル画像としての細胞組織画像（グレースケール画像）を示しており、図５（Ｂ）は、１２枚の原ラベル画像としての、図５（Ａ）に示す細胞組織画像にそれぞれ対応する細胞境界をエッジ（黒白の境界）で示している２値画像である。 FIG. 5A shows a cell tissue image (gray scale image) as 12 original sample images, and FIG. 5B shows 12 original label images in FIG. 5 is a binary image in which cell boundaries respectively corresponding to the cell tissue images shown are indicated by edges (black-white boundaries).

このような、原ラベル画像は、原サンプル画像から手作業などで作成される。通常、多数（例えば１０万枚）の原サンプル画像を入手することは困難であるとともに、原サンプル画像から原ラベル画像を作成する手間もかかるため、原訓練データセットのみで、有効な機械学習を行えるような数の訓練データセットを用意することは困難である。 Such an original label image is created manually or the like from an original sample image. Usually, it is difficult to obtain a large number (for example, 100,000) of original sample images, and it also takes time to create an original label image from the original sample images. It is difficult to prepare as many training data sets as possible.

図６は、擬似ラベル画像の生成の具体例を説明する図である。 FIG. 6 is a diagram illustrating a specific example of generating a pseudo label image.

図６（Ａ）は、図５（Ｂ）に示した１２枚の原ラベル画像を示している。図６（Ｂ）は、図６（Ａ）に示す１２枚の原ラベル画像から生成された６枚の擬似ラベル画像を示している。なお、ここでは、６枚の擬似ラベル画像を示しているが、上述のように、類似画像生成処理においては、画像生成エンジンの入力値（上述の乱数、つまり実数値）を変更することで、上限数なく多数の擬似ラベル画像が生成可能であるので、例えば、互いに異なる１０万枚、１０００万枚などの多数の擬似ラベル画像が生成可能である。図６（Ｃ）は、図６（Ｂ）に示す６枚の擬似ラベル画像に対してNon-local Means Denoising algorithmに従ってノイズ除去処理を施したものである。なお、ノイズ除去処理は必要に応じて適用すればよく、別のアルゴリズムを使用してもよい。 FIG. 6A shows the 12 original label images shown in FIG. 5B. FIG. 6B shows six pseudo-label images generated from the twelve original label images shown in FIG. 6A. Although six pseudo-label images are shown here, as described above, in the similar image generation processing, the input value of the image generation engine (the above-described random number, that is, the real value) is changed. Since a large number of pseudo-label images can be generated without an upper limit, for example, a large number of pseudo-label images such as 100,000 sheets and 10 million sheets different from each other can be generated. FIG. 6C shows the result of performing noise removal processing on the six pseudo-label images shown in FIG. 6B according to the Non-local Means Denoising algorithm. The noise removal processing may be applied as needed, and another algorithm may be used.

図７は、擬似サンプル画像の生成の具体例を説明する図である。図５に示す原訓練データセットのうちの１つに基づいて、図６（Ｃ）に示す擬似ラベル画像から、図７に示すような、擬似サンプル画像が得られた。 FIG. 7 is a diagram illustrating a specific example of generating a pseudo sample image. A pseudo sample image as shown in FIG. 7 was obtained from the pseudo label image shown in FIG. 6 (C) based on one of the original training data sets shown in FIG.

次に、このようにして生成された擬似訓練データセットを使用した機械学習後の画像生成エンジンの出力画像の具体例について説明する。図８は、機械学習後の画像生成エンジンの出力画像の具体例を説明する図である。 Next, a specific example of an output image of the image generation engine after machine learning using the pseudo training data set generated as described above will be described. FIG. 8 is a diagram illustrating a specific example of an output image of the image generation engine after machine learning.

図８（Ａ）は、テスト用細胞組織画像（つまり、訓練データセットには含まれていないもの）を示している。 FIG. 8A shows a test cell tissue image (that is, one not included in the training data set).

図８（Ｂ）は、３つの原訓練データセットに基づき生成された、上述のテスト用細胞組織画像に対応するラベル画像を示している。具体的には、３つの原訓練データセットに対して通常のデータ拡張を行って得られた訓練データセットで機械学習した画像生成エンジンで、図８（Ａ）に示す細胞組織画像に対して、図８（Ｂ）に示すラベル画像が生成された。 FIG. 8B shows a label image generated based on the three original training data sets and corresponding to the above-described test cell tissue image. Specifically, an image generation engine machine-learned with a training data set obtained by performing normal data expansion on three original training data sets, and using the cell tissue image shown in FIG. The label image shown in FIG. 8B was generated.

図８（Ｃ）は、１２個の原訓練データセットに基づき生成された、上述のテスト用細胞組織画像に対応するラベル画像を示している。具体的には、１２個の原訓練データセットに対して通常のデータ拡張を行って得られた訓練データセットで機械学習した画像生成エンジンで、図８（Ａ）に示す細胞組織画像に対して、図８（Ｃ）に示すラベル画像が生成された。 FIG. 8C shows a label image generated based on the 12 original training data sets and corresponding to the above-described test cell tissue image. More specifically, an image generation engine machine-learned using a training data set obtained by performing normal data extension on 12 original training data sets, and using the cell tissue image shown in FIG. The label image shown in FIG. 8C was generated.

図８（Ｄ）は、１２個の原訓練データセットおよび１２個の擬似訓練データセットに基づき生成された、上述のテスト用細胞組織画像に対応するラベル画像を示している。具体的には、これらの訓練データセットに対して通常のデータ拡張を行って得られた訓練データセットで機械学習した画像生成エンジンで、図８（Ａ）に示す細胞組織画像に対して、図８（Ｄ）に示すラベル画像が生成された。 FIG. 8D shows a label image corresponding to the above-described test cell tissue image generated based on the 12 original training data sets and the 12 pseudo training data sets. More specifically, an image generation engine machine-learned with a training data set obtained by performing normal data extension on these training data sets uses the image generation engine shown in FIG. The label image shown in FIG. 8 (D) was generated.

ラベル画像内での個々の細胞に対応する領域の分離状況などから、図８（Ｂ）および図８（Ｃ）に比べ、図８（Ｄ）に示すラベル画像が適切であることは明らかである。このように、通常のデータ拡張より、この実施の形態に係る訓練データ生成方法により訓練データを生成したほうが、適切なラベル画像が得られる。 From the situation of separation of regions corresponding to individual cells in the label image, it is clear that the label image shown in FIG. 8D is more appropriate than those in FIGS. 8B and 8C. . As described above, an appropriate label image can be obtained by generating the training data by the training data generating method according to the present embodiment rather than the normal data extension.

以上のように、上記実施の形態によれば、類似画像生成部２２は、原サンプル画像に対応する原ラベル画像の類似画像を、擬似ラベル画像として生成する。類推画像生成部２３は、原ラベル画像から原サンプル画像への変換特性に従って擬似ラベル画像から類推される画像を、擬似ラベル画像に対応する擬似サンプル画像として生成する。ここで、擬似サンプル画像は、訓練データにおける機械学習システムの入力画像であり、擬似ラベル画像は、訓練データにおける機械学習システムの出力画像である。 As described above, according to the above embodiment, the similar image generation unit 22 generates a similar image of the original label image corresponding to the original sample image as a pseudo label image. The analogy image generation unit 23 generates an image analogized from the pseudo label image according to the conversion characteristic from the original label image to the original sample image as a pseudo sample image corresponding to the pseudo label image. Here, the pseudo sample image is an input image of the machine learning system in the training data, and the pseudo label image is an output image of the machine learning system in the training data.

これにより、原訓練データセットの原サンプル画像および原ラベル画像とはそれぞれ異なる新規な擬似サンプル画像および擬似ラベル画像の訓練データセットが自動的に多数生成できる。つまり、短時間で多くの適切な訓練データセットが自動的に生成される。 Thus, a large number of training data sets of new pseudo sample images and pseudo label images different from the original sample images and original label images of the original training data set can be automatically generated. That is, many appropriate training data sets are automatically generated in a short time.

なお、上述の実施の形態に対する様々な変更および修正については、当業者には明らかである。そのような変更および修正は、その主題の趣旨および範囲から離れることなく、かつ、意図された利点を弱めることなく行われてもよい。つまり、そのような変更および修正が請求の範囲に含まれることを意図している。 Various changes and modifications to the above-described embodiment will be apparent to those skilled in the art. Such changes and modifications may be made without departing from the spirit and scope of the subject matter and without diminishing its intended advantages. That is, such changes and modifications are intended to be included in the scope of the claims.

例えば、上記実施の形態において、ラベル画像は、境界を示すものに限らず、特定の細胞（癌細胞など）を示すものとしてもよい。また、上記実施の形態において、原サンプル画像は、細胞組織画像以外の生体画像でもよいし、その他の撮影画像でもよい。 For example, in the above embodiment, the label image is not limited to an image indicating a boundary, and may indicate a specific cell (such as a cancer cell). In the above embodiment, the original sample image may be a biological image other than the cell tissue image, or may be another captured image.

また、上記実施の形態において、上述の画像生成エンジンは、ディープニューラルネットワーク（つまり、隠れ層が２以上の階層型ニューラルネットワーク）に限定されない。 Further, in the above embodiment, the above-described image generation engine is not limited to a deep neural network (that is, a hierarchical neural network having two or more hidden layers).

本発明は、例えば、機械学習システム用の訓練データの自動生成に適用可能である。 The present invention is applicable to, for example, automatic generation of training data for a machine learning system.

１１訓練データ生成プログラム
２２類似画像生成部
２３類推画像生成部 11 Training Data Generation Program 22 Similar Image Generation Unit 23 Analogue Image Generation Unit

Claims

In a training data generation system that automatically generates training data for a machine learning system that generates an output image from an input image,
A similar image generation unit that generates a similar image of the original label image corresponding to the original sample image as a pseudo label image,
An analogy image generation unit that generates an image inferred from the pseudo label image according to the conversion characteristics from the original label image to the original sample image as a pseudo sample image corresponding to the pseudo label image,
The pseudo sample image is an input image of the machine learning system in the training data, the pseudo label image is an output image of the machine learning system in the training data,
A training data generation system characterized by the following.

The similar image generation unit includes an image generation engine configured to generate an output image corresponding to the input value using one or more random number values as an input value, and machine learning of the image generation engine using a plurality of the original label images. 2. The training data according to claim 1, wherein an output image obtained when another random value is input to the image generation engine is acquired as a similar image to the original label image, and is used as the pseudo label image. Generation system.

The analogy image generation unit sets each pixel of the pseudo label image as a target pixel, and sets a pixel having a pixel value distribution closest to a pixel value distribution of a peripheral region having a predetermined size for the target pixel in the peripheral region to the original label image. Searching within, setting the pixel value of the original sample image at the same position as the position of the found pixel at the pixel at the same position in the pseudo sample image to generate the pseudo sample image. The training data generation system according to claim 1 or 2, wherein the training data is generated.

The original sample image is a cell tissue image,
The original label image includes an image showing a boundary line of cells in the cell tissue image,
The training data generation system according to any one of claims 1 to 3, characterized in that:

In a training data generation method for automatically generating training data for a machine learning system that generates an output image from an input image,
Generating a similar image of the original label image corresponding to the original sample image as a pseudo-label image;
An analogy image generating step of generating an image inferred from the pseudo label image according to the conversion characteristics from the original label image to the original sample image as a pseudo sample image corresponding to the pseudo label image,
The pseudo sample image is an input image of the machine learning system in the training data, the pseudo label image is an output image of the machine learning system in the training data,
A training data generation method characterized by the following.

A training data generation program that causes a computer to automatically generate training data for a machine learning system that generates an output image from an input image,
On the computer,
Generating a similar image of the original label image corresponding to the original sample image as a pseudo-label image;
An analogy image generation step of generating an image inferred from the pseudo label image according to the conversion characteristics from the original label image to the original sample image as a pseudo sample image corresponding to the pseudo label image,
The pseudo sample image is an input image of the machine learning system in the training data, the pseudo label image is an output image of the machine learning system in the training data,
A training data generation program characterized by the following.