JP6516531B2

JP6516531B2 - Clustering device and machine learning device

Info

Publication number: JP6516531B2
Application number: JP2015069975A
Authority: JP
Inventors: 健太西行; 藤吉　弘亘; 弘亘藤吉
Original assignee: MegaChips Corp
Current assignee: MegaChips Corp
Priority date: 2015-03-30
Filing date: 2015-03-30
Publication date: 2019-05-22
Anticipated expiration: 2035-03-30
Also published as: JP2016191966A

Description

本発明は、転移学習を導入した機械学習で用いられるクラスタリング装置及び機械学習装置に関する。 The present invention relates to a clustering apparatus and a machine learning apparatus used in machine learning in which transfer learning is introduced.

画像データから人物を検出する処理や、センサによる計測データの解析処理などに機械学習が用いられている。 Machine learning is used for processing of detecting a person from image data and analysis of measurement data by a sensor.

例えば、監視カメラにより撮影された画像から人物を検出する場合、人物の特徴を学習することにより生成された識別用特徴データが用いられる。具体的には、機械学習装置は、人物が撮影された複数の画像（複数の学習サンプル）を用いて人物の特徴を学習し、学習結果を反映した識別用特徴データを生成する。人物検出装置は、機械学習装置により生成された識別用特徴データを用いて、監視カメラにより撮影された画像から人物を検出する。 For example, when detecting a person from an image captured by a surveillance camera, identification feature data generated by learning the characteristics of the person is used. Specifically, the machine learning apparatus learns the characteristics of a person using a plurality of images (a plurality of learning samples) in which the person is photographed, and generates identification feature data reflecting the learning result. The person detection device detects a person from the image captured by the surveillance camera using the identification feature data generated by the machine learning device.

監視カメラの設置環境が学習サンプルを収集する環境と異なる場合、監視カメラにより撮影される人物の見え方は、学習サンプルにおける人物の見え方と異なる。つまり、監視カメラにより撮影される人物の特徴が、学習サンプルに含まれる人物の特徴と異なる。従って、監視カメラにより生成された画像から人物を検出するにあたって学習サンプルから生成された識別用特徴データを用いた場合、人物の検出精度が低下する。人物の検出精度を向上させる場合、カメラの設置環境に合わせて、膨大な数の学習サンプルを準備しなければならず、コストが増大する。 When the installation environment of the surveillance camera is different from the environment in which the learning sample is collected, the appearance of the person photographed by the surveillance camera is different from the appearance of the person in the learning sample. That is, the feature of the person photographed by the surveillance camera is different from the feature of the person included in the learning sample. Therefore, when the identification feature data generated from the learning sample is used to detect a person from the image generated by the surveillance camera, the detection accuracy of the person decreases. In order to improve the detection accuracy of a person, a huge number of learning samples must be prepared according to the installation environment of the camera, which increases the cost.

そこで、転移学習を導入した機械学習の手法が提案されている。転移学習は、学習サンプルの収集環境と異なる環境から得られたサンプルを事前に学習し、事前学習により得られた検出対象の特徴を、学習サンプルの学習結果に適用（転移）させる手法である。転移学習は、学習サンプルの数を抑制することができるため、識別用特徴データを生成するためのコストを低減することができる。 Therefore, a method of machine learning that introduces transfer learning has been proposed. Transfer learning is a method of learning in advance a sample obtained from an environment different from the collection environment of learning samples, and applying (transferring) the feature of the detection target obtained by the prior learning to the learning result of the learning sample. Since transfer learning can suppress the number of learning samples, the cost for generating identification feature data can be reduced.

転移学習において、事前に学習されるサンプルの集合は、事前ドメインと呼ばれる。事前ドメインの学習結果が転移される対象は、目標ドメインと呼ばれる。監視カメラにより撮影された画像から人物を検出する場合は、目標ドメインは、監視カメラの設置環境に合わせて生成された学習サンプルの集合である。事前ドメインは、監視カメラの設置環境と異なる環境で生成された学習サンプルの集合である。 In transfer learning, a set of samples that are learned in advance is called a prior domain. The target to which the learning result of the prior domain is transferred is called the target domain. When detecting a person from an image captured by a surveillance camera, the target domain is a set of learning samples generated in accordance with the installation environment of the surveillance camera. The pre-domain is a set of learning samples generated in an environment different from the installation environment of the surveillance camera.

Xiaoxiao Shi, Wei Fan, Jiangtao Ren, ”Actively Transfer Domain Knowledge”, [online], [平成27年3月1日検索],インターネット<URL: www.cs.columbia.edu/~wfan/PAPERS/ecml08transfer.pdf>Xiaoxiao Shi, Wei Fan, Jiangtao Ren, "Actively Transfer Domain Knowledge", [online], [search on March 1, 2015], Internet <URL: www.cs.columbia.edu/~wfan/PAPERS/ecml08transfer. pdf>

転移学習を用いた場合、負の転移と呼ばれる現象が起こることが知られている。負の転移とは、転移学習のために事前に学習する事前ドメインが目標ドメインに含まれるデータと大きく異なるデータを含んでいた場合に、学習の精度が低下する現象である。このため、転移学習を導入した機械学習を実行する前に、転移学習に有効な事前ドメインを特定し、特定した事前ドメインのみを機械学習に用いることが望ましい。 It is known that, when using transfer learning, a phenomenon called negative transfer occurs. Negative transfer is a phenomenon in which the accuracy of learning decreases when the pre-learned in advance for transfer learning includes data significantly different from the data included in the target domain. For this reason, before performing machine learning in which transfer learning is introduced, it is desirable to specify an advance domain effective for transfer learning, and use only the specified advance domain for machine learning.

非特許文献１には、事前ドメインが転移学習に有効であるか否かを判断する方法が開示されている。具体的には、非特許文献１に係る方法は、事前ドメインのみを用いて学習した識別器（事前識別器）と、事前ドメインと目標ドメインとを用いた転移学習を行った識別器（転移識別器）とにサンプルデータをそれぞれ入力する。サンプルデータに対する事前識別器による識別結果が転移識別器による識別結果と同じである場合、この事前ドメインは、転移学習に有効であると判断される。 Non-Patent Document 1 discloses a method of determining whether a prior domain is effective for transfer learning. Specifically, in the method according to Non-Patent Document 1, a classifier (pre-classifier) learned using only the pre-domain, and a classifier that performed transition learning using the pre-domain and the target domain Input the sample data to the If the discrimination result by the pre-discriminator for the sample data is the same as the discrimination result by the metastasis discriminator, this pre-domain is judged to be effective for metastasis learning.

この結果、非特許文献１に開示されている方法において、転移学習に有効でないと判断された事前ドメインは、転移学習を導入した機械学習に用いられない。転移学習に導入される予定の事前ドメインの数が１つであり、この事前ドメインが転移学習に有効でないと判断された場合、転移学習を導入した機械学習を実行することができない。 As a result, in the method disclosed in Non-Patent Document 1, the pre-domain determined not to be effective for transfer learning is not used for machine learning in which transfer learning is introduced. If the number of pre-domains to be introduced into transfer learning is one and it is determined that this pre-domain is not effective for transfer learning, machine learning with transfer learning can not be performed.

従って、事前ドメインが転移学習に有効か否かを判断する場合、複数の事前ドメインを予め準備しておくことが望ましい。しかし、収集されたサンプルを人間が１つずつ確認して、複数の事前ドメインを分類する方法は、現実的でない。また、収集されたデータから複数の事前ドメインを効率的に作成する技術は開発されていない。 Therefore, when determining whether a preliminary domain is effective for transfer learning, it is desirable to prepare a plurality of preliminary domains in advance. However, a method in which a human confirms the collected samples one by one and classifies a plurality of prior domains is not realistic. Also, no technology has been developed to efficiently create multiple pre-domains from collected data.

本発明は、上記問題点に鑑み、事前ドメインを作成するために収集された複数のデータから、複数の事前ドメインを効率的に作成する技術を提供することを課題とする。 An object of the present invention is to provide a technique for efficiently creating a plurality of preliminary domains from a plurality of data collected to create preliminary domains in view of the above-mentioned problems.

上記課題を解決するため、請求項１に係る発明は、クラスタリング装置であって、転移学習を導入した機械学習に用いられる複数の転移候補データの各々から特徴を抽出して複数の転移候補特徴データを生成するクラスタリング用特徴抽出部と、前記クラスタリング用特徴抽出部により生成された複数の転移候補特徴データの各々が有する特徴に基づいて、各転移候補特徴データを第１グループ及び第２グループを含む複数のグループに分類する分類部と、前記分類部により前記第１グループに分類された転移候補特徴データの数が所定の分類継続基準値以下である場合、前記第１グループを前記機械学習に用いられる事前ドメインに決定し、前記転移候補特徴データの数が前記分類継続基準値よりも大きい場合、前記第１グループに分類された転移候補特徴データをさらに分類することを決定する事前ドメイン決定部と、を備える。 In order to solve the above problems, the invention according to claim 1 is a clustering device, and a feature is extracted from each of a plurality of transfer candidate data used for machine learning in which transfer learning is introduced, and a plurality of transfer candidate feature data Each transition candidate feature data includes a first group and a second group based on the features of each of the plurality of transition candidate feature data generated by the clustering feature extraction unit for generating The first group is used for the machine learning when the classification unit for classifying into a plurality of groups and the number of transfer candidate feature data classified into the first group by the classification unit is equal to or less than a predetermined classification continuation reference value Prior to the first group, and if the number of transfer candidate feature data is greater than the classification continuation reference value, the first group is classified. It comprises a pre-domain determining unit that determines that further classify transition candidate feature data, the.

請求項２記載の発明は、請求項１に記載のクラスタリング装置であって、事前ドメイン決定部は、前記第１グループに分類された転移候補特徴データの数が所定の破棄基準値よりも小さい場合、前記第１グループを事前ドメインから除外する。 The invention according to claim 2 is the clustering apparatus according to claim 1, wherein the prior domain determination unit determines that the number of transfer candidate feature data classified into the first group is smaller than a predetermined discarding reference value. , Excluding the first group from the pre-domain.

請求項３記載の発明は、請求項１又は２に記載のクラスタリング装置であって、さらに、前記第１グループに分類された転移候補特徴データの各々が有する特徴量に基づいて、前記第１グループに分類された転移候補特徴データの分散を計算する分散計算部、を備え、前記事前ドメイン決定部は、前記第１グループに分類された転移候補特徴データの数が前記分類継続基準値よりも大きい場合、前記分散計算部により計算された分散を所定の分散基準値と比較し、前記分散計算部により計算された分散が前記分散基準値以下である場合、前記第１グループを事前ドメインに決定する。 The invention according to claim 3 is the clustering apparatus according to claim 1 or 2, wherein the first group is further based on the feature value of each of the transfer candidate feature data classified into the first group. Calculating the variance of the transfer candidate feature data classified into two, wherein the prior domain determination unit determines that the number of transfer candidate feature data classified into the first group is higher than the classification continuation reference value If larger, the variance calculated by the variance calculator is compared with a predetermined variance reference value, and if the variance calculated by the variance calculator is less than or equal to the variance reference value, the first group is determined to be a pre-domain Do.

請求項４記載の発明は、請求項１ないし３のいずれかに記載のクラスタリング装置であって、前記分類部は、前記第１グループに分類された転移候補特徴データの数が所定の変更基準値よりも大きい場合、前記第１グループに分類された転移候補特徴データを第１の数の下位グループにさらに分類し、前記分類部は、前記第１グループに分類された転移候補特徴データの数が前記変更基準値以下である場合、前記第１グループに分類された転移候補特徴データを前記第１の数よりも小さい第２の数の下位グループに分類する。 The invention according to claim 4 is the clustering apparatus according to any one of claims 1 to 3, wherein the classification unit is configured to change the number of transfer candidate feature data classified into the first group to a predetermined change reference value If larger, the transition candidate feature data classified into the first group is further classified into a first number of lower groups, and the classification unit determines that the number of the transition candidate feature data classified into the first group is If it is equal to or less than the change reference value, transfer candidate feature data classified into the first group is classified into a second number of lower groups smaller than the first number.

請求項５記載の発明は、請求項１ないし５のいずれかに記載のクラスタリング装置であって、前記分類継続基準値が、前記クラスタリング用特徴抽出部により抽出される転移候補特徴データの次元数に基づいて決定される。 The invention according to claim 5 is the clustering apparatus according to any one of claims 1 to 5, wherein the classification continuation reference value corresponds to the number of dimensions of transfer candidate feature data extracted by the clustering feature extraction unit. It is decided based on.

請求項６記載の発明は、転移学習を導入した機械学習を実行して検出対象を学習する機械学習装置であって、前記機械学習に用いられる複数の転移候補データを分類して前記機械学習に用いられる事前ドメインを生成するクラスタリング装置と、前記クラスタリング装置により生成された事前ドメインが前記機械学習に有効であるか否かを評価する事前ドメイン評価装置と、を備え、前記クラスタリング装置は、前記複数の転移候補データの各々から特徴を抽出して複数の転移候補特徴データを生成するクラスタリング用特徴抽出部と、前記クラスタリング用特徴抽出部により生成された複数の転移候補特徴データの各々が有する特徴に基づいて、各転移候補特徴データを第１グループ及び第２グループを含む複数のグループに分類する分類部と、前記分類部により前記第１グループに分類された転移候補特徴データの数が所定の分類継続基準値以下である場合、前記第１グループを前記機械学習に用いられる事前ドメインに決定し、前記転移候補特徴データの数が前記分類継続基準値よりも大きい場合、前記第１グループに分類された転移候補特徴データをさらに分類することを決定する事前ドメイン決定部と、を備え、前記事前ドメイン評価装置は、前記事前ドメイン決定部により前記第１グループが前記事前ドメインに決定された場合、前記第１グループに含まれる転移候補特徴データと、各々が所定の条件下における検出対象の特徴を有する学習用データを含む目標ドメインとを用いて前記機械学習を実行して、前記事前ドメインを評価するための評価用識別器を生成する試行転移学習部と、前記試行転移学習部により生成された試行転移識別部に基づいて、前記第１グループが前記機械学習に有効であるか否かを判断する判断部と、を備える。 The invention according to claim 6 is a machine learning device that executes machine learning in which transfer learning is introduced to learn a detection target, and a plurality of transfer candidate data used for the machine learning are classified and the machine learning is performed. The clustering apparatus includes: a clustering apparatus that generates a pre-domain to be used; and a pre-domain evaluation apparatus that evaluates whether or not the pre-domain generated by the clustering apparatus is effective for the machine learning; A feature extraction unit for clustering which extracts features from each of the transfer candidate data of each to generate a plurality of transfer candidate feature data, and a feature possessed by each of a plurality of transfer candidate feature data generated by the clustering feature extraction unit Classification based on each transition candidate feature data into a plurality of groups including a first group and a second group And, if the number of transfer candidate feature data classified into the first group by the classification unit is equal to or less than a predetermined classification continuation reference value, the first group is determined to be a pre-domain used for the machine learning, A pre-domain determination unit that determines to further classify the transition candidate feature data classified into the first group if the number of metastasis candidate feature data is larger than the classification continuation reference value; The evaluation apparatus, when the first group is determined to be the prior domain by the prior domain determination unit, transfer candidate feature data included in the first group, and a feature to be detected under each of the predetermined conditions Perform the machine learning using a target domain including learning data having a to generate an evaluation discriminator for evaluating the pre-domain A row transfer learning unit, based on the trial transition identification unit generated by the trial transfer learning unit, and a determination portion that the first group to determine whether it is effective in the machine learning.

請求項７記載の発明は、請求項６に記載の機械学習装置であって、前記事前ドメイン評価装置は、さらに、前記目標ドメインに含まれる学習用データの各々が有する特徴を抽出して、学習用特徴データを生成する学習用特徴抽出部、を備え、前記試行転移学習部は、前記学習用特徴データを用いて前記機械学習を実行し、前記学習用特徴抽出部が学習用データから特徴を抽出する条件は、前記クラスタリング用特徴抽出部が前記複数の転移候補データの各々から特徴を抽出する条件と同じである。
請求項８記載の発明は、請求項７に記載の機械学習装置であって、さらに、前記目標ドメインと、前記事前ドメイン評価装置により前記機械学習に有効であると判断された全ての事前ドメインとを用いて前記機械学習を実行して転移識別部を生成する選択学習装置、
を備える。
請求項９記載の発明は、クラスタリング方法であって、転移学習を導入した機械学習に用いられる複数の転移候補データの各々から特徴を抽出して複数の転移候補特徴データを生成するステップと、生成された複数の転移候補特徴データの各々が有する特徴に基づいて、各転移候補特徴データを第１グループ及び第２グループを含む複数のグループに分類するステップと、前記第１グループに分類された転移候補特徴データの数が所定の分類継続基準値以下である場合、前記第１グループを前記機械学習に用いられる事前ドメインに決定するステップと、前記転移候補特徴データの数が前記分類継続基準値よりも大きい場合、前記第１グループに分類された転移候補特徴データをさらに分類することを決定するステップと、を備える。 The invention according to claim 7 is the machine learning device according to claim 6, wherein the prior domain evaluation device further extracts a feature possessed by each of learning data included in the target domain, The trial transition learning unit executes the machine learning using the learning feature data, and the learning feature extraction unit is characterized by the learning data. The conditions for extracting are the same as the conditions for the feature extraction unit for clustering to extract features from each of the plurality of transfer candidate data.
The invention according to claim 8 is the machine learning device according to claim 7, further comprising: the target domain, and all the prior domains judged to be effective for the machine learning by the prior domain evaluation device. And a selection learning device that executes the machine learning to generate a transition identification unit,
Equipped with
The invention according to claim 9 is a clustering method, comprising the steps of: extracting features from each of a plurality of transfer candidate data used for machine learning in which transfer learning is introduced and generating a plurality of transfer candidate feature data; Classifying each of the transfer candidate feature data into a plurality of groups including a first group and a second group based on the features of each of the plurality of transfer candidate feature data, and the transfer classified into the first group When the number of candidate feature data is equal to or less than a predetermined classification continuation reference value, the first group is determined to be a pre-domain used for the machine learning, and the number of transfer candidate feature data is from the classification continuation reference value If it is also large, it is determined to further classify the transition candidate feature data classified into the first group.

請求項１０記載の発明は、転移学習を導入した機械学習に用いられる複数の転移候補データの各々を分類するクラスタリング方法をコンピュータに実行させるためのプログラムであって、前記機械学習に用いられる複数の転移候補データの各々から特徴を抽出して複数の転移候補特徴データを生成するステップと、生成された複数の転移候補特徴データの各々が有する特徴に基づいて、各転移候補特徴データを第１グループ及び第２グループを含む複数のグループに分類するステップと、前記第１グループに分類された転移候補特徴データの数が所定の分類継続基準値以下である場合、前記第１グループを前記機械学習に用いられる事前ドメインに決定するステップと、前記転移候補特徴データの数が前記分類継続基準値よりも大きい場合、前記第１グループに分類された転移候補特徴データをさらに分類することを決定するステップと、を備えるクラスタリング方法をコンピュータに実行させるためのプログラムである。 The invention according to claim 10 is a program for causing a computer to execute a clustering method for classifying each of a plurality of transfer candidate data used for machine learning into which transfer learning has been introduced, the plurality of used for the machine learning Extracting a feature from each of the transfer candidate data and generating a plurality of transfer candidate feature data; and based on the features possessed by each of the plurality of transfer candidate feature data generated, each transfer candidate feature data being a first group And classifying the first group into the plurality of groups including the second group, and when the number of transfer candidate feature data classified into the first group is equal to or less than a predetermined classification continuation reference value, Determining a prior domain to be used, and if the number of transfer candidate feature data is greater than the classification continuation reference value; Determining that the further classify transition candidate feature data classified into serial first group, is a program for executing a clustering method on a computer equipped with.

本発明の機械学習装置において、複数の転移候補データから生成された複数の転移候補特徴データは、各々の特徴に基づいて、第１グループ及び第２グループを含む複数のグループに分類される。第１グループに分類された転移候補特徴データの数が分類継続基準値以下である場合、第１グループは、事前ドメインに決定され、転移候補特徴データの数が分類継続基準値よりも大きい場合、第１グループに分類された転移候補特徴データはさらに分類される。これにより、転移学習を導入した機械学習に用いられる事前ドメインを効率的に作成することができる。 In the machine learning apparatus of the present invention, the plurality of transfer candidate feature data generated from the plurality of transfer candidate data are classified into a plurality of groups including a first group and a second group based on the respective features. When the number of transfer candidate feature data classified into the first group is equal to or less than the classification continuation reference value, the first group is determined to be a pre-domain, and the number of transfer candidate feature data is larger than the classification continuation reference value The transfer candidate feature data classified into the first group is further classified. Thereby, it is possible to efficiently create the pre-domain used for machine learning in which transfer learning is introduced.

本発明の実施の形態に係る機械学習装置の構成を示す機能ブロック図である。It is a functional block diagram showing composition of a machine learning device concerning an embodiment of the invention. 図１に示すクラスタリング装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the clustering apparatus shown in FIG. 図１に示す事前ドメイン評価装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the prior domain evaluation apparatus shown in FIG. 図１に示す選択学習装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the selective learning apparatus shown in FIG. 図１に示す機械学習装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the machine learning apparatus shown in FIG. 図１に示す転移候補データから生成される転移候補特徴データ及び学習用データから生成される学習用特徴データの分布の一例を示す図である。FIG. 7 is a view showing an example of distribution of transition candidate feature data generated from the transition candidate data shown in FIG. 1 and learning feature data generated from learning data. 図６に示す転移候補特徴データを分類することにより生成される事前ドメインの範囲を示す図である。It is a figure which shows the range of the prior domain produced | generated by classify | categorizing the transfer candidate characteristic data shown in FIG. 図５に示す事前ドメイン生成処理のフローチャートである。It is a flowchart of the prior domain production | generation process shown in FIG. 図５に示す事前ドメイン生成処理において作成される分類木の初期構造を示す図である。FIG. 6 is a diagram showing an initial structure of a classification tree created in the pre-domain generation process shown in FIG. 5; 図９に示す分類木にノードが追加された場合の構造の一例を示す図である。It is a figure which shows an example of a structure when a node is added to the classification tree shown in FIG. 図５に示す事前ドメイン生成処理が終了したときにおける分類木の構造の一例を示す図である。It is a figure which shows an example of a structure of a classification tree when the prior domain production | generation process shown in FIG. 5 is complete | finished. 図５に示す事前ドメイン評価処理のフローチャートである。It is a flowchart of the prior domain evaluation process shown in FIG. 図１１に示す分類木の変形例を示す図である。It is a figure which shows the modification of the classification | category tree shown in FIG.

以下、図面を参照しつつ、本発明の実施の形態を詳しく説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

［１．機械学習装置１００の構成］
［１．１．全体構成］
図１は、本発明の実施の形態に係る機械学習装置１００の構成を示す機能ブロック図である。図１に示す機械学習装置１００は、記憶装置４００に記憶された複数の転移候補データ４１と、記憶装置５００に記憶された目標ドメイン５０Ａとを用いて、転移学習を導入した機械学習を実行する。機械学習装置１００は、上記の機械学習の結果として検出対象を識別するための転移識別データ８０を生成する。 [1. Configuration of Machine Learning Device 100]
[1.1. overall structure]
FIG. 1 is a functional block diagram showing the configuration of a machine learning apparatus 100 according to an embodiment of the present invention. The machine learning apparatus 100 shown in FIG. 1 executes machine learning in which transfer learning is introduced, using the plurality of transfer candidate data 41 stored in the storage device 400 and the target domain 50A stored in the storage device 500. . The machine learning apparatus 100 generates transition identification data 80 for identifying a detection target as a result of the above-described machine learning.

本実施の形態において、検出対象は、人物である。機械学習装置１００により生成される転移識別データ８０は、人物検出装置（図示省略）がカメラにより撮影された画像から人物を検出するために用いられる。機械学習装置１００は、転移識別データ８０を生成するための学習アルゴリズムとして、転移学習を導入したランダムフォレストを用いる。従って、転移識別データ８０は、複数の決定木により構成されるデータ群である。 In the present embodiment, the detection target is a person. The transition identification data 80 generated by the machine learning device 100 is used by a person detection device (not shown) to detect a person from an image captured by a camera. The machine learning apparatus 100 uses, as a learning algorithm for generating the transfer identification data 80, a random forest into which transfer learning is introduced. Therefore, the transition identification data 80 is a data group constituted by a plurality of decision trees.

記憶装置５００は、目標ドメイン５０Ａを記憶する。目標ドメイン５０Ａは、所定の条件下における検出対象（人物）の特徴を有する複数の画像のグループである。目標ドメイン５０Ａは、学習用データ５１，５１，・・・を含む。学習用データ５１は、例えば、俯角０°で人物を撮影した画像である。目標ドメイン５０Ａは、選択学習装置３０が転移学習を導入した機械学習を実行して転移識別データ８０を生成する際に用いられる。 The storage device 500 stores the target domain 50A. The target domain 50A is a group of a plurality of images having features of a detection target (person) under predetermined conditions. The target domain 50A includes learning data 51, 51,. The learning data 51 is, for example, an image obtained by photographing a person at a depression angle of 0 °. The target domain 50A is used when the selection learning device 30 executes machine learning in which transfer learning is introduced to generate transfer identification data 80.

記憶装置４００は、転移候補データ４１，４１，・・・を記憶する。複数の転移候補データ４１は、人物が撮影された画像であり、人物を撮影した画像をインターネット上で検索することにより収集される。転移候補データ４１，４１，・・・の各々の特徴に基づいて転移候補データ４１，４１，・・・を分類することにより、事前ドメイン４５，４５，・・・が生成される。事前ドメイン４５，４５，・・・のうち、転移学習に有効と判断された事前ドメイン４５が、転移識別データ８０の生成に用いられる。 The storage device 400 stores transfer candidate data 41, 41,. The plurality of pieces of transfer candidate data 41 are images in which a person is photographed, and are collected by searching the images in which the person is photographed on the Internet. By classifying the transfer candidate data 41, 41,... Based on the respective features of the transfer candidate data 41, 41,. Among the pre-domains 45, 45, ..., the pre-domain 45 judged to be effective for transfer learning is used for generation of transfer identification data 80.

機械学習装置１００は、クラスタリング装置１０と、事前ドメイン評価装置２０と、選択学習装置３０とを備える。 The machine learning device 100 includes a clustering device 10, a pre-domain evaluation device 20, and a selection learning device 30.

クラスタリング装置１０は、転移候補データ４１の各々の特徴に基づいて転移候補データ４１を分類して、事前ドメイン４５を生成する。 The clustering device 10 classifies the transfer candidate data 41 based on the respective features of the transfer candidate data 41 to generate the pre-domain 45.

事前ドメイン評価装置２０は、クラスタリング装置１０により生成された事前ドメイン４５の各々が転移学習に有効であるか否かを評価する。事前ドメイン評価装置２０は、各事前ドメイン４５の評価結果を示す評価結果データ２５３Ａを選択学習装置３０に出力する。 The preliminary domain evaluation device 20 evaluates whether each of the preliminary domains 45 generated by the clustering device 10 is effective for transfer learning. The prior domain evaluation device 20 outputs evaluation result data 253A indicating the evaluation result of each prior domain 45 to the selective learning device 30.

選択学習装置３０は、評価結果データ２５３Ａに基づいて、事前ドメイン４５のうち、事前ドメイン評価装置２０により転移学習に有効と判断された事前ドメイン４５を選択する。選択学習装置３０は、選択した事前ドメイン４５と、記憶装置５００に記憶された目標ドメイン５０Ａとを用いて、転移学習を導入した機械学習を実行する。その結果、転移識別データ８０が生成される。 The selective learning device 30 selects, from among the pre-domains 45, the pre-domain 45 judged to be effective for transfer learning by the pre-domain evaluation device 20 based on the evaluation result data 253A. The selective learning device 30 executes machine learning in which transfer learning is introduced, using the selected prior domain 45 and the target domain 50A stored in the storage device 500. As a result, metastasis identification data 80 is generated.

［１．２．クラスタリング装置１０の構成］
図２は、図１に示すクラスタリング装置１０の構成を示す機能ブロック図である。図２に示すように、クラスタリング装置１０は、特徴抽出部１１と、分類部１２と、分散計算部１３と、事前ドメイン決定部１４とを備える。 [1.2. Configuration of Clustering Device 10]
FIG. 2 is a functional block diagram showing the configuration of the clustering device 10 shown in FIG. As shown in FIG. 2, the clustering device 10 includes a feature extraction unit 11, a classification unit 12, a distributed calculation unit 13, and a pre-domain determination unit 14.

クラスタリング装置１０は、記憶装置４００から複数の転移候補データ４１を入力する。特徴抽出部１１は、クラスタリング装置１０に入力された複数の転移候補データ４１の各々からＨＯＧ（Histograms of Oriented Gradients）特徴量を抽出して、転移候補データ４１の各々に対応する複数の転移候補特徴データ４２を生成する。以下、特に説明のない限り、ＨＯＧ特徴量を単に「特徴量」と記載する。 The clustering device 10 inputs a plurality of transfer candidate data 41 from the storage device 400. The feature extraction unit 11 extracts HOG (Histograms of Oriented Gradients) feature quantities from each of the plurality of transfer candidate data 41 input to the clustering apparatus 10, and a plurality of transfer candidate features corresponding to each of the transfer candidate data 41. Data 42 is generated. Hereinafter, the HOG feature amount is simply referred to as a “feature amount” unless otherwise described.

分類部１２は、特徴抽出部１１から複数の転移候補特徴データ４２を入力する。分類部１２は、入力した複数の転移候補特徴データ４２の各々に含まれる特徴量に基づいて、転移候補特徴データ４２を複数のグループに分類する。転移候補特徴データ４２の分類には、デンシティフォレスト（Density Forest）と呼ばれるアルゴリズムが用いられる。分類部１２は、１本の分類木を作成しながら、複数の転移候補特徴データ４２を分類する。分類木を構成するノードの各々が、各グループに対応する。 The classification unit 12 receives the plurality of transfer candidate feature data 42 from the feature extraction unit 11. The classification unit 12 classifies the transition candidate feature data 42 into a plurality of groups based on the feature amounts included in each of the plurality of input transition candidate feature data 42. An algorithm called Density Forest is used to classify the transfer candidate feature data 42. The classification unit 12 classifies the plurality of transfer candidate feature data 42 while creating one classification tree. Each of the nodes constituting the classification tree corresponds to each group.

分散計算部１３は、各ノードの共分散を計算する。各ノードの共分散は、各ノードに属する転移候補特徴データ４２が有する特徴量から計算される。各ノードの共分散は、各ノードに属する転移候補特徴データ４２を分類する際に用いられる。また、共分散は、分類木を構成するノードを事前ドメインに決定するか否かを判断するために用いられる。 The variance calculator 13 calculates the covariance of each node. The covariance of each node is calculated from the feature amount of the transfer candidate feature data 42 belonging to each node. The covariance of each node is used in classifying transition candidate feature data 42 belonging to each node. In addition, covariance is used to determine whether or not nodes constituting a classification tree are determined in advance.

事前ドメイン決定部１４は、分類木を構成するノードが事前ドメインとしての条件を満たしているか否かを判断する。判断対象のノードに属する転移候補特徴データ４２の数が、予め設定された分類継続基準値以下である場合、事前ドメイン決定部１４は、判断対象のノードを事前ドメインに決定する。 The prior domain determination unit 14 determines whether or not the nodes forming the classification tree satisfy the condition as the prior domain. If the number of transfer candidate feature data 42 belonging to the determination target node is equal to or less than the preset classification continuation reference value, the prior domain determination unit 14 determines the determination target node as the prior domain.

判断対象のノードに属する転移候補特徴データ４２の数が、分類継続基準値より大きい場合、事前ドメイン決定部１４は、判断対象のノードの共分散を予め設定された分散基準値と比較する。判断対象のノードの共分散が分散基準値以下である場合、事前ドメイン決定部１４は、判断対象のノードを事前ドメインに決定する。一方、判断対象のノードの分散が分散基準値より大きい場合、事前ドメイン決定部１４は、判断対象のノードに属する転移候補特徴データ４２をさらに分類することを決定する。 If the number of transfer candidate feature data 42 belonging to the judgment target node is larger than the classification continuation reference value, the pre-domain determination unit 14 compares the covariance of the judgment target node with the dispersion reference value set in advance. When the covariance of the node to be judged is equal to or less than the dispersion reference value, the prior domain determining unit 14 determines the node to be judged as the prior domain. On the other hand, when the variance of the node to be determined is larger than the variance reference value, the prior domain determination unit 14 determines to further classify the transfer candidate feature data 42 belonging to the node to be determined.

［１．３．事前ドメイン評価装置２０の構成］
図３は、図１に示す事前ドメイン評価装置２０の構成を示す機能ブロック図である。図３に示すように、事前ドメイン評価装置２０は、一時記憶部２１と、特徴抽出部２２と、試行転移学習部２３と、比較学習部２４と、評価部２５とを備える。 [1.3. Configuration of Prior Domain Evaluation Device 20]
FIG. 3 is a functional block diagram showing the configuration of the preliminary domain evaluation device 20 shown in FIG. As shown in FIG. 3, the prior domain evaluation device 20 includes a temporary storage unit 21, a feature extraction unit 22, a trial transfer learning unit 23, a comparison learning unit 24, and an evaluation unit 25.

事前ドメイン評価装置２０は、記憶装置４００に記憶された目標ドメイン５０Ａを入力し、クラスタリング装置１０により生成された事前ドメイン４５を入力する。 The prior domain evaluation device 20 inputs the target domain 50A stored in the storage device 400, and inputs the prior domain 45 generated by the clustering device 10.

一時記憶部２１は、クラスタリング装置１０から入力した事前ドメイン４５を一時的に記憶する。 The temporary storage unit 21 temporarily stores the prior domain 45 input from the clustering device 10.

特徴抽出部２２は、事前ドメイン評価装置２０に入力された目標ドメイン５０Ａに含まれる学習用データ５１，５１，・・・の各々から特徴量を抽出して、各学習用データ５１に対応する複数の学習用特徴データ５２を生成する。特徴抽出部２２により生成された学習用特徴データ５２は、目標ドメイン５０Ｂを構成する。 The feature extraction unit 22 extracts feature amounts from each of the learning data 51, 51, ... included in the target domain 50A input to the prior domain evaluation device 20, and a plurality of feature data corresponding to each learning data 51 are extracted. Feature data 52 for learning. The learning feature data 52 generated by the feature extraction unit 22 constitutes a target domain 50B.

試行転移学習部２３は、目標ドメイン５０Ｂを特徴抽出部２２から取得する。試行転移学習部２３は、事前ドメイン４５のうちいずれか１つの事前ドメイン（注目事前ドメイン）を評価対象として一時記憶部２１から取得する。試行転移学習部２３は、取得した目標ドメイン５０Ａ及び注目事前ドメインを用いて、注目事前ドメインの転移学習の有効性を評価するための機械学習（試行転移学習）を実行する。転移学習を導入したランダムフォレストが、試行転移学習のアルゴリズムとして用いられる。試行転移学習の結果、注目事前ドメインに対応する試行転移識別部６３が生成される。試行転移識別部６３の実体は、複数の決定木により構成されるデータ群である。試行転移識別部６３は、事前ドメイン４５ごとに生成される。 The trial transfer learning unit 23 acquires the target domain 50B from the feature extraction unit 22. The trial transfer learning unit 23 acquires, from the temporary storage unit 21, any one of the pre-domains (target pre-domains) of the pre-domains 45 as an evaluation target. The trial transfer learning unit 23 executes machine learning (trial transfer learning) for evaluating the effectiveness of transfer learning of the focused pre-domain, using the acquired target domain 50A and the focused pre-domain. A random forest introduced with transfer learning is used as an algorithm for trial transfer learning. As a result of trial transfer learning, a trial transfer identification unit 63 corresponding to the target pre-domain is generated. The entity of the trial transition identification unit 63 is a data group configured by a plurality of decision trees. The trial transfer identification unit 63 is generated for each prior domain 45.

比較学習部２４は、注目事前ドメインのみを利用して、比較用の機械学習（比較学習）を実行する。転移学習が導入されないランダムフォレストが、比較学習のアルゴリズムとして用いられる。比較学習の結果、注目事前ドメインに対応する比較識別部６４を生成する。比較識別部６４の実体は、複数の決定木を構成するデータ群である。比較識別部６４は、事前ドメイン４５ごとに生成される。 The comparison learning unit 24 executes machine learning for comparison (comparison learning) using only the focused pre-domain. A random forest in which transfer learning is not introduced is used as an algorithm for comparison learning. As a result of the comparison learning, the comparison identification unit 64 corresponding to the focused pre-domain is generated. The entity of the comparison identification unit 64 is a data group constituting a plurality of decision trees. The comparison identification unit 64 is generated for each prior domain 45.

評価部２５は、試行転移識別部６３及び比較識別部６４による各々の識別結果を用いて、注目事前ドメインが転移学習に有効であるか否かを判断する。評価部２５は、競合値計算部２５１と、信頼度計算部２５２と、転移評価部２５３とを備える。 The evaluation unit 25 uses the identification results of the trial transition identification unit 63 and the comparison identification unit 64 to determine whether or not the focused pre-domain is effective for transition learning. The evaluation unit 25 includes a competition value calculation unit 251, a reliability calculation unit 252, and a transition evaluation unit 253.

競合値計算部２５１は、比較識別部６４によるサンプルデータの識別結果を試行転移識別部６３によるサンプルデータの識別結果と比較する。サンプルデータは、目標ドメイン５０Ｂに含まれる学習用特徴データ５２及び注目事前ドメインに含まれる転移候補特徴データ４２のうちいずれかである。競合値計算部２５１は、比較結果に基づいて、競合値２５１Ａを計算する。競合値２５１Ａは、比較識別部６４による識別結果と、試行転移識別部６３による識別結果とが一致しない度合いを示す。 The competition value calculation unit 251 compares the identification result of the sample data by the comparison identification unit 64 with the identification result of the sample data by the trial transition identification unit 63. The sample data is any one of learning feature data 52 included in the target domain 50B and transfer candidate feature data 42 included in the target pre-domain. The competition value calculation unit 251 calculates the competition value 251A based on the comparison result. The competition value 251A indicates the degree to which the identification result by the comparison identification unit 64 and the identification result by the trial transfer identification unit 63 do not match.

信頼度計算部２５２は、試行転移識別部６３によるサンプルデータの識別結果を用いて、信頼度２５２Ａを計算する。信頼度２５２Ａは、試行転移識別部６３による識別結果の信頼性を示す。 The reliability calculation unit 252 calculates the reliability 252A using the identification result of the sample data by the trial transition identification unit 63. The reliability 252A indicates the reliability of the identification result by the trial transition identification unit 63.

転移評価部２５３は、競合値２５１Ａ及び信頼度２５２Ａに基づいて、注目事前ドメインが転移学習に有効であるか否かを評価する。転移評価部２５３は、事前ドメイン４５の各々の評価を示す評価結果データ２５３Ａを選択学習装置３０へ出力する。 The transfer evaluation unit 253 evaluates whether or not the focused pre-domain is effective for transfer learning based on the competition value 251A and the reliability 252A. The transfer evaluation unit 253 outputs evaluation result data 253A indicating the evaluation of each of the pre-domains 45 to the selective learning device 30.

［１．４．選択学習装置３０の構成］
図４は、図１に示す選択学習装置３０の構成を示す機能ブロック図である。図４に示すように、選択学習装置３０は、事前ドメイン選択部３１と、特徴抽出部３２と、転移学習部３３とを備える。 [1.4. Configuration of Selective Learning Device 30]
FIG. 4 is a functional block diagram showing the configuration of the selection learning device 30 shown in FIG. As shown in FIG. 4, the selection learning device 30 includes a pre-domain selection unit 31, a feature extraction unit 32, and a transfer learning unit 33.

事前ドメイン選択部３１は、クラスタリング装置１０から事前ドメイン４５を入力し、評価結果データ２５３Ａを事前ドメイン評価装置２０から入力する。事前ドメイン選択部３１は、入力した評価結果データ２５３Ａに基づいて、クラスタリング装置１０により生成された事前ドメイン４５のうち、転移学習に有効と評価された事前ドメイン４５を選択する。 The prior domain selection unit 31 receives the prior domain 45 from the clustering device 10, and receives the evaluation result data 253A from the prior domain evaluation device 20. The prior domain selection unit 31 selects, from among the prior domains 45 generated by the clustering device 10, the prior domain 45 that has been evaluated to be effective for transfer learning, based on the input evaluation result data 253A.

特徴抽出部３２は、記憶装置５００に記憶された目標ドメイン５０Ａを取得する。特徴抽出部３２は、特徴抽出部２２と同様に、取得した目標ドメイン５０Ａに含まれる学習用データ５１，５１，・・・の各々から特徴量を抽出して目標ドメイン５０Ｂを生成する。 The feature extraction unit 32 acquires the target domain 50A stored in the storage device 500. Like the feature extraction unit 22, the feature extraction unit 32 extracts feature amounts from each of the learning data 51, 51,... Included in the acquired target domain 50A, and generates a target domain 50B.

転移学習部３３は、目標ドメイン５０Ｂと、事前ドメイン選択部３１により選択された事前ドメイン４５とを用いて、転移学習を導入した機械学習を実行する。転移学習部３３が用いる学習アルゴリズムは、試行転移学習部２３が用いる学習アルゴリズムと同じである。転移学習部３３は、転移学習を導入した機械学習の結果として、転移識別データ８０を生成する。 The transfer learning unit 33 executes machine learning in which transfer learning is introduced, using the target domain 50B and the pre-domain 45 selected by the pre-domain selecting unit 31. The learning algorithm used by the transfer learning unit 33 is the same as the learning algorithm used by the trial transfer learning unit 23. The transfer learning unit 33 generates transfer identification data 80 as a result of machine learning in which transfer learning is introduced.

［２．動作概略］
図５は、機械学習装置１００の動作の概略を示すフローチャートである。図５に示すように、機械学習装置１００において、クラスタリング装置１０は、記憶装置４００に記憶された転移候補データ４１，４１，・・・から事前ドメイン４５を生成する事前ドメイン生成処理を実行する（ステップＳ１）。 [2. Outline of operation]
FIG. 5 is a flowchart showing an outline of the operation of the machine learning apparatus 100. As shown in FIG. 5, in the machine learning device 100, the clustering device 10 executes pre-domain generation processing for generating the pre-domain 45 from the transfer candidate data 41, 41,... Stored in the storage device 400 ( Step S1).

クラスタリング装置１０により生成される事前ドメイン４５の数は、特に限定されない。事前ドメイン４５の各々は、転移候補データ４１から特徴量を抽出することにより生成された転移候補特徴データ４２を有する。 The number of pre-domains 45 generated by the clustering device 10 is not particularly limited. Each of the prior domains 45 has transfer candidate feature data 42 generated by extracting feature amounts from transfer candidate data 41.

事前ドメイン評価装置２０は、クラスタリング装置１０により生成された事前ドメイン４５の各々が転移学習に有効であるか否かを判断する事前ドメイン評価処理を実行する（ステップＳ２）。事前ドメイン評価装置２０は、ステップＳ２の結果として、評価結果データ２５３Ａを生成する。評価結果データ２５３Ａは、クラスタリング装置１０により生成された事前ドメイン４５のうち、転移学習に有効と判断された事前ドメイン４５を特定したデータである。 The preliminary domain evaluation device 20 executes a preliminary domain evaluation process to determine whether each of the preliminary domains 45 generated by the clustering device 10 is effective for transfer learning (step S2). The prior domain evaluation device 20 generates evaluation result data 253A as a result of step S2. The evaluation result data 253A is data in which, among the pre-domains 45 generated by the clustering device 10, the pre-domain 45 determined to be effective for transfer learning is specified.

選択学習装置３０において、事前ドメイン選択部３１は、評価結果データ２５３Ａに基づいて、クラスタリング装置１０により生成された事前ドメイン４５の中から、転移学習に有効と判断された事前ドメイン４５を選択する（ステップＳ３）。 In the selective learning device 30, the prior domain selection unit 31 selects the prior domain 45 judged to be effective for transfer learning from the prior domains 45 generated by the clustering device 10 based on the evaluation result data 253A ( Step S3).

特徴抽出部３２（図４参照）は、記憶装置５００から目標ドメイン５０Ａを取得する。特徴抽出部３２は、取得した目標ドメイン５０Ａに含まれる学習用データ５１の各々から特徴量を抽出して、複数の学習用特徴データ５２を生成する（ステップＳ４）。特徴抽出部３２により実行される処理は、図３に示す特徴抽出部２２により実行される処理と同じである。つまり、特徴抽出部３２は、複数の学習用特徴データ５２により構成される目標ドメイン５０Ｂを生成する。 The feature extraction unit 32 (see FIG. 4) acquires the target domain 50A from the storage device 500. The feature extraction unit 32 extracts feature amounts from each of the learning data 51 included in the acquired target domain 50A, and generates a plurality of learning feature data 52 (step S4). The process performed by the feature extraction unit 32 is the same as the process performed by the feature extraction unit 22 illustrated in FIG. 3. That is, the feature extraction unit 32 generates a target domain 50B configured of a plurality of learning feature data 52.

転移学習部３３は、事前ドメイン選択部３１により選択された事前ドメイン４５と、特徴抽出部３２により生成された目標ドメイン５０Ｂとを用いて、転移学習を導入した機械学習を実行する（ステップＳ５）。転移学習部３３は、試行転移学習部２３が用いる学習アルゴリズムと同じ学習アルゴリズム（転移学習を導入したランダムフォレスト）を用いる。これにより、複数の決定木を示すデータ群である転移識別データ８０が生成される。 The transfer learning unit 33 executes machine learning in which transfer learning is introduced using the prior domain 45 selected by the prior domain selection unit 31 and the target domain 50B generated by the feature extraction unit 32 (step S5). . The transfer learning unit 33 uses the same learning algorithm (random forest to which transfer learning is introduced) as the learning algorithm used by the trial transfer learning unit 23. This generates transition identification data 80 which is a data group indicating a plurality of decision trees.

以下、事前ドメイン生成処理（ステップＳ１）及び事前ドメイン評価処理（ステップＳ２）が実行される理由を説明する。 Hereinafter, the reason why the prior domain generation process (step S1) and the prior domain evaluation process (step S2) are performed will be described.

図６は、目標ドメイン５０Ｂ及び転移候補特徴データ４２の分布の一例を示す図である。図６は、転移候補特徴データ４２及び学習用特徴データ５２の特徴量の次元数が２である場合を例にして、転移候補特徴データ４２の分布と、目標ドメイン５０Ｂを構成する学習用特徴データ５２の分布とを示している。 FIG. 6 is a view showing an example of the distribution of the target domain 50B and the transfer candidate feature data 42. As shown in FIG. FIG. 6 illustrates the distribution of the transfer candidate feature data 42 and the feature data for learning constituting the target domain 50B, taking the case where the number of dimensions of the feature amounts of the transfer candidate feature data 42 and the feature data for learning 52 is two. 52 shows the distribution.

目標ドメイン５０Ｂは、学習用データ５１から特徴量を抽出することにより生成された学習用特徴データ５２を含む。複数の学習用データ５１は、上述のように、俯角０°で撮影した人物を含む画像であるため、互いに類似する特徴を有する。従って、図６に示す２次元空間において、学習用特徴データ５２のばらつきは小さく、目標ドメイン５０Ｂは、比較的狭い領域に限定される。 The target domain 50 B includes learning feature data 52 generated by extracting feature quantities from the learning data 51. Since the plurality of learning data 51 are images including a person photographed at a depression angle of 0 ° as described above, they have characteristics similar to each other. Therefore, in the two-dimensional space shown in FIG. 6, the variation of the learning feature data 52 is small, and the target domain 50B is limited to a relatively narrow region.

一方、転移候補特徴データ４２の分布は、学習用特徴データ５２に比べてばらつきが大きい。転移候補データ４１は、インターネット上で検出対象（人物）を検索することにより収集されるため、転移候補データ４１の人物の撮影条件は、様々である。転移候補特徴データ４２は、転移候補データ４１から特徴量を抽出することにより生成される。従って、転移候補特徴データ４２は、図６に示す２次元空間全体に広がり、その位置はランダムである。 On the other hand, the distribution of the transfer candidate feature data 42 has a larger variation than the learning feature data 52. Since the transfer candidate data 41 is collected by searching for a detection target (person) on the Internet, the imaging conditions of the person of the transfer candidate data 41 are various. The transfer candidate feature data 42 is generated by extracting feature amounts from the transfer candidate data 41. Therefore, the transition candidate feature data 42 is spread over the entire two-dimensional space shown in FIG. 6, and its position is random.

ここで、転移学習を導入した機械学習について、画像から人物を検出する場合を例に説明する。転移学習を導入した機械学習では、目標ドメインと、事前ドメインとが予め準備される。目標ドメインは、所定の条件下における検出対象の特徴を有する画像のグループである。本実施の形態において、検出対象は人物であり、所定の条件は、検出対象（人物）が俯角０°で撮影された画像に含まれていることである。 Here, machine learning in which transfer learning is introduced will be described using an example of detecting a person from an image. In machine learning in which transfer learning is introduced, a target domain and a prior domain are prepared in advance. The target domain is a group of images having features to be detected under predetermined conditions. In the present embodiment, the detection target is a person, and the predetermined condition is that the detection target (person) is included in the image captured at a depression angle of 0 °.

事前ドメインは、上記の所定の条件と異なる条件下における検出対象の特徴を有する画像のグループである。事前ドメインは、収集された画像を所定の規則により分類することにより生成される。例えば、収集された各画像の撮影条件がわかっている場合、収集された画像を撮影条件に応じて分類することができる。これにより、事前ドメインは、互いに共通する、又は類似する特徴を有する画像の集合となる。 The pre-domain is a group of images having features to be detected under conditions different from the predetermined conditions described above. The pre-domain is generated by classifying the collected images according to a predetermined rule. For example, when the imaging condition of each acquired image is known, the acquired image can be classified according to the imaging condition. Thus, the pre-domains are a set of images having features common or similar to one another.

機械学習装置が転移学習を導入した機械学習を実行する場合、最初に事前ドメインの学習が行われ、次に目標ドメインの学習が行われる。そして、機械学習装置は、俯角０°で撮影された人物の特徴と共通又は類似する特徴を有する画像を特定し、特定した画像が有する特徴を、目標ドメイン５０Ｂに含まれる画像の学習結果に転移させる。これにより、目標ドメインを構成する画像の数を削減することができるとともに、人物の識別精度を向上させることができる。 When the machine learning apparatus executes machine learning in which transfer learning is introduced, learning of a pre-domain is performed first, and then learning of a target domain is performed. Then, the machine learning apparatus identifies an image having a feature common or similar to the feature of the person photographed at a depression angle of 0 °, and transfers the feature included in the identified image to the learning result of the image included in the target domain 50B. Let As a result, the number of images constituting the target domain can be reduced, and the identification accuracy of the person can be improved.

しかし、ある事前ドメインにおける画像の特徴が、目標ドメインにおける画像の特徴と大きく異なる場合、負の転移が発生する。この理由は、この事前ドメインにおける画像の特徴が、転移学習により、目標ドメインにおける画像の学習結果に反映されるためである。この結果、転移学習を導入した機械学習の結果として生成される識別器の精度が低下する。 However, if the image features in one prior domain differ significantly from the image features in the target domain, a negative transition occurs. The reason is that the feature of the image in this prior domain is reflected in the learning result of the image in the target domain by transfer learning. As a result, the accuracy of the classifier generated as a result of machine learning that introduced transfer learning is reduced.

図６に示すように、２次元空間全体に広がる全ての転移候補特徴データ４２を１つの事前ドメインとした場合、目標ドメイン５０Ｂの領域から離れた転移候補特徴データ４２が、転移学習に用いられることになる。この場合、負の転移が発生する可能性が非常に高い。負の転移の発生を防ぐためには、互いに共通する、又は類似する特徴を有する転移候補特徴データ４２同士をまとめることにより、事前ドメイン４５を生成し、生成した事前ドメイン４５が、転移学習を導入した機械学習に有効であるか否かを判断すればよい。事前ドメイン生成処理（ステップＳ１）は、互いに共通する、又は類似する特徴を有する転移候補特徴データ４２の集合である事前ドメイン４５を生成するために実行される。 As shown in FIG. 6, when all transition candidate feature data 42 extending over the entire two-dimensional space is one prior domain, the transition candidate feature data 42 apart from the target domain 50B is used for transition learning. become. In this case, the possibility of negative metastasis is very high. In order to prevent the occurrence of negative metastasis, the prior domain 45 is generated by putting together transition candidate feature data 42 having common or similar features, and the generated prior domain 45 introduces the transition learning. It may be determined whether it is effective for machine learning. The pre-domain generation process (step S1) is performed to generate a pre-domain 45 which is a set of transfer candidate feature data 42 having common or similar features.

図７は、図６に示す転移候補特徴データ４２を分類した一例を示す図である。クラスタリング装置１０は、図７に示す転移候補特徴データ４２を分類することにより、事前ドメイン４５Ａ〜４５Ｇを生成する。 FIG. 7 is a view showing an example in which the transfer candidate feature data 42 shown in FIG. 6 is classified. The clustering device 10 generates prior domains 45A to 45G by classifying the transfer candidate feature data 42 illustrated in FIG. 7.

事前ドメイン４５Ａ〜４５Ｇの中で、事前ドメイン４５Ａ及び４５Ｆは、目標ドメイン５０Ｂと重複していない。従って、事前ドメイン４５Ａ及び４５Ｆは、転移学習を導入した機械学習に有効でない。また、事前ドメイン４５Ｄは、目標ドメイン５０Ｂと重複しているが、重複している範囲が他の事前ドメインに比べて少ない。従って、事前ドメイン４５Ｄは、負の転移を発生させる可能性があり、転移学習に有効でない。 Among the pre-domains 45A-45G, pre-domains 45A and 45F do not overlap with the target domain 50B. Thus, the pre-domains 45A and 45F are not effective for machine learning that introduced transfer learning. Also, although the pre-domain 45D overlaps with the target domain 50B, the overlapping range is smaller compared to other pre-domains. Thus, the pre-domain 45D can cause negative metastasis and is not effective for metastasis learning.

このように、事前ドメイン生成処理（ステップＳ１）により、負の転移を発生させる可能性がある（転移学習に有効でない）事前ドメインが生成される可能性がある。転移学習を導入した機械学習の結果として生成される転移識別器の精度を向上させるためには、転移学習に有効でない事前ドメインを予め除外しておくことが望ましい。このため、事前ドメイン評価処理（ステップＳ２）は、事前ドメイン生成処理（ステップＳ１）により生成された事前ドメイン４５Ａ〜４５Ｇの中で、転移学習に有効な事前ドメインを特定するために行われる。 Thus, the pre-domain generation process (step S1) may generate a pre-domain (which is not effective for transfer learning) which may cause negative transfer. In order to improve the accuracy of a transition classifier generated as a result of machine learning in which transition learning is introduced, it is desirable to exclude beforehand a pre-domain that is not effective for transition learning. Therefore, the preliminary domain evaluation process (step S2) is performed to identify the preliminary domain effective for the transfer learning among the preliminary domains 45A to 45G generated by the preliminary domain generation process (step S1).

［３．事前ドメイン生成処理（ステップＳ１）］
図８は、事前ドメイン生成処理（ステップＳ１）のフローチャートである。図８を参照しながら、記憶装置４００に記憶された転移候補データ４１，４１，・・・から事前ドメイン４５を生成するクラスタリング装置１０の動作を詳しく説明する。 [3. Prior domain generation processing (step S1)]
FIG. 8 is a flowchart of the prior domain generation process (step S1). The operation of the clustering device 10 for generating the pre-domain 45 from the transition candidate data 41, 41,... Stored in the storage device 400 will be described in detail with reference to FIG.

［３．１．ＨＯＧ特徴量の抽出］
クラスタリング装置１０は、記憶装置４００に記憶された全ての転移候補データ４１を取得する。クラスタリング装置１０において、特徴抽出部１１（図２参照）は、取得した全ての転移候補データ４１の各々からＨＯＧ特徴量を抽出する（ステップＳ１０１）。これにより、全ての転移候補データ４１の各々に対応する複数の転移候補特徴データ４２が生成される。 [3.1. Extraction of HOG feature value]
The clustering device 10 acquires all the transfer candidate data 41 stored in the storage device 400. In the clustering device 10, the feature extraction unit 11 (see FIG. 2) extracts the HOG feature amount from each of all the acquired transfer candidate data 41 (step S101). Thereby, a plurality of transfer candidate feature data 42 corresponding to each of all transfer candidate data 41 is generated.

特徴抽出部１１は、転移候補データ４１からＨＯＧ特徴量を抽出する条件を、例えば、以下のように設定する。転移候補データ４１の色チャンネルは、グレースケールに設定される。転移候補データ４１のサイズは、縦６０ピクセル、横３０ピクセルに設定される。 The feature extraction unit 11 sets, for example, the conditions for extracting the HOG feature amount from the transfer candidate data 41 as follows. The color channel of the transition candidate data 41 is set to grayscale. The size of the transition candidate data 41 is set to 60 pixels vertically and 30 pixels horizontally.

ＨＯＧ特徴量の抽出時のパラメータとして、セル、ブロック、勾配方向数が設定される。セルは、輝度の勾配方向を計算する単位領域である。ブロックは、輝度の勾配方向のヒストグラムを作成する単位領域である。勾配方向数は、０°以上１８０°以下の範囲における分割数である。 The number of cells, blocks, and gradient directions are set as parameters at the time of extraction of HOG feature quantities. A cell is a unit area for calculating the gradient direction of luminance. A block is a unit area that creates a histogram in the gradient direction of luminance. The number of gradient directions is the number of divisions in the range of 0 ° to 180 °.

例えば、１セルの大きさは、縦５ピクセル、横５ピクセルに設定される。１ブロックの大きさは、縦３ピクセル、横３ピクセルに設定される。勾配方向数は、９に設定される。勾配方向数が９である場合、各セルの勾配方向は、２０°おきに９方向に分割され、９方向のうちいずれかの方向に設定される。この場合、転移候補特徴データ４２の次元数は、３２４０となる。 For example, the size of one cell is set to 5 pixels vertically and 5 pixels horizontally. The size of one block is set to 3 pixels vertically and 3 pixels horizontally. The number of gradient directions is set to nine. When the number of gradient directions is nine, the gradient direction of each cell is divided into nine directions at intervals of 20 °, and is set to one of nine directions. In this case, the number of dimensions of the transfer candidate feature data 42 is 3240.

［３．２．ルートノード３５Ｒにおける分類可否の判断］
図９は、分類部１２により生成される分類木３５の初期構造を示す図である。分類部１２は、転移候補特徴データ４２を分類するためのアルゴリズムとしてデンシティフォレストを用いる。デンシティフォレストを用いる場合、通常であれば、複数の分類木が生成されるが、分類部１２は、１本の分類木のみを生成する。 [3.2. Determination of classification availability at the root node 35R]
FIG. 9 is a diagram showing an initial structure of the classification tree 35 generated by the classification unit 12. The classification unit 12 uses a density forest as an algorithm for classifying the transfer candidate feature data 42. When a density forest is used, a plurality of classification trees are usually generated, but the classification unit 12 generates only one classification tree.

分類木３５は、転移候補特徴データ４２が分類部１２により分類される過程で形成される。分類木３５を構成するノードのうち、所定の条件を満たすノードが、事前ドメインに決定される。 The classification tree 35 is formed in the process in which the transfer candidate feature data 42 is classified by the classification unit 12. Among the nodes constituting the classification tree 35, nodes satisfying a predetermined condition are determined as the pre-domain.

分類部１２は、分類木３５のルートノード３５Ｒを作成する（ステップＳ１０２）。図９に示すノード３５Ａ及び３５Ｂは、ステップＳ１０２が実行される時点では生成されない。分類部１２は、特徴抽出部１１により生成された全ての転移候補特徴データ４２を、ルートノード３５Ｒに入力する（ステップＳ１０３）。ルートノード３５Ｒに入力される転移候補特徴データ４２の数は、３００００である。 The classification unit 12 creates the root node 35R of the classification tree 35 (step S102). The nodes 35A and 35B shown in FIG. 9 are not generated at the time when step S102 is executed. The classification unit 12 inputs all the transfer candidate feature data 42 generated by the feature extraction unit 11 to the root node 35R (step S103). The number of transfer candidate feature data 42 input to the root node 35R is 30,000.

次に、事前ドメイン決定部１４は、分類木３５において、全てのノードを分類対象ノードとして選択したか否かを判断する（ステップＳ１０４）。ルートノード３５Ｒが分類対象として選択されていないため（ステップＳ１０４においてＮｏ）、事前ドメイン決定部１４は、ルートノード３５Ｒを分類対象に選択する（ステップＳ１０５）。 Next, the advance domain determination unit 14 determines whether all nodes have been selected as classification target nodes in the classification tree 35 (step S104). Since the root node 35R is not selected as a classification target (No in step S104), the prior domain determination unit 14 selects the root node 35R as a classification target (step S105).

事前ドメイン決定部１４は、ステップＳ１０６を実行して、ルートノード３５Ｒが事前ドメインとしての条件を満たしているか否かを判断する。具体的には、事前ドメイン決定部１４は、ルートノード３５Ｒに属する転移候補特徴データ４２の数を取得する。事前ドメイン決定部１４は、取得した転移候補特徴データ４２の数が予め設定された分類継続基準値より大きいか否かを判断する（ステップＳ１０６）。分類継続基準値は、例えば、９２７０に設定される。 The prior domain determination unit 14 executes step S106 to determine whether or not the root node 35R satisfies the condition as the prior domain. Specifically, the advance domain determination unit 14 acquires the number of transfer candidate feature data 42 belonging to the root node 35R. The pre-domain determining unit 14 determines whether the number of acquired transfer candidate feature data 42 is larger than a preset classification continuation reference value (step S106). The classification continuation reference value is set to 9270, for example.

ルートノード３５Ｒに属する転移候補特徴データ４２の数（３００００）は、分類継続基準値（９２７０）より大きい（ステップＳ１０６においてＹｅｓ）。この場合、ルートノード３５Ｒに属する転移候補特徴データ４２の数が多すぎるため、ルートノード３５Ｒを事前ドメイン４５として用いることができない。 The number (30000) of transition candidate feature data 42 belonging to the root node 35R is larger than the classification continuation reference value (9270) (Yes in step S106). In this case, since the number of transfer candidate feature data 42 belonging to the root node 35R is too large, the root node 35R can not be used as the prior domain 45.

上述のように、１つの事前ドメインが全ての転移候補特徴データ４２を含む場合、転移学習を導入した機械学習により生成される転移識別データ８０の精度が低下する。ルートノード３５Ｒは、分類継続基準値よりも多い転移候補特徴データ４２を含むため、上記１つの事前ドメインと同様に、目標ドメイン５０Ｂの領域から大きく離れた転移候補特徴データ４２を数多く含む。この場合、事前ドメイン決定部１４は、ルートノード３５Ｒに属する転移候補特徴データ４２を分類する条件の１つが満たされていると判断する。 As described above, when one prior domain includes all the transition candidate feature data 42, the accuracy of the transition identification data 80 generated by machine learning in which transition learning is introduced is reduced. Since the root node 35R includes transfer candidate feature data 42 more than the classification continuation reference value, it includes many transfer candidate feature data 42 largely separated from the area of the target domain 50B, as in the case of the one prior domain. In this case, the prior domain determination unit 14 determines that one of the conditions for classifying the transfer candidate feature data 42 belonging to the root node 35R is satisfied.

分類継続基準値は、特徴抽出部１１により抽出される特徴量の次元数より大きい。例えば、本実施の形態では、分類継続基準値は、転移候補特徴データ４２の次元数（３２４０）の３倍である９７２０に設定される。 The classification continuation reference value is larger than the number of dimensions of the feature quantity extracted by the feature extraction unit 11. For example, in the present embodiment, the classification continuation reference value is set to 9720, which is three times the number of dimensions (3240) of the transition candidate feature data.

次に、クラスタリング装置１０は、ステップＳ１０７及びＳ１０８を実行して、ルートノード３５Ｒの共分散に基づいて、ルートノード３５Ｒに属する転移候補特徴データ４２を分類する条件が満たされているか否かを判断する。 Next, the clustering apparatus 10 executes steps S107 and S108 to determine whether the condition for classifying the transition candidate feature data 42 belonging to the root node 35R is satisfied based on the covariance of the root node 35R. Do.

事前ドメイン決定部１４は、分類対象のノード（ルートノード３５Ｒ）の共分散４４（図３参照）の計算を分類部１２に指示する。分類部１２は、事前ドメイン決定部１４の指示に応じて、分類対象のノード（ルートノード３５Ｒ）に属する転移候補特徴データ４２を分散計算部１３に出力する。分散計算部１３は、分類部１２から出力された転移候補特徴データ４２を用いて、分類対象のノードに属する転移候補特徴データ４２の特徴量の共分散１３Ａを計算する。分散計算部１３は、計算した共分散１３Ａを事前ドメイン決定部１４に出力する。 The prior domain determination unit 14 instructs the classification unit 12 to calculate the covariance 44 (see FIG. 3) of the node (root node 35R) to be classified. The classification unit 12 outputs the transition candidate feature data 42 belonging to the classification target node (the root node 35R) to the distribution calculation unit 13 according to the instruction of the prior domain determination unit 14. The variance calculation unit 13 uses the transition candidate feature data 42 output from the classification unit 12 to calculate the covariance 13A of the feature amounts of the transition candidate feature data 42 belonging to the node to be classified. The variance calculating unit 13 outputs the calculated covariance 13A to the pre-domain determining unit 14.

事前ドメイン決定部１４は、分散計算部１３により計算された共分散１３Ａ（ルートノード３５Ｒの共分散）が予め設定された分散基準値よりも大きいか否かを判断する（ステップＳ１０８）。共分散１３Ａは、分散基準値よりも大きいと仮定する（ステップＳ１０８においてＹｅｓ）。 The prior domain determination unit 14 determines whether the covariance 13A (covariance of the root node 35R) calculated by the dispersion calculation unit 13 is larger than a predetermined dispersion reference value (step S108). It is assumed that the covariance 13A is larger than the dispersion reference value (Yes in step S108).

上述のように、ルートノード３５Ｒは全ての転移候補特徴データ４２を含んでおり、全ての転移候補特徴データ４２のばらつきは非常に大きい。この場合、共分散４４は非常に大きいため、事前ドメイン決定部１４は、ルートノード３５Ｒに属する転移候補特徴データ４２をさらに分類することができると判断する。事前ドメイン決定部１４は、ルートノード３５Ｒに属する転移候補特徴データ４２の分類を分類部１２に指示する。 As described above, the root node 35R includes all the transition candidate feature data 42, and the variation of all the transition candidate feature data 42 is very large. In this case, since the covariance 44 is very large, the prior domain determination unit 14 determines that the transfer candidate feature data 42 belonging to the root node 35R can be further classified. The prior domain determination unit 14 instructs the classification unit 12 to classify the transfer candidate feature data 42 belonging to the root node 35R.

［３．３．転移候補特徴データ４２の分類］
分類部１２は、事前ドメイン決定部１４の指示に応じて、ルートノード３５Ｒに属する転移候補特徴データ４２を分類するために、ルートノード３５Ｒの子ノードとして、ノード３５Ａ及び３５Ｂを生成する（ステップＳ１０９）。 [3.3. Classification of transfer candidate feature data 42]
The classification unit 12 generates nodes 35A and 35B as child nodes of the root node 35R in order to classify the transition candidate feature data 42 belonging to the root node 35R according to an instruction of the prior domain determination unit 14 (step S109). ).

分類部１２は、ルートノード３５Ｒに属する転移候補特徴データ４２を、ステップＳ１０９で生成したノード３５Ａ及び３５Ｂのいずれかに分類する（ステップＳ１１０）。具体的には、下記式（１）に示す目的関数Ｉに基づいて、転移候補特徴データ４２の分類先のノードが決定される。 The classification unit 12 classifies the transition candidate feature data 42 belonging to the root node 35R into one of the nodes 35A and 35B generated in step S109 (step S110). Specifically, the classification destination node of the transfer candidate feature data 42 is determined based on the objective function I shown in the following equation (1).

式（１）において、Ｓは、親ノード（ルートノード３５Ｒ）である。Ｓ^Ｌは、２つの子ノードのうち左側のノード（ノード３５Ａ）であり、Ｓ^Ｒは、２つの子ノードのうち右側のノード（ノード３５Ｂ）である。Λ（Ｓ）は、親ノードの共分散であり、Λ（Ｓ^Ｌ）は、左側の子ノードの共分散であり、Λ（Ｓ^Ｒ）は、右側の子ノードの共分散である。 In Formula (1), S is a parent node (root node 35R). S ^L is a left node of the two child nodes (node 35A), ^{S R} is the right node of the two child nodes (node 35B). Λ (S) is the covariance of the parent node, Λ (S ^L ) is the covariance of the left child node, and Λ (S ^R ) is the covariance of the right child node.

分類部１２は、式（１）に示す目的関数Ｉを計算するために、ルートノード３５Ｒに属する転移候補特徴データ４２を暫定的に分類する。具体的には、分類部１２は、以下のようにして、転移候補特徴データ４２の暫定的な分岐条件を設定する。 The classification unit 12 provisionally classifies the transition candidate feature data 42 belonging to the root node 35R in order to calculate the objective function I shown in Equation (1). Specifically, the classification unit 12 sets a provisional branch condition of the transition candidate feature data 42 as follows.

転移候補特徴データ４２の次元数は、３２４０である。つまり、転移候補特徴データ４２は、３２４０個の特徴量を有する。分類部１２は、３２４０個の特徴量のうち、ｋ番目（０≦ｋ≦３２３９）の特徴量をランダムに選択し、ｋ番目の特徴量のしきい値をランダムに設定する。これにより、暫定的な分岐条件が設定される。 The number of dimensions of the transfer candidate feature data 42 is 3240. That is, the transfer candidate feature data 42 has 3240 feature quantities. The classification unit 12 randomly selects the k-th (0 ≦ k ≦ 3239) feature amount from the 3240 feature amounts, and randomly sets the threshold value of the k-th feature amount. Thus, a provisional branch condition is set.

分類部１２は、設定した分岐条件に基づいて、ルートノード３５Ｒに属する転移候補特徴データ４２を、ノード３５Ａ又は３５Ｂに暫定的に分類する。分散計算部１３は、ノード３５Ａに分類された転移候補特徴データ４２の共分散と、ノード３５Ｂに暫定的に分類された転移候補特徴データ４２の共分散とを計算する。ルートノード３５Ｒの共分散は、ステップＳ１０５において既に計算されている。分類部１２は、これら３つの共分散を用いて、ルートノード３５Ｒの目的関数Ｉを計算する。 The classification unit 12 provisionally classifies the transition candidate feature data 42 belonging to the root node 35R into the node 35A or 35B based on the set branching condition. The variance calculation unit 13 calculates the covariance of the transition candidate feature data 42 classified into the node 35A and the covariance of the transition candidate feature data 42 provisionally classified into the node 35B. The covariance of the root node 35R has already been calculated in step S105. The classification unit 12 calculates an objective function I of the root node 35R using these three covariances.

分類部１２は、ルートノード３５Ｒにおいて複数の分岐条件を設定する。分類部１２は、各分岐条件に対応する目的関数Ｉを計算するために、各分岐条件に基づいて転移候補特徴データ４２を暫定的に分類する。暫定的に分類された転移候補特徴データ４２に基づいて、各分岐条件における目的関数Ｉを計算する。分類部１２は、計算した複数の目的関数Ｉの中で最大の目的関数Ｉを特定する。分類部１２は、最大の目的関数Ｉに対応する分岐条件で、ルートノード３５Ｒに属する転移候補特徴データ４２を分類することを決定する。これにより、ルートノード３５Ｒに属する転移候補特徴データ４２は、ノード３５Ａ及び３５Ｂのいずれかに分類される。 The classification unit 12 sets a plurality of branch conditions in the root node 35R. The classification unit 12 provisionally classifies the transition candidate feature data 42 based on each branch condition in order to calculate an objective function I corresponding to each branch condition. An objective function I in each branch condition is calculated based on the provisionally classified transition candidate feature data 42. The classification unit 12 specifies the largest objective function I among the calculated plurality of objective functions I. The classification unit 12 determines to classify the transition candidate feature data 42 belonging to the root node 35R under the branch condition corresponding to the largest objective function I. Thereby, the transition candidate feature data 42 belonging to the root node 35R is classified into one of the nodes 35A and 35B.

図１０は、ルートノード３５Ｒに属する転移候補特徴データ４２が分類された後における分類木３５を示す図である。なお、転移候補特徴データ４２のノード３５Ａ及び３５Ｂへの分類が終了した時点では、ノード３５Ｂの子ノード（ノード３５Ｃ及び３５Ｄ）は、生成されていない。 FIG. 10 is a diagram showing the classification tree 35 after the transfer candidate feature data 42 belonging to the root node 35R is classified. At the time when classification of the transition candidate feature data 42 into the nodes 35A and 35B is finished, child nodes (nodes 35C and 35D) of the node 35B are not generated.

ルートノード３５Ｒに属する３００００個の転移候補特徴データ４２を分類した結果、７０００個の転移候補特徴データ４２が、ノード３５Ａに分類される。２３０００個の転移候補特徴データ４２が、ノード３５Ｂに分類される。これにより、ルートノード３５Ｒに属する転移候補特徴データ４２を２つの子ノードに分類するステップＳ１１０が終了する。 As a result of classifying 30,000 transition candidate feature data 42 belonging to the root node 35R, 7000 transition candidate feature data 42 are classified into the node 35A. The 23,000 transfer candidate feature data 42 are classified into the node 35B. Thus, step S110 of classifying transition candidate feature data 42 belonging to the root node 35R into two child nodes ends.

［３．４．ノード３５Ａにおける判断］
ルートノード３５Ｒに属する転移候補特徴データ４２の分類が終了した後に、事前ドメイン決定部１４は、分類対象として全てのノードが選択されたか否かを判断する（ステップＳ１０４）。事前ドメイン決定部１４は、選択されていないノード３５Ａ及び３５Ｂが存在するため（ステップＳ１０４において、Ｎｏ）、次の判断対象となるノードを前順で選択する（ステップＳ１０５）。具体的には、分類部１２は、ノード３５Ａを選択する。 3.4. Decision at node 35A]
After the classification of the transition candidate feature data 42 belonging to the root node 35R is completed, the prior domain determination unit 14 determines whether all the nodes have been selected as the classification target (step S104). Since there are nodes 35A and 35B which have not been selected (No in step S104), the pre-domain determining unit 14 selects nodes to be the next determination target in the order of precedence (step S105). Specifically, the classification unit 12 selects the node 35A.

図１０に示すように、ノード３５Ａに属する転移候補特徴データ４２の数は、７０００である。ノード３５Ａに属する転移候補特徴データ４２の数が、分類継続基準値（９２７０）以下であるため（ステップＳ１０６においてＮｏ）、事前ドメイン決定部１４は、ノード３５Ａを事前ドメイン４５に決定する（ステップＳ１１１）。つまり、事前ドメイン決定部１４は、ノード３５Ａに属する転移候補特徴データ４２をさらに分類しないことを決定し、ノード３５Ａをリーフノードに設定する。 As shown in FIG. 10, the number of transfer candidate feature data 42 belonging to the node 35A is 7000. Since the number of transfer candidate feature data 42 belonging to the node 35A is equal to or less than the classification continuation reference value (9270) (No in step S106), the prior domain determination unit 14 determines the node 35A as the prior domain 45 (step S111). ). That is, the advance domain determination unit 14 determines not to further classify the transition candidate feature data 42 belonging to the node 35A, and sets the node 35A as a leaf node.

［３．５．ノード３５Ｂにおける判断］
次に、事前ドメイン決定部１４は、ノード３５Ｂを判断対象として選択する（ステップＳ１０５）。ノード３５Ｂに属する転移候補特徴データ４２の数は、２３０００であり、分類継続基準値（９２７０）よりも大きい（ステップＳ１０６においてＹｅｓ）。また、ノード３５Ｂの共分散は、分散基準値よりも大きいと仮定する（ステップＳ１０８においてＹｅｓ）。この場合、事前ドメイン決定部１４は、ノード３５Ｂに属する転移候補特徴データ４２をさらに分類することを決定する。 [3.5. Decision at node 35 B]
Next, the prior domain determination unit 14 selects the node 35B as a determination target (step S105). The number of transfer candidate feature data 42 belonging to the node 35B is 23000, which is larger than the classification continuation reference value (9270) (Yes in step S106). Further, it is assumed that the covariance of the node 35B is larger than the dispersion reference value (Yes in step S108). In this case, the prior domain determination unit 14 determines to further classify the transfer candidate feature data 42 belonging to the node 35B.

分類部１２は、ノード３５Ｂに対する事前ドメイン決定部１４の決定に応じて、ノード３５Ｂの子ノード（ノード３５Ｃ及び３５Ｄ）を生成する（ステップＳ１０９）。分類部１２は、ルートノード３５Ｒにおける転移候補特徴データ４２の分類と同様に、ノード３５Ｂに属する転移候補特徴データ４２を、ノード３５Ｃ及び３５Ｄのいずれかに分類する（ステップＳ１１０）。 The classification unit 12 generates child nodes (nodes 35C and 35D) of the node 35B according to the determination of the prior domain determination unit 14 with respect to the node 35B (step S109). The classification unit 12 classifies the transition candidate feature data 42 belonging to the node 35B into any one of the nodes 35C and 35D, similarly to the classification of the transition candidate feature data 42 in the root node 35R (step S110).

図１１は、事前ドメイン生成処理（ステップＳ１）が終了した後の分類木３５を示す図である。図１１に示すように、ノード３５Ｂに属する転移候補特徴データ４２を、ノード３５Ｃ及び３５Ｄに分類した結果、１５０００個の転移候補特徴データ４２が、ノード３５Ｃに分類され、８０００個の転移候補特徴データ４２が、ノード３５Ｄに分類される。 FIG. 11 is a diagram showing the classification tree 35 after the prior domain generation processing (step S1) is finished. As shown in FIG. 11, as a result of classifying transition candidate feature data 42 belonging to node 35B into nodes 35C and 35D, 15,000 transition candidate feature data 42 are classified into node 35C, and 8,000 transition candidate feature data 42 are classified as node 35D.

ノード３５Ｃに属する転移候補特徴データ４２の数は、分類継続基準値（９２７０）よりも大きい（ステップＳ１０６においてＹｅｓ）。また、ノード３５Ｃの共分散が分散基準値よりも大きいと仮定する（ステップＳ１０８においてＹｅｓ）。この場合、事前ドメイン決定部１４は、ノード３５Ｃに属する転移候補特徴データ４２をさらに分類することを決定する。ノード３５Ｃに属する転移候補特徴データ４２の分類については、後述する。 The number of transfer candidate feature data 42 belonging to the node 35C is larger than the classification continuation reference value (9270) (Yes in step S106). Further, it is assumed that the covariance of the node 35C is larger than the dispersion reference value (Yes in step S108). In this case, the prior domain determination unit 14 determines to further classify the transfer candidate feature data 42 belonging to the node 35C. The classification of the transfer candidate feature data 42 belonging to the node 35C will be described later.

一方、ノード３５Ｄに属する転移候補特徴データ４２の数が分類継続基準値以下であるため（ステップＳ１０６においてＮｏ）、事前ドメイン決定部１４は、ノード３５Ｄを事前ドメインに決定する。 On the other hand, since the number of transfer candidate feature data 42 belonging to the node 35D is equal to or less than the classification continuation reference value (No in step S106), the prior domain determination unit 14 determines the node 35D to be a prior domain.

［３．６．転移候補特徴データ４２の分類の終了］
分類部１２は、ノード３５Ｃの子ノードとしてノード３５Ｅ及び３５Ｆを生成し（ステップＳ１０９）、ノード３５Ｃに属する転移候補特徴データ４２をノード３５Ｅ及び３５Ｆに分類する（ステップＳ１１０）。 [3.6. End of classification of transfer candidate feature data 42]
The classification unit 12 generates nodes 35E and 35F as child nodes of the node 35C (step S109), and classifies the transition candidate feature data 42 belonging to the node 35C into the nodes 35E and 35F (step S110).

ノード３５Ｅに属する転移候補特徴データ４２の数は、５００であり、分類継続基準値以下である（ステップＳ１０６においてＮｏ）。このため、事前ドメイン決定部１４は、ノード３５Ｅを事前ドメインに決定する（ステップＳ１１１）。 The number of transfer candidate feature data 42 belonging to the node 35E is 500, which is equal to or less than the classification continuation reference value (No in step S106). Therefore, the prior domain determination unit 14 determines the node 35E as the prior domain (step S111).

ノード３５Ｆに属する転移候補特徴データ４２の数は、１４５００であり、分類継続基準値よりも大きい（ステップＳ１０６においてＹｅｓ）。一方、ノード３５Ｆの共分散が、分散基準値よりも小さいと仮定する（ステップＳ１０８においてＮｏ）。この場合、事前ドメイン決定部１４は、ノード３５Ｆに属する転移候補特徴データ４２の特徴量の分布のばらつきが非常に小さいと判断する。例えば、ノード３５Ｆに属する転移候補特徴データ４２の大半が、同一の画像から生成される場合が考えられる。この場合、事前ドメイン決定部１４は、ノード３５Ｆに含まれる転移候補特徴データ４２をさらに分類することができないと判断し、ノード３５Ｆを事前ドメインに決定する（ステップＳ１１１）。これにより、分類木３５を構成する全てのノードを判断対象として選択したため（ステップＳ１０４においてＹｅｓ）、クラスタリング装置１０は、ステップＳ１１２に進む。 The number of transfer candidate feature data 42 belonging to the node 35F is 14500, which is larger than the classification continuation reference value (Yes in step S106). On the other hand, it is assumed that the covariance of the node 35F is smaller than the dispersion reference value (No in step S108). In this case, the prior domain determination unit 14 determines that the distribution of the feature amounts of the transfer candidate feature data 42 belonging to the node 35F is very small. For example, it can be considered that most of the transition candidate feature data 42 belonging to the node 35F is generated from the same image. In this case, the prior domain determination unit 14 determines that the transfer candidate feature data 42 included in the node 35F can not be further classified, and determines the node 35F as the prior domain (step S111). As a result, since all nodes constituting the classification tree 35 have been selected as judgment targets (Yes in step S104), the clustering device 10 proceeds to step S112.

［３．７．事前ドメインの除外］
事前ドメイン決定部１４は、事前ドメインに決定された各ノードが有する転移候補特徴データ４２の数を確認する。事前ドメイン決定部１４は、予め設定された破棄基準値以下の数の転移候補特徴データ４２を有するノードがある場合、このノードを事前ドメインから除外する（ステップＳ１１２）。破棄基準値は、例えば、転移候補特徴データ４２の次元数（３２４０）に設定される。具体的には、事前ドメインに決定されたノード３５Ｅは、転移候補特徴データ４２の数が５００であるため、事前ドメインから除外される。 [3.7. Advance Domain Exclusion]
The prior domain determination unit 14 confirms the number of transfer candidate feature data 42 possessed by each node determined in the prior domain. If there is a node having transfer candidate feature data 42 whose number is equal to or less than the preset discard reference value, the pre-domain determining unit 14 excludes this node from the pre-domain (step S112). The discarding reference value is set to, for example, the number of dimensions (3240) of the transition candidate feature data 42. Specifically, the node 35E determined to be the pre-domain is excluded from the pre-domain because the number of transfer candidate feature data 42 is 500.

上述のように、学習に用いられる数のデータが次元数よりも少ない場合、生成される識別装置の精度が低下する可能性がある。 As mentioned above, if the number of data used for learning is less than the number of dimensions, the accuracy of the generated identification device may be reduced.

分類継続基準値は、特徴抽出部１１により抽出される特徴量の次元数より大きい。機械学習において、学習に用いられるデータの数が学習に用いられるデータの次元数よりも少ない場合、学習に用いられるデータの特徴の学習結果が過大に評価され、転移識別データ８０の精度が低下する。このため、本実施の形態では、破棄基準値が、転移候補特徴データ４２の次元数である３２４０に設定される。これにより、事前ドメイン４５に属する転移候補特徴データ４２の数が、転移候補特徴データ４２の次元数より少なくなることを防ぐことができる。 The classification continuation reference value is larger than the number of dimensions of the feature quantity extracted by the feature extraction unit 11. In machine learning, when the number of data used for learning is smaller than the number of dimensions of data used for learning, the learning result of the feature of the data used for learning is overestimated, and the accuracy of the transition identification data 80 decreases. . Therefore, in the present embodiment, the discarding reference value is set to 3240, which is the number of dimensions of the transition candidate feature data 42. As a result, it is possible to prevent the number of transfer candidate feature data 42 belonging to the prior domain 45 from becoming less than the number of dimensions of transfer candidate feature data 42.

また、ある事前ドメインに含まれる転移候補特徴データ４２の数が、破棄基準値よりも少ない場合、この事前ドメインに含まれる転移候補特徴データ４２は、検出対象の特徴を有していない可能性が高い。 In addition, when the number of transfer candidate feature data 42 included in a certain prior domain is smaller than the discarding reference value, there is a possibility that the transfer candidate feature data 42 included in this prior domain does not have the feature to be detected. high.

例えば、インターネット上で人物の画像を収集する際に、人物以外の物が撮影された画像が転移候補データ４１として誤って取得される場合がある。誤って収集された転移候補データ４１から生成された転移候補特徴データ４２は、人物の特徴を有する転移候補特徴データ４２と異なる特徴を有し、転移学習に有効でない。また、検索条件が人物を撮影した画像であるため、人物以外の物が撮影された画像が転移候補データ４１の集合において占める割合は、非常に小さいと想定される。 For example, when collecting an image of a person on the Internet, an image in which an object other than the person is photographed may be erroneously acquired as the transfer candidate data 41. The transition candidate feature data 42 generated from the erroneously collected transition candidate data 41 has features different from the transition candidate feature data 42 having human features, and is not effective for transition learning. In addition, since the search condition is an image obtained by shooting a person, it is assumed that the ratio of an image obtained by shooting an object other than a person in the set of transfer candidate data 41 is very small.

従って、あるノードに属する転移候補特徴データ４２の数が破棄基準値よりも少ない場合、このノードは、誤って収集された転移候補データ４１から生成された転移候補特徴データ４２により構成されると考えられる。事前ドメイン決定部１４は、破棄基準値以下の数の転移候補特徴データ４２を有するノードを、事前ドメインから除外する。 Therefore, when the number of transfer candidate feature data 42 belonging to a certain node is smaller than the discarding reference value, this node is considered to be configured by transfer candidate feature data 42 generated from transfer candidate data 41 collected erroneously. Be The prior domain determination unit 14 excludes, from the prior domain, nodes having transfer candidate feature data 42 in number equal to or less than the discarding reference value.

この結果、図１１に示す分類木３５において、ノード３５Ａ、３５Ｄ及び３５Ｆが事前ドメイン４５に決定される。クラスタリング装置１０は、決定された３つの事前ドメイン４５を事前ドメイン評価装置２０及び選択学習装置３０に出力する。 As a result, in the classification tree 35 shown in FIG. 11, nodes 35 A, 35 D and 35 F are determined as the pre-domain 45. The clustering device 10 outputs the determined three prior domains 45 to the prior domain evaluation device 20 and the selection learning device 30.

以上説明したように、クラスタリング装置１０は、転移候補データ４１の各々から特徴を抽出して複数の転移候補特徴データ４２を生成し、分類木３５を作成する過程で、複数の転移候補特徴データ４２を分類木３５のノードに分類する。クラスタリング装置１０は、ノードに属する転移候補特徴データ４２の数が分類継続基準値以下であるか、ノードに属する転移候補特徴データ４２の共分散が分散基準値以下である場合、このノードを事前ドメインに決定する。これにより、互いに類似する、又は共通する特徴を有する転移候補特徴データ４２により構成される事前ドメインを生成することができる。 As described above, the clustering device 10 extracts a feature from each of the transfer candidate data 41 to generate a plurality of transfer candidate feature data 42, and generates a plurality of transfer candidate feature data 42 in the process of creating the classification tree 35. Are classified into the nodes of the classification tree 35. If the number of transfer candidate feature data 42 belonging to a node is less than or equal to the classification continuation reference value or the covariance of transfer candidate feature data 42 belonging to the node is less than or equal to the distribution reference value Decide on. Thereby, it is possible to generate a pre-domain composed of transfer candidate feature data 42 having similar or common features to each other.

［４．事前ドメイン評価処理（ステップＳ２）］
図１２は、図５に示す事前ドメイン評価処理（ステップＳ２）のフローチャートである。事前ドメイン評価装置２０が、ステップＳ２に示す処理を開始する際に、試行転移識別部６３が試行転移学習部２３内に生成されておらず、比較識別部６４が比較学習部２４内に生成されていない。 [4. Prior domain evaluation process (step S2)]
FIG. 12 is a flowchart of the prior domain evaluation process (step S2) shown in FIG. When the prior domain evaluation device 20 starts the process shown in step S2, the trial transition identification unit 63 is not generated in the trial transition learning unit 23, and the comparison identification unit 64 is generated in the comparison learning unit 24. Not.

［４．１．目標ドメイン５０Ｂの生成］
事前ドメイン評価装置２０は、クラスタリング装置１０により生成された事前ドメイン４５を取得する。具体的には、事前ドメイン評価装置２０は、図１１に示す分類木３５を作成する過程で生成された３つの事前ドメイン４５（図１１に示すノード３５Ａ、３５Ｄ、３５Ｆ）を取得する。事前ドメイン評価装置２０は、取得した事前ドメイン４５を一時記憶部２１に記憶する（ステップＳ２０１）。 [4.1. Generation of target domain 50B]
The prior domain evaluation device 20 acquires the prior domain 45 generated by the clustering device 10. Specifically, the preliminary domain evaluation device 20 acquires three preliminary domains 45 (nodes 35A, 35D, 35F shown in FIG. 11) generated in the process of creating the classification tree 35 shown in FIG. The prior domain evaluation device 20 stores the acquired prior domain 45 in the temporary storage unit 21 (step S201).

以下、ノード３５Ａ、３５Ｄ及び３５Ｆを、それぞれ「事前ドメイン３５Ａ」、「事前ドメイン３５Ｄ」、「事前ドメイン３５Ｆ」と記載する。 The nodes 35A, 35D, and 35F are hereinafter described as "pre-domain 35A", "pre-domain 35D", and "pre-domain 35F", respectively.

特徴抽出部２２（図３参照）が、記憶装置５００に記憶された目標ドメイン５０Ａを取得する。特徴抽出部２２は、取得した目標ドメイン５０Ａに含まれる学習用データ５１の各々から特徴量を抽出することにより、学習用データ５１の各々に対応する複数の学習用特徴データ５２を生成する（ステップＳ２０２）。これにより、複数の学習用特徴データ５２により構成される目標ドメイン５０Ｂが生成される。特徴抽出部２２は、生成した目標ドメイン５０Ｂを試行転移学習部２３に出力する。 The feature extraction unit 22 (see FIG. 3) acquires the target domain 50A stored in the storage device 500. The feature extraction unit 22 generates a plurality of learning feature data 52 corresponding to each of the learning data 51 by extracting feature amounts from each of the learning data 51 included in the acquired target domain 50A (Step S202). As a result, a target domain 50B configured of a plurality of learning feature data 52 is generated. The feature extraction unit 22 outputs the generated target domain 50B to the trial transfer learning unit 23.

特徴抽出部２２は、特徴抽出部１１（図２参照）が転移候補データ４１から転移候補特徴データ４２を生成する時と同じ条件で、特徴量の抽出を行う。従って、学習用特徴データ５２の次元数は、転移候補特徴データ４２の次元数と同じ３２４０である。この理由については後述する。 The feature extraction unit 22 extracts feature amounts under the same conditions as when the feature extraction unit 11 (see FIG. 2) generates the transition candidate feature data 42 from the transition candidate data 41. Therefore, the number of dimensions of the learning feature data 52 is 3240, which is the same as the number of dimensions of the transfer candidate feature data 42. The reason will be described later.

事前ドメイン評価装置２０は、一時記憶部２１に記憶された事前ドメイン４５のうち、転移学習に有効か否かを評価する対象となる事前ドメインを１つ選択する（ステップＳ２０３）。具体的には、一時記憶部２１に記憶されている事前ドメイン３５Ａ、３５Ｄ及び３５Ｆのうち、最初に事前ドメイン３５Ａが選択される。 The preliminary domain evaluation device 20 selects one of the preliminary domains 45 stored in the temporary storage unit 21 as a target to be evaluated for whether it is effective for transfer learning (step S203). Specifically, the pre-domain 35A is selected first among the pre-domains 35A, 35D, and 35F stored in the temporary storage unit 21.

［４．２．比較学習及び試行転移学習］
比較学習部２４は、ステップＳ２０３において選択された事前ドメイン３５Ａを入力する。比較学習部２４は、入力した事前ドメイン３５Ａを学習する（ステップＳ２０４）。比較学習部２４の学習アルゴリズムは、転移学習が導入されていないランダムフォレストである。比較学習部２４は、ステップＳ２０４を実行することにより、事前ドメイン３５Ａの学習結果を反映した比較識別部６４を生成する。比較識別部６４は、複数の決定木の構造を示すデータ群である。 [4.2. Comparative Learning and Trial Transfer Learning]
The comparison learning unit 24 inputs the prior domain 35A selected in step S203. The comparison learning unit 24 learns the input pre-domain 35A (step S204). The learning algorithm of the comparison learning unit 24 is a random forest in which transfer learning is not introduced. The comparison learning unit 24 generates the comparison identification unit 64 reflecting the learning result of the pre-domain 35A by executing step S204. The comparison identification unit 64 is a data group indicating the structure of a plurality of decision trees.

試行転移学習部２３は、特徴抽出部２２から目標ドメイン５０Ｂを取得し、一時記憶部２１から事前ドメイン３５Ａを取得する。試行転移学習部２３は、入力した目標ドメイン５０Ｂ及び事前ドメイン３５Ａを用いて、転移学習を導入した機械学習を行う（ステップＳ２０５）。試行転移学習部２３の学習アルゴリズムは、転移学習を導入したランダムフォレストである。試行転移学習部２３は、ステップＳ２０５を実行することにより、目標ドメイン５０Ａ及び事前ドメイン３５Ａの学習結果を反映した試行転移識別部６３を生成する。試行転移識別部６３は、複数の決定木の構成を示すデータ群である。試行転移学習部２３において用いられる学習アルゴリズム及びドメインが比較学習部２４のものと異なるため、試行転移識別部６３の構造は、比較識別部６４の構造と異なる。 The trial transfer learning unit 23 acquires the target domain 50B from the feature extraction unit 22, and acquires the prior domain 35A from the temporary storage unit 21. The trial transfer learning unit 23 performs machine learning in which transfer learning is introduced using the input target domain 50B and the pre-domain 35A (step S205). The learning algorithm of the trial transfer learning unit 23 is a random forest in which transfer learning is introduced. The trial transfer learning unit 23 generates the trial transfer identification unit 63 reflecting the learning result of the target domain 50A and the pre-domain 35A by executing step S205. The trial transition identification unit 63 is a data group indicating the configuration of a plurality of decision trees. Since the learning algorithm and domain used in the trial transfer learning unit 23 are different from those of the comparison learning unit 24, the structure of the trial transfer identifying unit 63 is different from the structure of the comparison identifying unit 64.

［４．３．事前ドメインの評価（ステップＳ２０６）］
評価部２５は、試行転移学習部２３により生成された試行転移識別部６３と比較学習部２４により生成された比較識別部６４とを用いて、評価対象の事前ドメイン３５Ａが転移学習に有効であるか否かを判断する（ステップＳ２０６）。 [4.3. Evaluation of pre-domain (step S206)]
The evaluation unit 25 uses the trial transfer identification unit 63 generated by the trial transfer learning unit 23 and the comparison identification unit 64 generated by the comparison learning unit 24 so that the prior domain 35A to be evaluated is effective for transfer learning. It is determined whether or not (step S206).

評価部２５は、転移学習の有効性を判断するために、競合値２５１Ａ及び信頼度２５２Ａの２種類のパラメータを計算する。評価部２５は、信頼度２５２Ａを計算する場合、サンプルグループに含まれるデータの試行転移識別部６３による識別結果を利用する。ここで、サンプルグループとは、目標ドメイン５０Ｂに含まれる学習用特徴データ５２と、評価対象である事前ドメイン３５Ａに含まれる転移候補特徴データ４２とを合わせた集合である。以下、サンプルグループに含まれるデータを、「サンプルデータ」と記載する。評価部２５は、競合値２５１Ａを計算する場合、試行転移識別部６３による識別結果に加えて、比較識別部６４による識別結果を利用する。 The evaluation unit 25 calculates two types of parameters, the competition value 251A and the reliability 252A, in order to determine the effectiveness of the transition learning. The evaluation unit 25 uses the identification result of the trial transition identification unit 63 of the data included in the sample group when calculating the reliability 252A. Here, the sample group is a set obtained by combining the learning feature data 52 included in the target domain 50B and the transfer candidate feature data 42 included in the prior domain 35A to be evaluated. Hereinafter, data included in the sample group will be referred to as "sample data". When calculating the competition value 251A, the evaluation unit 25 uses the identification result by the comparison identification unit 64 in addition to the identification result by the trial transition identification unit 63.

［４．３．１．競合値２５１Ａの計算］
競合値計算部２５１は、試行転移識別部６３により生成される各画像のラベルと、比較識別部６４により生成される各画像のラベルとの比較結果に基づいて、競合値２５１Ａを計算する。 [4.3.1. Calculation of the conflict value 251A]
The competition value calculation unit 251 calculates the competition value 251A based on the comparison result of the label of each image generated by the trial transition identification unit 63 and the label of each image generated by the comparison identification unit 64.

試行転移識別部６３は、サンプルグループに含まれるサンプルデータのうち、いずれか１つを入力する。試行転移識別部６３は、サンプルデータに対して人物の識別処理を行い、識別結果を示すラベル７３を生成する。ラベル７３の値は、例えば、０又は１である。ラベル７３が０である場合、ラベル７３は、サンプルデータが人物の特徴を含まないことを示す。ラベル７３が１である場合、ラベル７３は、サンプルデータが人物の特徴を含むことを示す。試行転移識別部６３は、生成したラベル７３を競合値計算部２５１に出力する。 The trial transition identification unit 63 inputs any one of the sample data included in the sample group. The trial transition identification unit 63 performs identification processing of a person on the sample data, and generates a label 73 indicating the identification result. The value of the label 73 is, for example, 0 or 1. When the label 73 is 0, the label 73 indicates that the sample data does not include human features. When the label 73 is 1, the label 73 indicates that the sample data includes the feature of a person. The trial transition identification unit 63 outputs the generated label 73 to the conflict value calculation unit 251.

なお、試行転移識別部６３は、サンプルデータの識別結果として、ラベル７３だけでなく、ラベル７３の確からしさを示す確度８３を計算する。確度８３は、後述する信頼度２５２Ａの計算に用いられる。 The trial transition identifying unit 63 calculates not only the label 73 but also the probability 83 indicating the likelihood of the label 73 as the identification result of the sample data. The accuracy 83 is used to calculate the reliability 252A described later.

比較識別部６４は、試行転移識別部６３に入力されたサンプルデータと同じデータを入力する。比較識別部６４は、サンプルデータに対して人物の識別処理を行い、識別結果を示すラベル７４を生成する。ラベル７４の値は、ラベル７３と同様に、０又は１である。ラベル７４が０である場合、ラベル７４は、サンプルデータが人物の特徴を含まないことを示す。ラベル７４が１である場合、ラベル７４は、サンプルデータが人物の特徴を含むことを示す。比較識別部６４は、生成したラベル７４を競合値計算部２５１に出力する。 The comparison identification unit 64 inputs the same data as the sample data input to the trial transition identification unit 63. The comparison identification unit 64 performs identification processing of a person on the sample data, and generates a label 74 indicating the identification result. The value of the label 74 is 0 or 1 similarly to the label 73. If the label 74 is 0, the label 74 indicates that the sample data does not contain human features. When the label 74 is 1, the label 74 indicates that the sample data contains the feature of a person. The comparison identification unit 64 outputs the generated label 74 to the conflict value calculation unit 251.

競合値計算部２５１は、サンプルデータから生成されるラベル７３及び７４を用いて、競合値２５１Ａを計算する。競合値２５１Ａは、下記式（２）により計算される。 The competition value calculation unit 251 calculates the competition value 251A using the labels 73 and 74 generated from the sample data. The competition value 251A is calculated by the following equation (2).

式（２）において、Ｅ_ｃ１は、競合値２５１Ａを示す。Ｘは、サンプルグループを示す。ｘは、サンプルグループを構成する要素（サンプルデータ）を示す。Ｍ（ｘ）は、要素ｘから生成されたラベル７４を示す。Ｔ（ｘ）は、要素ｘから生成されたラベル７３を示す。［Ｍ（ｘ）≠Ｔ（ｘ）］は、ラベル７４とラベル７３とが一致しなかったサンプルデータの数を示す。｜Ｘ｜は、サンプルグループＸを構成する要素の数である。 In the equation (2), E _c1 represents the contention value 251A. X indicates a sample group. x indicates an element (sample data) constituting a sample group. M (x) indicates the label 74 generated from the element x. T (x) indicates the label 73 generated from the element x. [M (x) ≠ T (x)] indicates the number of sample data for which the label 74 and the label 73 did not match. | X | is the number of elements constituting the sample group X.

式（２）により計算される競合値２５１Ａは、同一のサンプルデータから生成されるラベル７３及びラベル７４が一致しない確率を示す。競合値２５１Ａは、０以上１以下の数値である。競合値２５１Ａが０に近づくほど、競合値２５１Ａは、転移学習における事前ドメイン３５Ａの有効性が高いことを示す。一方、競合値２５１Ａが１に近づくほど。転移学習における事前ドメイン３５Ａの有効性が低いことを示す。 The conflict value 251A calculated by the equation (2) indicates the probability that the labels 73 and 74 generated from the same sample data do not match. The competition value 251A is a numerical value of 0 or more and 1 or less. As the competition value 251A approaches 0, the competition value 251A indicates that the effectiveness of the pre-domain 35A in transfer learning is high. On the other hand, the closer the competition value 251A is to 1 the more. It shows that the effectiveness of the pre-domain 35A in transfer learning is low.

目標ドメイン５０Ｂに含まれる学習用特徴データ５２と、評価対象の事前ドメイン３５Ａに含まれる転移候補特徴データ４２との相違点が多い場合、事前ドメイン３５Ａは、転移学習に有効ではない。この場合、競合値２５１Ａは、１に近づく。以下、その理由を説明する。 If there are many differences between the learning feature data 52 included in the target domain 50B and the transfer candidate feature data 42 included in the pre-domain 35A to be evaluated, the pre-domain 35A is not effective for transfer learning. In this case, the contention value 251A approaches one. The reason will be described below.

上述のように、比較学習部２４は、事前ドメイン３５Ａのみを学習する。このため、事前ドメイン３５Ａの学習結果のみが、比較識別部６４に反映される。 As described above, the comparison learning unit 24 learns only the pre-domain 35A. Therefore, only the learning result of the prior domain 35A is reflected in the comparison identification unit 64.

一方、試行転移識別部６３が、目標ドメイン５０Ａ及び事前ドメイン３５Ａとを用いて転移学習を導入した機械学習を実行している。しかし、目標ドメイン５０Ｂに含まれる学習用特徴データ５２と、評価対象の事前ドメイン３５Ａに含まれる転移候補特徴データ４２との相違点が多い場合、事前ドメイン３５Ａに含まれる転移候補特徴データ４２の学習結果が、学習用特徴データ５２の学習結果に反映されない。つまり、試行転移識別部６３と、比較識別部６４とは、互いに異なるドメインを学習することにより生成されたと考えることができる。この場合、試行転移識別部６３と、比較識別部６４との識別結果が一致しない場合が増加し、競合値２５１Ａが増加する。従って、競合値２５１Ａに基づいて、事前ドメイン３５Ａが転移学習に有効であるか否かを判断することが可能となる。 On the other hand, the trial transfer identification unit 63 executes machine learning in which transfer learning is introduced using the target domain 50A and the pre-domain 35A. However, when there are many differences between the learning feature data 52 included in the target domain 50B and the transfer candidate feature data 42 included in the evaluation target prior domain 35A, the learning of transfer candidate feature data 42 included in the advance domain 35A The result is not reflected in the learning result of the learning feature data 52. That is, it can be considered that the trial transfer identification unit 63 and the comparison identification unit 64 are generated by learning different domains. In this case, the case where the identification results of the trial transition identification unit 63 and the comparison identification unit 64 do not match increases, and the competition value 251A increases. Therefore, based on the competition value 251A, it is possible to determine whether the prior domain 35A is effective for transfer learning.

［４．３．２．信頼度の計算］
信頼度計算部２５２は、試行転移識別部６３により生成される各画像のラベル７３及び確度８３に基づいて、信頼度２５２Ａを計算する。信頼度２５２Ａの計算に当たり、比較識別部６４によるサンプルデータの識別結果は使用されない。 [4.3.2. Calculation of reliability]
The reliability calculation unit 252 calculates the reliability 252A based on the label 73 and the accuracy 83 of each image generated by the trial transition identification unit 63. In the calculation of the reliability 252A, the identification result of the sample data by the comparison identification unit 64 is not used.

試行転移識別部６３は、上述のように、サンプルデータに対する人物の識別結果を示すラベル７３と、ラベル７３の確からしさを示す確度８３を生成する。確度８３は、０以上１以下の値であり、確度８３が１に近づくほど、ラベル７３が誤りである可能性が小さくなる。 As described above, the trial transition identification unit 63 generates the label 73 indicating the identification result of the person with respect to the sample data, and the accuracy 83 indicating the likelihood of the label 73. The certainty 83 is a value of 0 or more and 1 or less, and as the certainty 83 approaches 1, the possibility that the label 73 is erroneous decreases.

信頼度計算部２５２は、試行転移識別部６３から各サンプルデータのラベル７３及び確度８３を入力する。信頼度計算部２５２は、入力した各サンプルデータのラベル７３及び確度８３を用いて、下記式（３）を計算することにより信頼度２５２Ａを計算する。 The reliability calculation unit 252 inputs the label 73 and the accuracy 83 of each sample data from the trial transition identification unit 63. The reliability calculation unit 252 calculates the reliability 252A by calculating the following equation (3) using the label 73 and the accuracy 83 of each input sample data.

上記式（３）において、Ｅ_ｃ２は、信頼度２５２Ａを示す。ｘは、上記式（２）と同様に、サンプルグループＸを構成する要素（サンプルデータ）を示す。｜Ｘ｜は、サンプルグループＸの要素数である。Ｐ_Ｔ（ｘ）は、要素ｘの確度８３を示す。Ｐ_Ｔ（ｘ）は、サンプルデータが試行転移識別部６３を構成する各決定木に入力された場合において、サンプルデータが各決定木において到達したリーフノードに設定されたクラスの確率の平均である。Ｔ（ｘ）は、要素ｘのラベル７３を示す。ｙは、人物の存在を示すラベル（ｙ＝１）である。つまり、信頼度２５２Ａは、ラベル７３がラベルｙと一致する場合に算出された確度８３の合計値を、サンプルグループＸの要素数で除算した値である。信頼度２５２Ａは、０以上１以下の値であり、１に近いほど、転移学習における事前ドメイン３５Ａの有効性が高いことを示す。 In the above equation (3), E _c2 represents the reliability 252A. x shows the element (sample data) which comprises the sample group X similarly to said formula (2). | X | is the number of elements of the sample group X. P _T (x) indicates the accuracy 83 of the element x. P _T (x) is the average of the probability of the class set to the leaf node that the sample data has reached in each decision tree when the sample data is input to each decision tree constituting the trial transition identification unit 63 . T (x) indicates the label 73 of the element x. y is a label (y = 1) indicating the presence of a person. That is, the reliability 252A is a value obtained by dividing the total value of the certainty 83 calculated when the label 73 matches the label y by the number of elements of the sample group X. The reliability 252A is a value of 0 or more and 1 or less, and the closer to 1, the higher the effectiveness of the prior domain 35A in transfer learning.

事前ドメイン３５Ａの転移候補特徴データ４２が、学習用特徴データ５２の特徴量と類似する特徴量を有している場合、試行転移学習部２３は、試行転移学習により、転移候補特徴データ４２の学習結果を学習用特徴データ５２の学習結果に転移させる。試行転移識別部６３には、学習用特徴データ５２と、事前ドメイン３５Ａの転移候補特徴データ４２との学習結果が反映されている。試行転移識別部６３は、試行転移学習に用られたサンプルグループの各データに対して識別処理を行った場合、ラベル７３は１となり、その確度８３も１に近づくと考えられる。従って、学習用特徴データ５２と事前ドメイン３５Ａの転移候補特徴データ４２とが類似している場合（事前ドメイン３５Ａが転移学習において有効である場合）、信頼度２５２Ａは、１に近づく。 When the transfer candidate feature data 42 of the prior domain 35A has a feature amount similar to the feature amount of the learning feature data 52, the trial transfer learning unit 23 learns the transfer candidate feature data 42 by trial transfer learning. The result is transferred to the learning result of the learning feature data 52. The trial transition identification unit 63 reflects the learning results of the learning feature data 52 and the transition candidate feature data 42 of the pre-domain 35A. When the trial transition identification unit 63 performs identification processing on each data of the sample group used for trial transition learning, the label 73 is considered to be 1 and the certainty 83 also approaches 1. Therefore, the reliability 252A approaches 1 when the learning feature data 52 and the transition candidate feature data 42 of the pre-domain 35A are similar (when the pre-domain 35A is effective in transfer learning).

［４．３．３．転移評価部２５３による事前ドメインの評価｝
転移評価部２５３は、競合値２５１Ａ及び信頼度２５２Ａを入力する。転移評価部２５３は、入力した競合値２５１Ａ及び信頼度２５２Ａに基づいて、転移学習における事前ドメイン３５Ａの有効性を評価する。 [4.3.3. Evaluation of prior domain by metastasis evaluation unit 253}
The transition evaluation unit 253 inputs the competition value 251A and the reliability 252A. The transfer evaluation unit 253 evaluates the effectiveness of the pre-domain 35A in transfer learning based on the entered competition value 251A and the reliability 252A.

転移評価部２５３は、下記の式（４）を用いて、総合評価値を計算する。 The transfer evaluation unit 253 calculates the comprehensive evaluation value using the following equation (4).

式（４）において、Ｅは、競合値２５１Ａ及び信頼度２５２Ａから得られる総合評価値である。事前ドメイン３５Ａの転移学習における有効性が低下するにつれて、競合値２５１Ａは増加する。一方、信頼度２５２Ａは、逆に低下する。信頼度２５２Ａの傾向を競合値２５１Ａの傾向に合わせるために、１から信頼度２５２Ａを減算した値を、総合評価値の計算に使用している。 In the equation (4), E is a comprehensive evaluation value obtained from the competition value 251A and the reliability 252A. As the effectiveness in transfer learning of the pre-domain 35A decreases, the competition value 251A increases. On the other hand, the reliability 252A decreases in reverse. In order to adjust the tendency of the reliability 252A to the tendency of the competition value 251A, a value obtained by subtracting the reliability 252A from 1 is used for the calculation of the comprehensive evaluation value.

上記式（４）により計算された総合評価値は、０以上１以下の値であり、転移学習の有効性が高くなるにつれて０に近づく。転移評価部２５３は、計算された総合評価値が予め設定されたしきい値よりも小さい場合、事前ドメイン３５Ａが転移学習において有効であると判断する。 The comprehensive evaluation value calculated by the above equation (4) is a value of 0 or more and 1 or less, and approaches 0 as the effectiveness of transfer learning becomes higher. The transfer evaluation unit 253 determines that the pre-domain 35A is effective in transfer learning, when the calculated comprehensive evaluation value is smaller than a preset threshold value.

［４．４．次の事前ドメインの指定］
事前ドメイン３５Ａの転移学習における有効性の評価（ステップＳ２０６）が終了した後に、事前ドメイン３５Ａの有効性の評価に用いられた試行転移識別部６３及び比較識別部６４が削除される（ステップＳ２０７）。事前ドメイン３５Ａに対応する試行転移識別部６３及び比較識別部６４は、転移学習における他の事前ドメインの有効性の評価で使用されないためである。 [4.4. Specify the following pre-domains]
After the evaluation of the effectiveness of the pre-domain 35A in the transition learning (step S206) is completed, the trial transition identifying unit 63 and the comparison identifying unit 64 used for the evaluation of the pre-domain 35A are deleted (step S207) . This is because the trial transition identification unit 63 and the comparison identification unit 64 corresponding to the pre-domain 35A are not used in the evaluation of the effectiveness of other pre-domains in transfer learning.

事前ドメイン評価装置２０は、一時記憶部２１に記憶されている全ての事前ドメインを選択したか否かを判断する（ステップＳ２０８）。全ての事前ドメインを選択していない場合（ステップＳ２０８においてＮｏ）、事前ドメイン評価装置２０は、選択されていない事前ドメインの転移学習における有効性を評価するために、ステップＳ２０３に戻る。これにより、転移学習における事前ドメイン３５Ｄ及び３５Ｆの有効性が評価される。 The preliminary domain evaluation device 20 determines whether all the preliminary domains stored in the temporary storage unit 21 have been selected (step S208). When all the preliminary domains have not been selected (No in step S208), the preliminary domain evaluation device 20 returns to step S203 to evaluate the effectiveness in transfer learning of the non-selected preliminary domains. This evaluates the effectiveness of the pre-domains 35D and 35F in transfer learning.

［４．５．評価結果データ２５３Ａの生成］
全ての事前ドメインが選択された場合（ステップＳ２０８においてＹｅｓ）、転移評価部２５３は、事前ドメイン３５Ａ、３５Ｄ及び３５Ｆの各々の評価結果を示す評価結果データ２５３Ａを作成する。転移学習に有効と判断される事前ドメインの数は、特に限定されない。転移評価部２５３は、作成した評価結果データ２５３Ａを選択学習装置３０に出力する。 [4.5. Generation of evaluation result data 253A]
When all pre-domains have been selected (Yes in step S208), the transfer evaluation unit 253 creates evaluation result data 253A indicating the evaluation results of each of the pre-domains 35A, 35D, and 35F. The number of prior domains judged to be effective for transfer learning is not particularly limited. The transfer evaluation unit 253 outputs the generated evaluation result data 253A to the selective learning device 30.

再び、図５を参照する。選択学習装置３０において、事前ドメイン選択部３１は、評価結果データ２５３Ａに基づいて、クラスタリング装置１０により生成された事前ドメイン４５の中から、転移学習に有効と判断された事前ドメイン３５Ａ、３５Ｄ及び３５Ｆを選択する（ステップＳ３）。特徴抽出部３２（図４参照）は、記憶装置５００から目標ドメイン５０Ａを取得し、取得した目標ドメイン５０Ａに含まれる学習用データ５１の各々から特徴量を抽出する（ステップＳ４）。これにより、学習用特徴データ５２を含む目標ドメイン５０Ｂが生成される。特徴抽出部３２は、特徴抽出部２２（図２参照）が学習用データ５１から特徴量を抽出する時と同じ条件で、特徴量の抽出を行う。 Refer again to FIG. In the selective learning device 30, the prior domain selection unit 31 selects, from among the prior domains 45 generated by the clustering device 10 based on the evaluation result data 253A, the prior domains 35A, 35D and 35F determined to be effective for transfer learning. Is selected (step S3). The feature extraction unit 32 (see FIG. 4) acquires the target domain 50A from the storage device 500, and extracts a feature amount from each of the learning data 51 included in the acquired target domain 50A (step S4). Thus, a target domain 50B including the learning feature data 52 is generated. The feature extraction unit 32 extracts feature amounts under the same conditions as when the feature extraction unit 22 (see FIG. 2) extracts feature amounts from the learning data 51.

転移学習部３３は、選択された事前ドメイン３５Ａ、３５Ｄ及び３５Ｆと、特徴抽出部３２により生成された目標ドメイン５０Ｂとを用いて、転移学習を導入した機械学習を実行する（ステップＳ５）。これにより、複数の決定木を示すデータ群である転移識別データ８０が生成される。 The transfer learning unit 33 uses the selected prior domains 35A, 35D and 35F and the target domain 50B generated by the feature extraction unit 32 to execute machine learning in which transfer learning is introduced (step S5). This generates transition identification data 80 which is a data group indicating a plurality of decision trees.

以上説明したように、機械学習装置１００は、記憶装置４００に記憶された転移候補データ４１，４１，・・・から特徴を抽出して転移候補特徴データ４２，４２，・・・を生成する。機械学習装置１００は、抽出した特徴量に基づいて転移候補特徴データ４２，４２，・・・を複数のグループに分類する。機械学習装置１００は、分類されたグループにおける転移候補特徴データ４２の数又は共分散に基づいて、分類されたグループを事前ドメインに決定するか否かを判断する。これにより、転移学習に用いられる事前ドメインを、転移候補データ４１から効率的に生成することができる。 As described above, the machine learning apparatus 100 extracts features from the transfer candidate data 41, 41,... Stored in the storage device 400, and generates transfer candidate feature data 42, 42,. The machine learning apparatus 100 classifies the transition candidate feature data 42, 42, ... into a plurality of groups based on the extracted feature quantities. The machine learning apparatus 100 determines whether to determine the classified group as the pre-domain based on the number or covariance of the transfer candidate feature data 42 in the classified group. Thereby, the prior domain used for transfer learning can be efficiently generated from transfer candidate data 41.

［変形例］
上記実施の形態において、クラスタリング装置１０が、転移候補特徴データ４２を分類する際に、デンシティフォレストを用いて分類木３５として２分木を生成する場合を例に説明したが、これに限られない。クラスタリング装置１０は、ｋ−ｍｅａｎｓ法などの他の分類アルゴリズムを用いて、転移候補特徴データ４２を分類してもよい。この場合、ステップＳ１０９（図８参照）において作成される子ノードの数は、３つ以上であってもよい。 [Modification]
In the above embodiment, when the clustering device 10 classifies the transition candidate feature data 42, the case of generating a binary tree as the classification tree 35 using the density forest has been described as an example, but the present invention is not limited thereto. . The clustering device 10 may classify the transition candidate feature data 42 using another classification algorithm such as the k-means method. In this case, the number of child nodes created in step S109 (see FIG. 8) may be three or more.

また、クラスタリング装置１０は、２つ以上の分類アルゴリズムを用いて、転移候補特徴データ４２を分類してもよい。例えば、クラスタリング装置１０は、分類対象のノードに属する転移候補特徴データ４２の数が、分類アルゴリズムの変更を判断するための基準値（アルゴリズム変更基準値）より大きいか否かに基づいて、分類アルゴリズムを決定する。 In addition, the clustering device 10 may classify the transition candidate feature data 42 using two or more classification algorithms. For example, the clustering device 10 performs classification algorithm based on whether the number of transfer candidate feature data 42 belonging to the classification target node is larger than a reference value (algorithm change reference value) for determining a change in the classification algorithm. Decide.

図１３は、ｋ−ｍｅａｎｓ法と、デンシティフォレストとを用いて生成された分類木３５の一例を示す図である。例えば、アルゴリズム変更基準値が、２５０００に設定されていると仮定する。 FIG. 13 is a diagram showing an example of the classification tree 35 generated using the k-means method and the density forest. For example, assume that the algorithm change reference value is set to 25000.

ルートノード３５Ｒに属する転移候補特徴データ４２の数は、３００００であり、アルゴリズム変更基準値よりも大きい。この場合、クラスタリング装置１０は、ルートノード３５Ｒの子ノードとして、ノード３６Ａ、３６Ｂ及び３６Ｃを生成する。そして、クラスタリング装置１０は、ｋ−ｍｅａｎｓ法を用いて、ルートノード３５Ｒに属する転移候補特徴データ４２を、ノード３６Ａ、３６Ｂ及び３６Ｃを生成する。 The number of transfer candidate feature data 42 belonging to the root node 35R is 30,000, which is larger than the algorithm change reference value. In this case, the clustering device 10 generates nodes 36A, 36B and 36C as child nodes of the root node 35R. Then, the clustering device 10 generates the transition candidate feature data 42 belonging to the root node 35R using the k-means method to generate nodes 36A, 36B, and 36C.

そして、ノード３６Ａ及び３６Ｃに属する転移候補特徴データ４２の数は、５０００及び８０００であり、分類継続基準値（９２７０）以下である。クラスタリング装置１０は、ノード３６Ａ及び３６Ｃをそれぞれ事前ドメインに決定する。一方、ノード３６Ｂに属する転移候補特徴データ４２の数は、１７０００であり、分類継続基準値よりも大きい。この場合、クラスタリング装置１０は、ノード３６Ｂに属する転移候補特徴データ４２をさらに分類する。 Then, the numbers of transfer candidate feature data 42 belonging to the nodes 36A and 36C are 5000 and 8000, respectively, which are below the classification continuation reference value (9270). The clustering device 10 determines nodes 36A and 36C as pre-domains, respectively. On the other hand, the number of transfer candidate feature data 42 belonging to the node 36 B is 17000, which is larger than the classification continuation reference value. In this case, the clustering device 10 further classifies the transition candidate feature data 42 belonging to the node 36B.

ノード３６Ｂに属する転移候補特徴データ４２の数（１７０００）は、アルゴリズム変更基準値（２５０００）以下であるため、クラスタリング装置１０は、ノード３６Ｂに属する転移候補特徴データ４２の分類にデンシティフォレストを用いることを決定する。クラスタリング装置１０は、ノード３６Ｂの子ノードとして、ノード３６Ｄ及び３６Ｅを生成し、ノード３６Ｂに属する転移候補特徴データ４２を分類する。 Since the number (17000) of transition candidate feature data 42 belonging to the node 36B is less than or equal to the algorithm change reference value (25000), the clustering apparatus 10 uses density forest for classification of the transition candidate feature data 42 belonging to the node 36B. Decide. The clustering device 10 generates nodes 36D and 36E as child nodes of the node 36B, and classifies transition candidate feature data 42 belonging to the node 36B.

このように、分類対象のノードに属する転移候補特徴データ４２の数に応じて分類アルゴリズムを切り替えることにより、転移候補特徴データ４２の分類を高速に実行することができる。 As described above, by switching the classification algorithm in accordance with the number of transfer candidate feature data 42 belonging to the classification target node, the transfer candidate feature data 42 can be classified at high speed.

また、上記実施の形態において、選択学習装置３０（図４参照）が、特徴抽出部３２を備える例を説明したが、これに限られない。選択学習装置３０は、事前ドメイン評価装置２０（図３参照）が備える特徴抽出部２２により生成された目標ドメイン５０Ｂを用いて、転移識別データ８０を生成してもよい。また、事前ドメイン評価装置２０が、各事前ドメイン４５に対応する転移候補データ４１から特徴量を抽出して転移候補特徴データ４２を生成してもよい。あるいは、選択学習装置３０が、転移学習に有効と判断された事前ドメインに対応する転移候補データ４１から特徴量を抽出して転移候補特徴データ４２を生成してもよい。 Moreover, although the example which the selective learning apparatus 30 (refer FIG. 4) equips with the feature extraction part 32 was demonstrated in the said embodiment, it is not restricted to this. The selective learning device 30 may generate the transition identification data 80 using the target domain 50B generated by the feature extraction unit 22 included in the prior domain evaluation device 20 (see FIG. 3). Alternatively, the preliminary domain evaluation device 20 may extract feature amounts from the transfer candidate data 41 corresponding to each of the preliminary domains 45 to generate transfer candidate feature data 42. Alternatively, the selective learning device 30 may generate the transfer candidate feature data 42 by extracting feature amounts from the transfer candidate data 41 corresponding to the pre-domain determined to be effective for transfer learning.

いずれの場合においても、クラスタリング装置１０、事前ドメイン評価装置２０、選択学習装置３０の各々において用いられる転移候補特徴データ４２は、全て同じ条件で転移候補データ４１から特徴量を抽出することにより生成されることが望ましい。同様に、学習用特徴データ５２は、全て同じ条件で学習用データ５１から特徴量を抽出することにより生成されることが望ましい。以下、その理由について説明する。 In any case, the transition candidate feature data 42 used in each of the clustering device 10, the prior domain evaluation device 20, and the selection learning device 30 are all generated by extracting feature amounts from the transition candidate data 41 under the same conditions. Is desirable. Similarly, it is desirable that the learning feature data 52 be generated by extracting feature quantities from the learning data 51 under the same conditions. The reason will be described below.

例えば、クラスタリング装置１０と事前ドメイン評価装置２０とで特徴量の抽出条件が異なる場合、クラスタリング装置１０において生成される転移候補特徴データ４２は、事前ドメイン評価装置２０において転移候補特徴データ４２における分布と異なる分布を有する。目標ドメインと事前ドメインとの位置関係が、クラスタリング装置１０において生成される転移候補特徴データ４２と、事前ドメイン評価装置２０において転移候補特徴データ４２とで異なる。この結果、事前ドメイン評価装置２０において、クラスタリング装置１０で生成された事前ドメインが転移学習で有効であるか否かを判定する精度が低下する。 For example, when the extraction condition of the feature amount differs between the clustering device 10 and the pre-domain evaluation device 20, the transfer candidate feature data 42 generated in the clustering device 10 has a distribution in the transfer candidate feature data 42 in the pre-domain evaluation device 20. Have different distributions. The positional relationship between the target domain and the prior domain is different between the transition candidate feature data 42 generated in the clustering device 10 and the transition candidate feature data 42 in the prior domain evaluation device 20. As a result, in the prior domain evaluation device 20, the accuracy of determining whether the prior domain generated by the clustering device 10 is effective in the transfer learning is lowered.

事前ドメイン評価装置２０と、選択学習装置３０とで特徴量の抽出条件が異なる場合も同様に、事前ドメイン評価装置２０で有効と判断された事前ドメイン４５における転移候補特徴データ４２の分布が変化する。この結果、選択学習装置３０における転移学習を導入した機械学習の学習精度が低下し、転移識別データ８０を用いた人物の識別精度が低下する可能性がある。 The distribution of transfer candidate feature data 42 in the prior domain 45 determined to be effective by the prior domain evaluation device 20 changes similarly even when the extraction conditions of the feature amount differ between the prior domain evaluation device 20 and the selection learning device 30. . As a result, the learning accuracy of machine learning in which transfer learning is introduced in the selective learning device 30 may be reduced, and the identification accuracy of a person using the transfer identification data 80 may be reduced.

これに対して、クラスタリング装置１０、事前ドメイン評価装置２０、及び選択学習装置３０における特徴量の抽出条件を揃えることにより、事前ドメインの有効性を評価するときの精度、転移識別データ８０を生成するときの学習の精度が低下することを防ぐことができる。 On the other hand, the transition identification data 80 is generated with accuracy when evaluating the effectiveness of the pre-domain by aligning the extraction conditions of the feature amounts in the clustering device 10, the pre-domain evaluation device 20, and the selection learning device 30. The accuracy of learning can be prevented from decreasing.

上記実施の形態において、試行転移学習部２３、比較学習部２４及び転移学習部３３が、学習アルゴリズムとしてランダムフォレストを用いる場合を例に説明したが、これに限られない。例えば、試行転移学習部２３、比較学習部２４及び転移学習部３３は、ＩＤ３（Iterative Dichotomiser 3）や、ブースティング、ニューラルネットワークなどの各種アルゴリズムを用いてもよい。いずれの学習アルゴズムを用いる場合であっても、試行転移学習部２３及び転移学習部３３は、転移学習を導入した機械学習を実行し、比較学習部２４は、転移学習を導入しない機械学習を実行すればよい。 Although the trial transfer learning unit 23, the comparison learning unit 24, and the transfer learning unit 33 use a random forest as a learning algorithm in the above embodiment, the present invention is not limited to this. For example, the trial transfer learning unit 23, the comparison learning unit 24, and the transfer learning unit 33 may use various algorithms such as ID3 (Iterative Dichotomizer 3), boosting, and a neural network. Regardless of which learning algorithm is used, the trial transfer learning unit 23 and the transfer learning unit 33 execute machine learning in which transfer learning is introduced, and the comparison learning unit 24 performs machine learning in which transfer learning is not introduced. do it.

上記実施の形態において、転移評価部２５３は、競合値２５１Ａ及び信頼度２５２Ａを乗算することにより、総合評価値を計算する例を説明したが、これに限られない。たとえば、転移評価部２５３は、競合値２５１Ａ及び信頼度２５２Ａの合計を総合評価値として計算してもよい。つまり、転移評価部２５３は、競合値２５１Ａ及び信頼度２５２Ａを用いて、総合評価値を計算すればよい。 In the above embodiment, although the example in which the transition evaluation unit 253 calculates the comprehensive evaluation value by multiplying the competition value 251A and the reliability 252A has been described, the present invention is not limited thereto. For example, the transfer evaluation unit 253 may calculate the sum of the competition value 251A and the reliability 252A as a comprehensive evaluation value. That is, the transfer evaluation unit 253 may calculate the comprehensive evaluation value using the competition value 251A and the reliability 252A.

上記実施の形態において、機械学習装置１００が、転移候補データ４１及び学習用データ５１の各々からＨＯＧ特徴量を抽出する場合を例にして説明したが、これに限られない。例えば、機械学習装置１００は、人物の顔を学習する場合、Ｈａａｒ−ｌｉｋｅ特徴量を抽出してもよい。機械学習装置１００は、学習対象に応じて、転移候補データ４１及び学習用データ５１から抽出する特徴量を適宜変更すればよい。 In the above embodiment, although the machine learning device 100 extracts the HOG feature value from each of the transfer candidate data 41 and the learning data 51 has been described as an example, the present invention is not limited to this. For example, when learning the face of a person, the machine learning apparatus 100 may extract a Haar-like feature. The machine learning apparatus 100 may appropriately change the feature quantities extracted from the transfer candidate data 41 and the learning data 51 according to the learning target.

上記実施の形態において、機械学習装置１００が、人物を検出するための転移識別データ８０を生成する例を説明したが、これに限られない。学習の対象は、センサにより計測された測定データであってもよい。センサの種類は、特に限定されず、加速度センサ、光センサなどの様々な測定データを使用することができる。例えば、自動車の自動運転を行うために、これらのセンサの測定データを用いるために機械学習を実行してもよい。 Although the machine learning device 100 has described the example in which the transition identification data 80 for detecting a person is generated in the above embodiment, the present invention is not limited to this. The target of learning may be measurement data measured by a sensor. The type of sensor is not particularly limited, and various measurement data such as an acceleration sensor and an optical sensor can be used. For example, machine learning may be performed to use the measurement data of these sensors to provide automated driving of the vehicle.

上記実施の形態の機械学習装置１００の一部または全部は、集積回路（例えば、ＬＳＩ、システムＬＳＩ等）として実現されるものであってもよい。 A part or all of the machine learning apparatus 100 according to the above-described embodiment may be realized as an integrated circuit (for example, an LSI, a system LSI, etc.).

上記実施の形態の各機能ブロックの処理の一部または全部は、プログラムにより実現されるものであってもよい。そして、上記実施形態の各機能ブロックの処理の一部または全部は、コンピュータにおいて、中央演算装置（ＣＰＵ）により行われる。また、それぞれの処理を行うためのプログラムは、ハードディスク、ＲＯＭなどの記憶装置に格納されており、ＲＯＭにおいて、あるいはＲＡＭに読み出されて実行される。 Part or all of the processing of each functional block in the above embodiment may be realized by a program. And a part or all of processing of each functional block of the above-mentioned embodiment is performed by central processing unit (CPU) in a computer. Further, programs for performing respective processing are stored in a storage device such as a hard disk or a ROM, and are read out from the ROM or to the RAM and executed.

また、上記実施形態の各処理をハードウェアにより実現してもよいし、ソフトウェア（ＯＳ（オペレーティングシステム）、ミドルウェア、あるいは、所定のライブラリとともに実現される場合を含む。）により実現してもよい。さらに、ソフトウェアおよびハードウェアの混在処理により実現しても良い。なお、上記実施の形態に係る機械学習装置１００をハードウェアにより実現する場合、各処理を行うためのタイミング調整を行う必要があるのは言うまでもない。上記実施形態においては、説明便宜のため、実際のハードウェア設計で生じる各種信号のタイミング調整の詳細については省略している。 Further, each process of the above-described embodiment may be realized by hardware or may be realized by software (including a case where it is realized with an OS (Operating System), middleware, or a predetermined library). Furthermore, it may be realized by mixed processing of software and hardware. Needless to say, when the machine learning apparatus 100 according to the above-described embodiment is realized by hardware, it is necessary to perform timing adjustment for performing each process. In the above embodiment, for the convenience of description, the details of the timing adjustment of various signals generated in the actual hardware design are omitted.

また、上記実施形態における処理方法の実行順序は、必ずしも、上記実施形態の記載に制限されるものではなく、発明の要旨を逸脱しない範囲で、実行順序を入れ替えることができるものである。 Further, the order of execution of the processing method in the above embodiment is not necessarily limited to the description of the above embodiment, and the order of execution can be interchanged without departing from the scope of the invention.

前述した方法をコンピュータに実行させるコンピュータプログラム及びそのプログラムを記録したコンピュータ読み取り可能な記録媒体は、本発明の範囲に含まれる。ここで、コンピュータ読み取り可能な記録媒体としては、例えば、フレキシブルディスク、ハードディスク、ＣＤ−ＲＯＭ、ＭＯ、ＤＶＤ、ＤＶＤ−ＲＯＭ、ＤＶＤ−ＲＡＭ、大容量ＤＶＤ、次世代ＤＶＤ、半導体メモリを挙げることができる。 A computer program that causes a computer to execute the above-described method and a computer readable recording medium recording the program are included in the scope of the present invention. Here, examples of the computer-readable recording medium include a flexible disk, a hard disk, a CD-ROM, an MO, a DVD, a DVD, a DVD-ROM, a DVD-RAM, a large capacity DVD, a next-generation DVD, and a semiconductor memory. .

上記コンピュータプログラムは、上記記録媒体に記録されたものに限られず、電気通信回線、無線又は有線通信回線、インターネットを代表とするネットワーク等を経由して伝送されるものであってもよい。 The computer program is not limited to one recorded in the recording medium, but may be transmitted via a telecommunication line, a wireless or wired communication line, a network represented by the Internet, or the like.

１００機械学習装置
１０クラスタリング装置
１１、２２、３２特徴抽出部
１２分類部
１３分散計算部
１４事前ドメイン決定部
２０事前ドメイン評価装置
２１一時記憶部
２３試行転移学習部
２４比較学習部
２５評価部
２５１競合値計算部
２５２信頼度計算部
２５３転移評価部
３０選択学習装置
３１事前ドメイン選択部
３３転移学習部 100 machine learning device 10 clustering device 11, 22, 32 feature extraction unit 12 classification unit 13 distributed calculation unit 14 prior domain determination unit 20 prior domain evaluation device 21 temporary storage unit 23 trial transition learning unit 24 comparison learning unit 25 evaluation unit 251 competition Value calculation unit 252 reliability calculation unit 253 transition evaluation unit 30 selective learning device 31 prior domain selection unit 33 transition learning unit

Claims

A machine learning apparatus that executes machine learning in which transfer learning is introduced to learn a detection target,
A clustering device that classifies a plurality of transfer candidate data used for the machine learning and generates a pre-domain used for the machine learning;
A pre-domain evaluation device that evaluates whether or not the pre-domain generated by the clustering device is effective for the machine learning;
Equipped with
The clustering device
A feature extraction unit for clustering which extracts features from each of the plurality of transfer candidate data and generates a plurality of transfer candidate feature data;
A classification unit that classifies each transfer candidate feature data into a plurality of groups including a first group and a second group based on the features of each of the plurality of transfer candidate feature data generated by the clustering feature extraction unit;
When the number of transfer candidate feature data classified into the first group by the classification unit is equal to or less than a predetermined classification continuation reference value, the first group is determined to be a pre-domain used for the machine learning, and the transfer candidate A pre-domain determination unit that determines to further classify transfer candidate feature data classified into the first group if the number of feature data is larger than the classification continuation reference value;
Equipped with
The prior domain evaluation device
When the first group is determined to be the prior domain by the prior domain determination unit, for learning having transfer candidate feature data included in the first group, and each having a feature to be detected under a predetermined condition A trial transfer learning unit that executes the machine learning using a target domain including data to generate an evaluation discriminator for evaluating the pre-domain;
A determination unit that determines whether the first group is effective for the machine learning based on the trial transfer identification unit generated by the trial transfer learning unit;
Machine learning device comprising:

The machine learning device according to claim 1 , wherein
The pre-domain evaluator may further:
A learning feature extraction unit that extracts features of each of the learning data included in the target domain and generates learning feature data;
Equipped with
The trial transfer learning unit executes the machine learning using the learning feature data,
The machine learning apparatus, wherein a condition for the feature extraction unit for learning to extract a feature from data for learning is the same as a condition for the feature extraction unit for clustering to extract a feature from each of the plurality of transfer candidate data.

The machine learning apparatus according to claim 2 , further comprising:
A selection learning device that executes the machine learning using the target domain and all pre-domains determined to be effective for the machine learning by the pre-domain evaluating device to generate a transition identification unit;
Machine learning device comprising: