JP7268347B2

JP7268347B2 - IDENTIFICATION DEVICE, IDENTIFICATION METHOD AND PROGRAM

Info

Publication number: JP7268347B2
Application number: JP2018239067A
Authority: JP
Inventors: 和久松永
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 2018-12-21
Filing date: 2018-12-21
Publication date: 2023-05-08
Anticipated expiration: 2038-12-21
Also published as: JP2020101958A

Description

本発明は、識別装置、識別方法及びプログラムに関する。 The present invention relates to an identification device, identification method and program.

従来から、ニューラルネットワーク（ＮｅｕｒａｌＮｅｔｗｏｒｋ）、ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）等を用いて、入力データが複数のクラスの中のどのクラスに属するかを判定する識別器（分類器とも呼ばれる）が知られており、様々な分野で利用されている。例えば、花の種類をクラスとして、花の画像を入力すると、その花の種類を出力する識別器が存在する。また、皮膚疾患の名称をクラスとして、皮膚の画像を入力すると、その皮膚の疾患名を出力する識別器が存在する。 Conventionally, a classifier (also called a classifier) has been known that uses a neural network, an SVM (Support Vector Machine), or the like to determine which class among a plurality of classes input data belongs to. and is used in various fields. For example, there is a classifier that outputs the type of flower when a flower image is input with the type of flower as a class. There is also a classifier that outputs the name of a skin disease when a skin image is input with the name of a skin disease as a class.

このような識別器は、クラスが既知である多数の学習データを用いて学習させることにより作成される。しかし、学習データに含まれているクラスの構成比率が極端に偏っている（クラス毎のサンプル数の不均衡が大きい）と、比率の高いクラス（サンプル数が多いクラス）にバイアスされてしまって、識別の精度が落ちるという問題がある。この問題を解決するために、特許文献１では、一方のクラスに属する学習データの個数が他方のクラスの個数より極端に少ない場合でも、少数クラスの学習データの一部を複製して増やしたり（オーバーサンプリング）、多数クラスの学習データの一部を除いて減らしたり（アンダーサンプリング）して、学習データのバランスを調整することにより、高精度な識別を行う技術が提案されている。 Such classifiers are created by learning using a large amount of training data whose classes are known. However, if the composition ratio of the classes included in the training data is extremely skewed (large imbalance in the number of samples for each class), it will be biased to classes with a high ratio (classes with a large number of samples). , there is a problem that the accuracy of identification falls. In order to solve this problem, in Patent Document 1, even if the number of learning data belonging to one class is extremely smaller than the number of the other class, part of the learning data of the minority class is duplicated to increase it ( (oversampling), or by removing some of the learning data of many classes and reducing (undersampling) the learning data to adjust the balance of the learning data, thereby performing highly accurate classification.

特開２０１３－１６１２９８号公報JP 2013-161298 A

従来、学習データのバランスを調整する尺度として、各クラスの識別結果をまとめた混同行列が用いられており、従来技術においては、混同行列の要素間の値の対称性が高くなるように、クラス間の個数のバランスが調整されている。しかし、適正なバランスは、各クラスに属するデータの個数だけでなく、クラス間の境界付近のデータの分布状況にも大いに影響されるため、オーバーサンプリングやアンダーサンプリングによって、学習データを適正なバランスに調整するのは容易ではない。 Conventionally, as a scale for adjusting the balance of learning data, a confusion matrix summarizing the classification results of each class is used. The balance between the numbers has been adjusted. However, the proper balance is greatly affected not only by the number of data belonging to each class, but also by the distribution of data near the boundaries between classes. Not easy to adjust.

本発明は、上記問題を解決するためになされたものであり、識別の精度の向上を図ることができる識別装置、識別方法及びプログラムを提供することを目的とする。 SUMMARY OF THE INVENTION The present invention has been made to solve the above problems, and an object of the present invention is to provide an identification device, an identification method, and a program capable of improving the accuracy of identification.

上記目的を達成するため、本発明の識別装置は、
入力される未知データの属する複数のクラスの各々を識別する各クラスの予測値を出力する識別器と、
前記識別器が出力する各クラスの予測値を、各クラスの補正パラメータを用いて補正する補正部と、
前記補正部によって補正された予測値で識別したクラスに基づいて混同行列を作成する作成部と、
前記作成部によって作成された混同行列において前記識別器の各正解ラベルに対する各クラスのサンプル数の合計値の偏りが前記複数のクラスの間でより小さくなるように前記補正パラメータを修正する修正部と、
前記識別器の出力する予測値を、前記修正部によって修正された補正パラメータを用いて補正して、最終的な識別結果を取得する結果取得部と、
を備える。 In order to achieve the above object, the identification device of the present invention
a discriminator that outputs a predicted value for each class that discriminates each of a plurality of classes to which unknown data to be input belongs;
a correction unit that corrects the predicted value of each class output by the discriminator using a correction parameter of each class;
A creation unit that creates a confusion matrix based on the class identified by the predicted value corrected by the correction unit;
a modifying unit that modifies the correction parameter so that the deviation of the total number of samples of each class for each correct label of the classifier in the confusion matrix generated by the generating unit is smaller among the plurality of classes; ,
a result acquisition unit that acquires a final identification result by correcting the predicted value output by the classifier using the correction parameter corrected by the correction unit;
Prepare.

本発明によれば、識別の精度の向上を図ることができる。 According to the present invention, it is possible to improve the accuracy of identification.

本発明の実施形態１に係る識別装置の機能構成を示す図である。It is a figure which shows the functional structure of the identification device which concerns on Embodiment 1 of this invention. 実施形態１に係る補正パラメータ取得処理のフローチャートである。5 is a flowchart of correction parameter acquisition processing according to the first embodiment; 実施形態１に係る識別器学習処理のフローチャートである。4 is a flowchart of discriminator learning processing according to the first embodiment. 実施形態１に係る識別処理のフローチャートである。5 is a flowchart of identification processing according to the first embodiment; 混同行列を説明する図である。It is a figure explaining a confusion matrix. 補正パラメータを取得する処理を説明する図である。It is a figure explaining the process which acquires a correction parameter. 予測値を補正パラメータで補正した値での混同行列を説明する図である。FIG. 10 is a diagram illustrating a confusion matrix with values obtained by correcting predicted values with correction parameters; 補正パラメータを修正する処理を説明する図である。It is a figure explaining the process which corrects a correction parameter. 予測値を他の補正パラメータで補正した値での混同行列を説明する図である。FIG. 10 is a diagram illustrating a confusion matrix with values obtained by correcting predicted values with other correction parameters; 補正パラメータをさらに修正する処理を説明する図である。It is a figure explaining the process which corrects a correction parameter further.

以下、本発明の実施形態に係る識別装置等について、図表を参照して説明する。なお、図中同一又は相当部分には同一符号を付す。 Hereinafter, identification devices and the like according to embodiments of the present invention will be described with reference to the drawings. The same reference numerals are given to the same or corresponding parts in the drawings.

（実施形態１）
本発明の実施形態１に係る識別装置１００は、サンプルデータの数にクラス間で偏りがあっても、識別器の出力を補正することよって、サンプル数の偏りの影響を軽減して、精度の高い識別を行うことができる。このような識別装置１００について、以下に説明する。 (Embodiment 1)
The identification device 100 according to the first embodiment of the present invention reduces the influence of the bias in the number of samples by correcting the output of the classifier even if the number of sample data is biased between classes, and improves the accuracy. A high degree of discrimination can be achieved. Such an identification device 100 will be described below.

実施形態１に係る識別装置１００は、図１に示すように、制御部１０、記憶部２０、データ入力部３１、出力部３２、通信部３３、操作入力部３４、を備える。 The identification device 100 according to Embodiment 1 includes a control unit 10, a storage unit 20, a data input unit 31, an output unit 32, a communication unit 33, and an operation input unit 34, as shown in FIG.

制御部１０は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）等で構成され、記憶部２０に記憶されたプログラムを実行することにより、後述する各部（識別器１１、補正部１２、結果取得部１３）の機能を実現する。 The control unit 10 is composed of a CPU (Central Processing Unit) or the like, and by executing a program stored in the storage unit 20, functions of each unit (discriminator 11, correction unit 12, result acquisition unit 13) to be described later. Realize.

記憶部２０は、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等で構成され、制御部１０のＣＰＵが実行するプログラム及び必要なデータを記憶する。 The storage unit 20 includes a ROM (Read Only Memory), a RAM (Random Access Memory), etc., and stores programs executed by the CPU of the control unit 10 and necessary data.

データ入力部３１は、学習用のデータ、試験用のデータ又は識別する（未知の）データを入力するためのデバイスである。制御部１０は、データ入力部３１を介してデータを取得する。データ入力部３１としては、制御部１０がデータを取得できるなら、任意のデバイスを使用することができる。例えば、記憶部２０にデータを記憶させておき、制御部１０が記憶部２０を読み出すことによってデータを取得する場合は、記憶部２０がデータ入力部３１を兼ねることになる。また、制御部１０が通信部３３を介して外部のサーバ等からデータを取得する場合は、通信部３３がデータ入力部３１を兼ねることになる。 The data input unit 31 is a device for inputting learning data, test data, or identifying (unknown) data. The control unit 10 acquires data via the data input unit 31 . Any device can be used as the data input unit 31 as long as the control unit 10 can acquire data. For example, when data is stored in the storage unit 20 and the control unit 10 acquires the data by reading the storage unit 20 , the storage unit 20 also serves as the data input unit 31 . Also, when the control unit 10 acquires data from an external server or the like via the communication unit 33 , the communication unit 33 also serves as the data input unit 31 .

出力部３２は、制御部１０が、データ入力部３１から入力したデータを識別した結果等を出力するためのデバイスである。例えば、出力部３２は、液晶ディスプレイや有機ＥＬ（Ｅｌｅｃｔｏｒｏ－Ｌｕｍｉｎｅｓｃｅｎｃｅ）ディスプレイである。この場合、出力部３２は、表示部として機能する。ただし、識別装置１００は、出力部３２としてこのようなディスプレイ（表示部）を備えてもよいし、外部のディスプレイを接続するためのインタフェースとしての出力部３２を備えてもよい。識別装置１００は、インタフェースとしての出力部３２を備える場合は、出力部３２を介して接続した外部のディスプレイに識別結果等を表示する。出力部３２は、出力手段として機能する。 The output unit 32 is a device for the control unit 10 to output the results of identification of data input from the data input unit 31 and the like. For example, the output unit 32 is a liquid crystal display or an organic EL (Electro-Luminescence) display. In this case, the output section 32 functions as a display section. However, the identification device 100 may include such a display (display unit) as the output unit 32, or may include the output unit 32 as an interface for connecting an external display. When the identification device 100 has the output unit 32 as an interface, the identification result and the like are displayed on an external display connected via the output unit 32 . The output unit 32 functions as output means.

通信部３３は、外部の他の装置（例えば、学習データや試験データのデータベースが格納されているサーバ等）とデータの送受信を行うためのデバイス（ネットワークインタフェース等）である。制御部１０は、通信部３３を介してデータを取得することができる。 The communication unit 33 is a device (network interface, etc.) for transmitting and receiving data to and from other external devices (for example, a server storing databases of learning data and test data). The control unit 10 can acquire data via the communication unit 33 .

操作入力部３４は、識別装置１００に対するユーザの操作入力を受け付けるデバイスであり、例えば、キーボード、マウス、タッチパネル等である。識別装置１００は、操作入力部３４を介して、ユーザからの指示等を受け付ける。操作入力部３４は、操作入力手段として機能する。 The operation input unit 34 is a device that receives a user's operation input to the identification device 100, and is, for example, a keyboard, mouse, touch panel, or the like. The identification device 100 receives an instruction or the like from the user via the operation input unit 34 . The operation input unit 34 functions as operation input means.

次に、制御部１０の機能について説明する。制御部１０は、識別器１１、補正部１２、結果取得部１３、の機能を実現する。 Next, functions of the control unit 10 will be described. The control unit 10 realizes the functions of the discriminator 11 , the correction unit 12 and the result acquisition unit 13 .

識別器１１は、ロジスティク回帰やＤＮＮ（ＤｅｅｐＮｅｕｒａｌＮｅｔｗｏｒｋ）等で構成され、与えられた入力データの属するクラスを識別する各クラスの予測値を出力することにより、該入力データが複数のクラスの何れに属するかを識別する。ここでは、識別器１１が識別するクラスの数はＮ（識別器１１は、Ｎクラス識別器）とする。識別器１１は、入力データが与えられると、各クラスの予測値を０以上１以下の数で出力する。予測値はＮ（識別するクラス数）個出力されるので、各クラスの予測値をまとめて、Ｎ次元のベクトルとして予測値を扱うこともできる。そして、Ｎ個の予測値（ベクトルの各要素）のうち、最大の予測値に対応するクラスが、該入力データの識別結果（予測クラス）になる。なお、ロジスティク回帰やＤＮＮによる識別器は、これらを実現するプログラムを制御部１０が実行することにより実現することができ、この場合、制御部１０は識別器１１としても機能することになる。なお、識別器１１は、各クラスの予測値を０以上１以下の数で出力するため、識別器１１としてＳＶＭを直接用いることはできないが、ＳＶＭの出力をシグモイド関数等で０以上１以下の数に変換するようにすれば、ＳＶＭを用いた識別器１１を構成することも可能である。 The discriminator 11 is composed of logistic regression, DNN (Deep Neural Network), or the like, and outputs predicted values for discriminating the class to which given input data belongs. identify who belongs to Here, the number of classes identified by the discriminator 11 is N (the discriminator 11 is an N class discriminator). The discriminator 11 outputs a predicted value of each class as a number of 0 or more and 1 or less when input data is given. Since N (the number of classes to be identified) predicted values are output, the predicted values of each class can be collected and handled as an N-dimensional vector. Then, the class corresponding to the maximum predicted value among the N predicted values (each element of the vector) becomes the identification result (predicted class) of the input data. Logistic regression and DNN discriminators can be realized by the control unit 10 executing a program for realizing them. In this case, the control unit 10 also functions as the discriminator 11 . Since the discriminator 11 outputs the predicted value of each class as a number between 0 and 1, the SVM cannot be directly used as the discriminator 11. If converted into numbers, it is possible to configure the discriminator 11 using SVM.

補正部１２は、識別器１１が出力する各クラスの予測値を、混同行列の対称性を高めるように補正する補正パラメータを取得する。なお、混同行列とは、正解のわかっている入力データ（ここでは試験データと言う）を識別器１１に入力させた時の、クラス識別の結果をまとめた表のことである。識別器１１が識別したクラス（予測クラス）を横方向に、正解クラスを縦方向に取った時に、それぞれの位置に対応するサンプル数が混同行列の要素となる。また、混同行列の対称性が高いとは、入力データの各クラスのサンプル数が均等になるように入力した場合に、列毎の合計値の偏りが少ないことを意味する。 The correction unit 12 acquires a correction parameter for correcting the predicted value of each class output by the discriminator 11 so as to enhance the symmetry of the confusion matrix. The confusion matrix is a table summarizing the results of class discrimination when the classifier 11 receives input data whose correct answer is known (referred to as test data here). When the class (predicted class) identified by the classifier 11 is taken in the horizontal direction and the correct class is taken in the vertical direction, the number of samples corresponding to each position becomes an element of the confusion matrix. A confusion matrix with high symmetry means that the total value of each column is less biased when the number of samples in each class of input data is equal.

より詳しく説明すると、補正部１２は、識別器１１が識別する各クラスに対応する（Ｎ個の）補正パラメータを取得し、識別器１１が出力する各クラスの予測値を当該クラスに対応する補正パラメータを用いて補正することにより、補正予測値を取得する。そして、補正予測値で識別したクラスに基づいて混同行列を作成する。そして、混同行列を行方向で（各クラスの正解データ数が１になるように）正規化した時の、列方向の合計（各クラスの予測率合計）ができるだけ１に近づくように当該Ｎ個の（各クラスに対応する）補正パラメータを修正する。そして、最終的に各クラスの予測率合計が最も１に近づいた時の補正パラメータを取得する。 More specifically, the correction unit 12 acquires (N) correction parameters corresponding to each class identified by the discriminator 11, and converts the predicted value of each class output by the discriminator 11 to the correction corresponding to the class. A corrected predicted value is obtained by correcting using the parameters. Then, a confusion matrix is created based on the classes identified by the corrected predicted values. Then, when the confusion matrix is normalized in the row direction (so that the number of correct data in each class is 1), the column direction sum (total prediction rate of each class) is as close to 1 as possible. modify the correction parameters (corresponding to each class) of Then, the correction parameter when the total prediction rate of each class is closest to 1 is obtained.

結果取得部１３は、識別器１１が出力する各クラスの予測値を、補正部１２が取得した各クラスの補正パラメータを用いて補正して、最終的な識別結果を取得する。結果取得部１３は、結果取得手段として機能する。 The result acquisition unit 13 corrects the predicted value of each class output by the classifier 11 using the correction parameter of each class acquired by the correction unit 12, and acquires the final classification result. The result acquisition unit 13 functions as result acquisition means.

以上、識別装置１００の機能構成について説明した。次に、識別装置１００の補正部１２が補正パラメータを取得する処理（補正パラメータ取得処理）について説明する。この処理は、補正パラメータを取得する際に実行される。また、この処理を実行する前に、正解ラベルが付いている所定の量（ここではＭ個とする）の試験用の入力データ（試験データ）を記憶部２０等に予め用意しておく必要がある。 The functional configuration of the identification device 100 has been described above. Next, a process (correction parameter acquisition process) in which the correction unit 12 of the identification device 100 acquires correction parameters will be described. This process is executed when the correction parameters are acquired. In addition, before executing this process, it is necessary to prepare a predetermined amount (in this case, M) of test input data (test data) with correct labels in advance in the storage unit 20 or the like. be.

正解ラベルとは、その正解ラベルが付いたデータがどのクラスに属するものかを示すものである。例えば、疾患画像を入力データとして入力すると疾患名を出力する識別装置１００を用意したいのであれば、「疾患名」を正解ラベルとして各試験データに付けておく必要がある。別の例として、花の画像の画像を入力するとその花の名称を出力する識別装置１００を用意したいのであれば、「花の名称」を正解ラベルとして各試験データに付けておく必要がある。では、識別装置１００の補正部１２が補正パラメータを取得する処理（補正パラメータ取得処理）について、図２を参照して説明する。 A correct label indicates to which class the data with the correct label belongs. For example, if it is desired to prepare an identification device 100 that outputs a disease name when a disease image is input as input data, it is necessary to attach "disease name" as a correct label to each test data. As another example, if it is desired to prepare the identification device 100 that outputs the name of a flower when an image of a flower is input, it is necessary to attach the "name of the flower" as a correct label to each piece of test data. Now, a process (correction parameter acquisition process) in which the correction unit 12 of the identification device 100 acquires correction parameters will be described with reference to FIG.

まず、制御部１０は、後述する識別器学習処理により、予め正解ラベルが付いている大量の学習用の入力データ（学習データ）で識別器１１を機械学習させる（ステップＳ１０１）。識別器１１が既に学習済みである場合は、ステップＳ１０１はスキップしてよい。 First, the control unit 10 causes the discriminator 11 to undergo machine learning using a large amount of input data for learning (learning data) to which correct labels are attached in advance by discriminator learning processing described later (step S101). If the discriminator 11 has already been trained, step S101 may be skipped.

次に、制御部１０は、試験データを識別器１１に入力して識別器１１の出力である予測値を取得する（ステップＳ１０２）。試験データがＭ個用意されているなら、Ｍ個の予測値（それぞれがＮ（クラス数）次元ベクトル）が取得される。 Next, the control unit 10 inputs the test data to the discriminator 11 and acquires the predicted value output from the discriminator 11 (step S102). If M pieces of test data are prepared, M prediction values (each being an N (number of classes) dimensional vector) are obtained.

そして、補正部１２は、各クラスの補正パラメータを１に初期化し、カウンタ変数ＣＴを０に初期化する（ステップＳ１０３）。 Then, the correction unit 12 initializes the correction parameter of each class to 1, and initializes the counter variable CT to 0 (step S103).

次に、補正部１２は、ステップＳ１０２で取得した各予測値を、各クラスの補正パラメータで補正して、補正予測値を取得する（ステップＳ１０４）。具体的には、各クラスの予測値を該クラスの補正パラメータでべき乗した値に変換し、それを補正予測値とする。そして、補正予測値から混同行列を作成し（ステップＳ１０５）、混同行列の各行を合計が１になるように正規化する（ステップＳ１０６）。 Next, the correction unit 12 corrects each predicted value obtained in step S102 with the correction parameter of each class to obtain a corrected predicted value (step S104). Specifically, the predicted value of each class is converted to a value raised to the power of the correction parameter of the class, and the converted value is used as the corrected predicted value. Then, a confusion matrix is created from the corrected prediction values (step S105), and each row of the confusion matrix is normalized so that the sum becomes 1 (step S106).

次に、補正部１２は、正規化して混同行列の各列の合計値を各クラス予測率合計として取得する（ステップＳ１０７）。そして、各クラス予測率合計と１との差分絶対値が最大となるクラスを検索し、検索されたクラスを変数ＭＣに、差分絶対値の最大値（当該クラスのクラス予測率合計と１との差分絶対値）を変数ＭＤにセットし、カウンタ変数ＣＴをインクリメントする（ステップＳ１０８）。 Next, the correction unit 12 normalizes and acquires the total value of each column of the confusion matrix as the total prediction rate of each class (step S107). Then, the class with the maximum absolute difference between the total class prediction rate and 1 is searched for, and the searched class is set as a variable MC, and the maximum absolute difference value (the total class prediction rate of the class and 1 difference absolute value) is set in the variable MD, and the counter variable CT is incremented (step S108).

そして、補正部１２は、変数ＭＤ（差分絶対値の最大値）が基準値ＳＤ（例えば０．０２）以下か否かを判定する（ステップＳ１０９）。変数ＭＤが基準値ＳＤ以下なら（ステップＳ１０９；Ｙｅｓ）、現在の補正パラメータで混同行列の対称性は十分上がっていると考えられるので、補正部１２は現在の補正パラメータを後述の識別処理で用いる補正パラメータとして確定させ（ステップＳ１１４）、補正パラメータ取得処理を終了する。ステップＳ１１４は、補正パラメータ取得ステップとも呼ばれる。 Then, the correction unit 12 determines whether or not the variable MD (the maximum value of the absolute difference values) is equal to or less than the reference value SD (for example, 0.02) (step S109). If the variable MD is equal to or less than the reference value SD (step S109; Yes), it is considered that the symmetry of the confusion matrix is sufficiently improved with the current correction parameters, so the correction unit 12 uses the current correction parameters in the identification process described later. It is determined as a correction parameter (step S114), and the correction parameter acquisition process ends. Step S114 is also called a correction parameter acquisition step.

変数ＭＤが基準値ＳＤより大きければ（ステップＳ１０９；Ｎｏ）、補正部１２は、カウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ（例えば１００）以上か否かを判定する（ステップＳ１１０）。カウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ以上なら（ステップＳ１１０；Ｙｅｓ）、補正部１２は現在の補正パラメータを後述の識別処理で用いる補正パラメータとして確定させ（ステップＳ１１４）、補正パラメータ取得処理を終了する。この補正パラメータ取得処理では、変数ＭＤの値が基準値ＳＤ以下にならずに振動してしまう場合等があるため、ある程度の回数（ＬＩＭＩＴの回数）検索を繰り返しても変数ＭＤが基準値ＳＤ以下にならなければ、補正パラメータ取得処理を終了させるためにステップＳ１１０での判定を行っている。 If the variable MD is greater than the reference value SD (step S109; No), the correction unit 12 determines whether or not the counter variable CT is equal to or greater than the maximum search count LIMIT (eg, 100) (step S110). If the counter variable CT is equal to or greater than the maximum number of searches LIMIT (step S110; Yes), the correction unit 12 determines the current correction parameters as correction parameters used in the later-described identification process (step S114), and ends the correction parameter acquisition process. In this correction parameter acquisition process, there are cases where the value of the variable MD does not become equal to or less than the reference value SD and the variable MD vibrates. If not, the determination in step S110 is made to end the correction parameter acquisition process.

カウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ未満なら（ステップＳ１１０；Ｎｏ）、補正部１２は、変数ＭＣのクラスのクラス予測率合計が１より大きいか否かを判定する（ステップＳ１１１）。変数ＭＣのクラスのクラス予測率合計が１より大きければ（ステップＳ１１１；Ｙｅｓ）、変数ＭＣのクラスの補正パラメータを増やし（ステップＳ１１２）、ステップＳ１０４に戻る。補正パラメータを増やす量は、例えば０．０５のような定数としてもよいし、変数ＭＤ（差分絶対値の最大値）の値に応じて変化させてもよい（ＭＤの値が大きければ補正パラメータを増やす量も大きくし、ＭＤの値が小さければ補正パラメータを増やす量も小さくする）。ただし、この値が大きいと、変数ＭＤの値が振動したり発散したりし易くなるため、計算時間がかかってもよければ、基本的に増やす量はできるだけ小さい値にする方がよい。 If the counter variable CT is less than the maximum search count LIMIT (step S110; No), the correction unit 12 determines whether or not the total class prediction rate of the class of the variable MC is greater than 1 (step S111). If the total class prediction rate of the class of variable MC is greater than 1 (step S111; Yes), the correction parameter of the class of variable MC is increased (step S112), and the process returns to step S104. The amount by which the correction parameter is increased may be a constant such as 0.05, or may be changed according to the value of the variable MD (maximum value of absolute difference) (if the value of MD is large, the correction parameter is The amount of increase is also increased, and if the value of MD is small, the amount of increase of the correction parameter is also decreased). However, if this value is large, the value of the variable MD tends to oscillate or diverge, so basically it is better to set the amount of increase as small as possible if the computation time is acceptable.

変数ＭＣのクラスのクラス予測率合計が１以下なら（ステップＳ１１１；Ｎｏ）、変数ＭＣのクラスの補正パラメータを減らし（ステップＳ１１３）、ステップＳ１０４に戻る。ステップＳ１１２と同様、補正パラメータを減らす量は、例えば０．０５のような定数としてもよいし、変数ＭＤ（差分絶対値の最大値）の値に応じて変化させてもよい（ＭＤの値が大きければ補正パラメータを減らす量も大きくし、ＭＤの値が小さければ補正パラメータを減らす量も小さくする）。ただし、この値が大きいと、変数ＭＤの値が振動したり発散したりし易くなるため、計算時間がかかってもよければ、基本的に減らす量はできるだけ小さい値にする方がよい。 If the total class prediction rate of the class of the variable MC is 1 or less (step S111; No), the correction parameter of the class of the variable MC is decreased (step S113), and the process returns to step S104. As in step S112, the amount by which the correction parameter is reduced may be a constant value such as 0.05, or may be changed according to the value of the variable MD (maximum absolute value of difference) (when the value of MD is If the value of MD is large, the amount by which the correction parameter is decreased is also increased, and if the value of MD is small, the amount by which the correction parameter is decreased is also decreased). However, if this value is large, the value of the variable MD tends to oscillate or diverge, so basically it is better to set the amount to be reduced as small as possible if the calculation time is acceptable.

以上の補正パラメータ取得処理を説明した。次に、上記ステップＳ１０１で行われる識別器学習処理について、図３を参照して説明する。 The above correction parameter acquisition processing has been described. Next, the discriminator learning process performed in step S101 will be described with reference to FIG.

まず、制御部１０は、データ入力部３１を介して正解ラベルが付いている学習データを取得する（ステップＳ２０１）。そして、制御部１０は、学習データを識別器１１に入力し、学習データに付いている正解ラベルに基づいて、識別器１１を学習させる（ステップＳ２０２）。 First, the control unit 10 acquires learning data with a correct label through the data input unit 31 (step S201). Then, the control unit 10 inputs the learning data to the discriminator 11, and causes the discriminator 11 to learn based on the correct label attached to the learning data (step S202).

そして、制御部１０は、学習を終了するか否かを判定する（ステップＳ２０３）。例えば、学習データを予め決められた個数（例えばＭ個）学習させたら学習を終了する。学習させていない学習データが残っている場合等、学習を終了しないなら（ステップＳ２０３；Ｎｏ）、ステップＳ２０１に戻って、次の学習データを取得して識別器１１を学習させる。学習を終了するなら（ステップＳ２０３；Ｙｅｓ）、識別器学習処理を終了する。 Then, the control unit 10 determines whether or not to end learning (step S203). For example, the learning is terminated when a predetermined number (for example, M) of learning data are learned. If learning is not finished (step S203; No), such as when learning data that has not been learned remains (step S203; No), the process returns to step S201 to acquire the next learning data and make the discriminator 11 learn. If learning is to be ended (step S203; Yes), the classifier learning process is ended.

以上、説明した補正パラメータ取得処理（図２）が終了した時点での補正パラメータを、補正部１２は取得し、識別装置１００は、この補正パラメータで識別器１１の出力（予測値）を補正して、入力データの識別を行う。では、識別装置１００の識別処理について、図４を参照して説明する。 The correction unit 12 acquires the correction parameters at the time when the correction parameter acquisition process (FIG. 2) described above is completed, and the identification device 100 corrects the output (predicted value) of the discriminator 11 with the correction parameters. to identify the input data. Now, the identification processing of the identification device 100 will be described with reference to FIG.

まず、制御部１０は、データ入力部３１を介して未知データ（識別するデータ）を取得する（ステップＳ３０１）。そして、制御部１０は、当該未知データを識別器１１に入力し、識別器１１が出力した各クラスの予測値を取得する（ステップＳ３０２）。 First, the control unit 10 acquires unknown data (identifying data) through the data input unit 31 (step S301). Then, the control unit 10 inputs the unknown data to the discriminator 11 and acquires the predicted value of each class output by the discriminator 11 (step S302).

次に、結果取得部１３は、各クラスの予測値を、補正部１２が補正パラメータ取得処理（図２）で取得した補正パラメータで補正する（ステップＳ３０３）。具体的には、各クラスの予測値を該クラスの補正パラメータでべき乗した値に変換し、それを補正予測値とする。そして、結果取得部１３は、補正予測値が最大となるクラスを識別結果として取得し、出力部３２を介して出力する（ステップＳ３０４）。そして識別処理を終了する。ステップＳ３０４は、結果取得ステップとも呼ばれる。 Next, the result obtaining unit 13 corrects the predicted value of each class with the correction parameter obtained by the correction parameter obtaining process (FIG. 2) by the correcting unit 12 (step S303). Specifically, the predicted value of each class is converted to a value raised to the power of the correction parameter of the class, and the converted value is used as the corrected predicted value. Then, the result acquisition unit 13 acquires the class with the largest corrected predicted value as the identification result, and outputs it via the output unit 32 (step S304). Then, the identification processing ends. Step S304 is also called a result acquisition step.

以上の識別処理により、未知データが識別される。そして、この識別は、識別器１１の出力した予測値を補正した補正予測値に基づいて行われているため、学習データのクラス毎の数が不均衡だったとしても、その影響を極力排除して、高精度な識別を行うことができる。 Unknown data is identified by the above identification processing. Since this discrimination is performed based on the corrected predicted value obtained by correcting the predicted value output from the classifier 11, even if the number of learning data for each class is unbalanced, the influence thereof is eliminated as much as possible. Therefore, highly accurate identification can be performed.

次に、識別装置１００による補正パラメータ取得処理（図２）について、具体例で説明する。ここでは、識別器１１は、「Ａ」、「Ｂ」、「Ｃ」の３つのクラスを識別する３クラス識別器であり、図２のステップＳ１０１で学習済みであるとする。また、図５の左に示すような正解ラベルが付けられた２４個の試験データが用意されているものとする。そして、２４個の試験データを図２のステップＳ１０２で識別器１１に入力して得られた出力値（予測値）が、正解ラベルの右に記載されている。図２のステップＳ１０３で、各クラスの補正パラメータは３つのクラスとも１．０に初期化されるので、これを３次元ベクトル（１．０，１．０，１．０）で表すことにする。また、カウンタ変数ＣＴ＝０となる。 Next, the correction parameter acquisition processing (FIG. 2) by the identification device 100 will be described with a specific example. Here, the discriminator 11 is a three-class discriminator that discriminates three classes of "A", "B", and "C", and is presumed to have been learned in step S101 of FIG. It is also assumed that 24 pieces of test data labeled with correct answers as shown on the left side of FIG. 5 are prepared. An output value (predicted value) obtained by inputting 24 pieces of test data to the discriminator 11 in step S102 of FIG. 2 is written to the right of the correct label. At step S103 in FIG. 2, the correction parameters of each class are initialized to 1.0 for all three classes, which is represented by a three-dimensional vector (1.0, 1.0, 1.0). . Also, the counter variable CT=0.

例えば一番上の試験データの場合、正解ラベルは「Ｃ」であり、識別器１１で得られた予測値は（０．７，０．２，０．６８）である。この場合、予測値のベクトルの要素の最大値が０．７であり、０．７は最初の要素（Ａに対応）なので、識別器１１による識別結果は「Ａ」となる。また、補正パラメータの初期値は上述したように、（１．０，１．０，１．０）であり、図２のステップＳ１０４で、予測値をこの補正パラメータで補正すると、（０．７＾１．０，０．２＾１．０，０．６８＾１．０）で、元の予測値と同じ（０．７，０．２，０．６８）が補正予測値として得られ、この補正予測値による識別結果も、予測値による識別結果と同じ「Ａ」となる。 For example, in the case of the test data at the top, the correct label is "C" and the predicted values obtained by the discriminator 11 are (0.7, 0.2, 0.68). In this case, the maximum value of the elements of the predicted value vector is 0.7, and 0.7 is the first element (corresponding to A), so the identification result of the identifier 11 is "A". Also, as described above, the initial values of the correction parameters are (1.0, 1.0, 1.0), and in step S104 of FIG. ^ 1.0, 0.2 ^ 1.0, 0.68 ^ 1.0), and the same (0.7, 0.2, 0.68) as the original prediction values are obtained as corrected prediction values, The identification result based on this corrected predicted value is also "A", which is the same as the identification result based on the predicted value.

図２のステップＳ１０５で、２４個の試験データ全てについて、正解ラベルと補正予測値による識別結果との関係を表にまとめたものが、図５の右に記載されている混同行列である。この混同行列を見ると、正解ラベルが「Ａ」の試験データのうち、識別結果が「Ａ」となったものが１０個、識別結果が「Ｂ」となったものが２個、識別結果が「Ｃ」となったものが０個あることがわかる。同様に、正解ラベルが「Ｂ」の試験データのうち、識別結果が「Ａ」となったものが３個、識別結果が「Ｂ」となったものが５個、識別結果が「Ｃ」となったものが１個あることがわかる。さらに、正解ラベルが「Ｃ」の試験データのうち、識別結果が「Ａ」となったものが２個、識別結果が「Ｂ」となったものが０個、識別結果が「Ｃ」となったものが１個あることがわかる。 The confusion matrix shown on the right side of FIG. 5 is a table summarizing the relationship between the correct label and the identification result based on the corrected prediction value for all 24 pieces of test data in step S105 of FIG. Looking at this confusion matrix, among the test data with the correct answer label of "A", 10 have the classification result of "A", 2 have the classification result of "B", and the classification result of It can be seen that there are 0 "C". Similarly, among the test data with the correct answer label of "B", three have the identification result of "A", five have the identification result of "B", and have the identification result of "C". It turns out that there is one thing that has become. Furthermore, among the test data with the correct answer label of "C", there are 2 items with the identification result of "A", 0 items with the identification result of "B", and the identification result of "C". It turns out that there is one thing.

次に、この混同行列を正規化して、補正パラメータを取得する処理について、図６を参照して説明する。まず、図２のステップＳ１０６で、混同行列を正規化するために、まず、図６の上にあるように各正解ラベルの付いた試験データの個数の合計を求める。そして、図６の上から２番目の表にあるように、混合行列の各行について、要素を合計の値で割ることにより、合計を１に正規化する。 Next, the process of normalizing this confusion matrix and acquiring correction parameters will be described with reference to FIG. First, in step S106 of FIG. 2, in order to normalize the confusion matrix, first, the total number of pieces of test data with each correct label is obtained as shown in the top of FIG. Then, for each row of the mixing matrix, normalize the sum to 1 by dividing the element by the value of the sum, as in the second table from the top of FIG.

そして、図２のステップＳ１０７で、この正規化された混同行列の各列について縦方向に要素の合計値（各クラスの予測率合計）を計算すると、図６に示すように、クラス「Ａ」の予測率合計は１．８３３、クラス「Ｂ」の予測率合計は０．７２２、クラス「Ｃ」の予測率合計は０．４４４となり、これらの値にはかなりばらつきがあることがわかる。この例では、識別結果が「Ａ」となる割合がかなり高く、混同行列の対称性が低くなっている。また平均正解率も０．５７４とかなり低い。なお、ここで、平均正解率とは、正解がクラス「Ｘ」である試験データのうち、識別結果（予測クラス）が「Ｘ」になったものの割合を全てのクラスについて平均した値である。図６においては、上から２番目の表の対角線上に存在している値を平均して、（０．８３３＋０．５５６＋０．３３３）÷３＝０．５７４が得られ、この表の右下にこの平均正解率の数値が記載されている。 Then, in step S107 of FIG. 2, when the total value of the elements in the vertical direction for each column of this normalized confusion matrix (total prediction rate of each class) is calculated, as shown in FIG. is 1.833, the total prediction rate for class "B" is 0.722, and the total prediction rate for class "C" is 0.444. In this example, the proportion of the identification result being "A" is quite high, and the symmetry of the confusion matrix is low. Also, the average accuracy rate is as low as 0.574. Here, the average correct answer rate is a value obtained by averaging the proportion of test data whose correct answer is class "X" and whose identification result (predicted class) is "X" for all classes. In FIG. 6, the values existing on the diagonal line of the second table from the top are averaged to obtain (0.833+0.556+0.333)/3=0.574. The numerical value of this average accuracy rate is described.

混同行列の対称性を高くするには、各列の縦方向の合計値が大きいクラスの予測値を小さい値に補正するか、又は、この合計値が小さいクラスの予測値を大きい値に補正すればよいと考えられる。上述したように識別器１１が出力する予測値は０以上１以下の値であり、この値は１より大きな数でべき乗すると（０以上１以下の範囲内で）元の値より小さな値になり、１より小さな値でべき乗すると（０以上１以下の範囲内で）元の値より大きな値になる。したがって、この性質を利用すると、混同行列の対称性を高くするには、各列の縦方向の合計値が大きいクラスの予測値を１より大きな数でべき乗するか、又は、この合計値が小さいクラスの予測値を１より小さな数でべき乗すればよいことになる。 In order to increase the symmetry of the confusion matrix, the predicted value of a class with a large vertical total value in each column should be corrected to a small value, or the predicted value of a class with a small total value should be corrected to a large value. It is considered to be good. As described above, the predicted value output by the discriminator 11 is a value between 0 and 1, and when this value is raised to a power greater than 1 (within the range between 0 and 1), it becomes a smaller value than the original value. , raised to a power less than 1 (between 0 and 1) yields a greater value than the original. Therefore, using this property, in order to increase the symmetry of the confusion matrix, the predicted value of the class with a large vertical total value in each column should be raised to a power greater than 1, or the total value is small It suffices to raise the predicted value of the class to a power smaller than one.

図６の例では、クラス「Ａ」の予測率合計は１．８３３で最も大きく、クラス「Ｃ」の予測率合計は０．４４４で最も小さい。そこで、クラス「Ａ」の予測値を小さくするために１より大きな数でべき乗し、クラス「Ｃ」の予測値を大きくするために１より小さな数でべき乗することが考えられる。ただし、同時に複数のパラメータを変化させると、各クラスの予測値が急激に変化して、識別結果が予想外に変化してしまう可能性がある。そこで、本実施形態１では、一度に１つのパラメータだけを変化させることとしている。この例では、図２のステップＳ１０８で、基準値１との差分絶対値が最大となるクラスとして、クラス「Ａ」が検索されて、ＭＣ＝「Ａ」となり、差分絶対値の最大値ＭＤ＝｜１．８３３－１｜＝０．８３３となる。そして、カウンタ変数ＣＴはインクリメントされて１となる。 In the example of FIG. 6, class "A" has the highest total prediction rate at 1.833 and class "C" has the lowest total prediction rate at 0.444. Therefore, it is conceivable to raise the predicted value of class "A" to a power greater than 1, and to raise the predicted value of class "C" to a power of less than 1. However, if multiple parameters are changed at the same time, there is a possibility that the prediction value of each class will change abruptly and the classification result will change unexpectedly. Therefore, in the first embodiment, only one parameter is changed at a time. In this example, in step S108 of FIG. 2, the class "A" is searched as the class having the maximum absolute difference value from the reference value 1, MC="A", and the maximum absolute difference value MD= |1.833−1|=0.833. Then, the counter variable CT is incremented to one.

次に図２のステップＳ１０９では、変数ＭＤ（ここでは０．８３３）が基準値ＳＤ（例えば０．０２）以下か否かが判定されて判定はＮｏとなり、ステップＳ１１０に進む。ステップＳ１１０でカウンタ変数ＣＴ（ここでは１）が最大検索回数ＬＩＭＩＴ（例えば１００）以上か否かが判定されて判定はＮｏとなり、ステップＳ１１１に進む。ステップＳ１１１では、クラス「Ａ」のクラス予測率合計である１．８３３が１より大きいか否かが判定されて判定はＹｅｓとなり、ステップＳ１１３でクラス「Ａ」の補正パラメータが増やされる。ここでは増やす量は０．０５であるとする。すると、クラス「Ａ」の補正パラメータは１．０５となり、クラス「Ｂ」及びクラス「Ｃ」の補正パラメータは１のままなので、補正パラメータをベクトルで表すと（１．０５，１．０，１．０）となる。 Next, in step S109 of FIG. 2, it is determined whether or not the variable MD (here, 0.833) is equal to or less than the reference value SD (eg, 0.02), the determination is No, and the process proceeds to step S110. In step S110, it is determined whether or not the counter variable CT (here, 1) is equal to or greater than the maximum search count LIMIT (eg, 100), the determination is No, and the process proceeds to step S111. In step S111, it is determined whether or not 1.833, which is the total class prediction rate of class "A", is greater than 1, the determination is Yes, and in step S113 the correction parameter of class "A" is increased. Here, the amount to be increased is assumed to be 0.05. Then, the correction parameter of class "A" becomes 1.05, and the correction parameters of class "B" and class "C" remain 1. Therefore, if the correction parameters are represented by a vector, (1.05, 1.0, 1 .0).

図示しないが、ステップＳ１１３からステップＳ１０４に戻って、補正パラメータ（１．０５，１．０，１．０）で図５にある識別器の予測値を補正しても、識別結果に変化は生じないので、また同じ混同行列が得られ、ステップＳ１１３まで同じように進み、ステップＳ１１３でクラスＡの補正パラメータは１．１となる。そして、またステップＳ１０４に戻って、補正パラメータ（１．１，１．０，１．０）で図５にある識別器の予測値を補正して補正予測値を取得すると、図７に示すように一部の識別結果が変化する。そして、ステップＳ１０５で混同行列を作成すると、図７の右にあるような混同行列が作成される。 Although not shown, even if the process returns from step S113 to step S104 and corrects the predicted values of the discriminator shown in FIG. Therefore, the same confusion matrix is obtained again, and the same process proceeds to step S113, where the correction parameter for class A becomes 1.1. Then, returning to step S104, the predicted values of the discriminator shown in FIG. part of the identification results change. Then, when the confusion matrix is created in step S105, the confusion matrix shown on the right side of FIG. 7 is created.

そして、ステップＳ１０６で正規化すると、図８の上から２番目の表にあるような正規化した混同行列が得られる。そして、ステップＳ１０７で各クラスの予測率合計を計算すると、クラス「Ａ」の予測率合計は１．１６７、クラス「Ｂ」の予測率合計は１．０５６、クラス「Ｃ」の予測率合計は０．７７８となり、前回の混同行列よりも対称性が高くなったことがわかる。平均正解率も０．７９６と、前回よりも上昇している。 Then, when normalized in step S106, a normalized confusion matrix as shown in the second table from the top in FIG. 8 is obtained. Then, when the total prediction rate of each class is calculated in step S107, the total prediction rate of class "A" is 1.167, the total prediction rate of class "B" is 1.056, and the total prediction rate of class "C" is It is 0.778, and it can be seen that the symmetry is higher than the previous confusion matrix. The average accuracy rate is 0.796, which is higher than the previous time.

そして、今回の場合は、ステップＳ１０８で、基準値１との差分絶対値が最大となるクラスとして、クラス「Ｃ」が検索されて、ＭＣ＝「Ｃ」となり、差分絶対値の最大値ＭＤ＝｜０．７７８－１｜＝０．２２２となる。そして、ステップＳ１１１で、クラス「Ｃ」のクラス予測率合計である０．７７８が１より大きいか否かが判定されて判定はＮｏとなり、ステップＳ１１２でクラス「Ｃ」の補正パラメータが減らされる。ここでは減らす量は０．０５であるとする。すると、クラス「Ｃ」の補正パラメータは０．９５となり、補正パラメータをベクトルで表すと（１．１，１．０，０．９５）となる。 In this case, in step S108, the class "C" is searched as the class having the maximum absolute difference from the reference value 1, MC="C", and the maximum absolute difference MD= |0.778−1|=0.222. Then, in step S111, it is determined whether or not 0.778, which is the total class prediction rate of class "C", is greater than 1, the determination is No, and in step S112, the correction parameter of class "C" is decreased. Assume here that the amount to be reduced is 0.05. Then, the correction parameter for class "C" is 0.95, and the correction parameter is expressed as a vector (1.1, 1.0, 0.95).

そして、またステップＳ１０４に戻って、補正パラメータ（１．１，１．０，０．９５）で図５にある識別器の予測値を補正して補正予測値を取得すると、図９に示すように一部の識別結果が変化する。そして、ステップＳ１０５で混同行列を作成すると、図９の右にあるような混同行列が作成される。 Then, returning to step S104, the predicted values of the discriminator shown in FIG. part of the identification results change. Then, when the confusion matrix is created in step S105, the confusion matrix shown on the right side of FIG. 9 is created.

そして、ステップＳ１０６で正規化すると、図１０の上から２番目の表にあるような正規化した混同行列が得られる。そして、ステップＳ１０７で各クラスの予測率合計を計算すると、クラス「Ａ」の予測率合計は０．８３３、クラス「Ｂ」の予測率合計は１．０５６、クラス「Ｃ」の予測率合計は１．１１１となり、前回の混同行列よりも対称性が高くなったことがわかる。平均正解率も０．９０７と、前回よりもさらに上昇している。 Then, when normalized in step S106, a normalized confusion matrix as shown in the second table from the top in FIG. 10 is obtained. Then, when the total prediction rate of each class is calculated in step S107, the total prediction rate of class "A" is 0.833, the total prediction rate of class "B" is 1.056, and the total prediction rate of class "C" is 1.111, and it can be seen that the symmetry is higher than the previous confusion matrix. The average accuracy rate is 0.907, which is higher than the previous time.

そして、今回の場合は、ステップＳ１０８で、基準値１との差分絶対値が最大となるクラスとして、クラス「Ａ」が検索されて、ＭＣ＝「Ａ」となり、差分絶対値の最大値ＭＤ＝｜０．８３３－１｜＝０．１６６となる。そして、ステップＳ１１１で、クラス「Ａ」のクラス予測率合計である０．８３３が１より大きいか否かが判定されて判定はＮｏとなり、ステップＳ１１２でクラス「Ａ」の補正パラメータが減らされる。ここでは減らす量は０．０５であるとする。すると、クラス「Ａ」の補正パラメータは１．０５となり、補正パラメータをベクトルで表すと（１．０５，１．０，０．９５）となる。このようにして、クラス予測率合計と１との差分絶対値の最大値ＭＤが基準値ＳＤ以下になるか、カウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ以上になるまで処理が繰り返され、最終的に補正パラメータが求まることになる。 In this case, in step S108, the class "A" is searched as the class having the maximum absolute difference value from the reference value 1, MC="A", and the maximum absolute difference value MD= |0.833−1|=0.166. Then, in step S111, it is determined whether or not 0.833, which is the total class prediction rate of class "A", is greater than 1, the determination is No, and in step S112, the correction parameter of class "A" is decreased. Assume here that the amount to be reduced is 0.05. Then, the correction parameter for class "A" is 1.05, and the correction parameter is expressed as a vector (1.05, 1.0, 0.95). In this way, the process is repeated until the maximum value MD of the absolute difference between the total class prediction rate and 1 becomes equal to or less than the reference value SD, or until the counter variable CT becomes equal to or greater than the maximum number of searches LIMIT. parameters will be obtained.

以上説明した識別処理により、識別装置１００は、混同行列の対称性を高めるように識別器１１の予測値を補正することにより、サンプルデータがクラス間で偏っていても、偏りの影響を軽減して、精度の高い識別を行うことができる。 With the above-described identification processing, the identification device 100 corrects the prediction values of the classifier 11 so as to increase the symmetry of the confusion matrix, thereby reducing the influence of bias even if sample data is biased between classes. Therefore, highly accurate identification can be performed.

（変形例１）
実施形態１では、補正部１２は、識別器１１の予測値を補正する際に、予測値を補正パラメータでべき乗する演算を行ったが、補正部１２が予測値を補正する際に用いる演算（補正パラメータによって定まる関数）はこれに限られない。予測値は０以上１以下の値であり、予測値を補正パラメータで補正した後の値（補正予測値）も０以上１以下の値である必要があるという前提があるが、補正パラメータを固定した場合に元の予測値の大小関係が、補正した後も（補正予測値でも）保たれることと、補正パラメータの値の大小に応じて予測値が補正された時の変化方向及び変化度合いが定まることとが満たされるなら、補正部１２は、他の関数を用いて識別器１１の予測値を補正してもよい。 (Modification 1)
In the first embodiment, when correcting the predicted value of the discriminator 11, the correcting unit 12 performs the calculation of exponentiating the predicted value by the correction parameter. function determined by correction parameters) is not limited to this. There is a premise that the predicted value is a value between 0 and 1, and the value after correcting the predicted value with the correction parameter (corrected predicted value) must also be a value between 0 and 1, but the correction parameter is fixed. The magnitude relationship of the original predicted value is maintained even after correction (even the corrected predicted value), and the direction and degree of change when the predicted value is corrected according to the magnitude of the correction parameter value is determined, the correction unit 12 may correct the predicted value of the discriminator 11 using another function.

つまり、補正部１２は、入力値（予測値）として０以上１以下の値を０以上１以下の値の出力値（補正予測値）に変換する単調非減少関数（広義の単調増加関数）であって、当該単調非減少関数（広義の単調増加関数）は補正パラメータによって定まる関数であって、入力値（予測値）を固定した場合に補正パラメータによる出力値（補正予測値）の変化も単調非減少又は単調非増加となるような関数であれば、任意の関数を用いて予測値を補正することが可能である。例えば、入力値を補正パラメータによってべき乗する関数は、これらの条件を満たす関数である。 That is, the correction unit 12 is a monotonically non-decreasing function (monotonically increasing function in a broad sense) that converts a value between 0 and 1 as an input value (predicted value) into an output value (corrected predicted value) between 0 and 1. , the monotonically non-decreasing function (broadly defined monotonically increasing function) is a function determined by the correction parameter, and when the input value (predicted value) is fixed, the change in the output value (corrected predicted value) due to the correction parameter is also monotonous Any function that is non-decreasing or monotonically non-increasing can be used to correct the predicted value. For example, a function that exponentiates an input value by a correction parameter is a function that satisfies these conditions.

（変形例２）
実施形態１では、補正パラメータ取得処理（図２）において、差分絶対値の最大値ＭＤが基準値ＳＤ以下にならなかったら、カウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ内以上になった時の補正パラメータを最終的な補正パラメータとしている。しかし、これに限られない。例えば、ステップＳ１０７とステップＳ１０８の間で、差分絶対値の最大値ＭＤの値がそれまでで最も小さくなった時の補正パラメータを保存しておき、ステップＳ１０９でカウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ以上になった場合には、ステップＳ１１４でその保存した補正パラメータを最終的な補正パラメータとして確定させるようにしてもよい。このようにすることで、差分絶対値の最大値ＭＤの値が振動したり発散したりしてしまうような場合でも、補正部１２は、混同行列の対称性が最も高くなった時の補正パラメータを取得することができる。 (Modification 2)
In the first embodiment, in the correction parameter acquisition process (FIG. 2), if the maximum value MD of the absolute difference value does not become equal to or less than the reference value SD, the correction parameter when the counter variable CT becomes equal to or greater than the maximum number of searches LIMIT is set to It is used as the final correction parameter. However, it is not limited to this. For example, between steps S107 and S108, the correction parameter for when the maximum value MD of the difference absolute value is the smallest is stored, and in step S109, the counter variable CT is set to the maximum number of searches LIMIT or more. If so, the stored correction parameters may be determined as the final correction parameters in step S114. By doing so, even if the value of the maximum value MD of the difference absolute value oscillates or diverges, the correction unit 12 uses the correction parameter when the symmetry of the confusion matrix becomes highest can be obtained.

（変形例３）
実施形態１では、補正パラメータ取得処理（図２）において、各クラスの識別正解率の高低によらずに、混同行列の対称性のみを高くする処理を行っているが、識別正解率の高さを考慮してもよい。例えば、ステップＳ１０５で混同行列を正規化した時に、平均正解率（上述したように、正解がクラス「Ｘ」である試験データのうち、識別結果（予測クラス）が「Ｘ」になったものの割合を全てのクラスについて平均した値であり、図６、図８及び図１０の２番目の混同行列の表の右下の数値）も求めておき、この値がそれまでで最大となる時の補正パラメータを保存する。そして、ステップＳ１０９でカウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ以上になった場合には、ステップＳ１１４でその保存した補正パラメータを最終的な補正パラメータとして確定させるようにしてもよい。このようにすることで、差分絶対値の最大値ＭＤの値が振動したり発散したりしてしまうような場合でも、補正部１２は、平均正解率が最も高くなった時の補正パラメータを取得することができる。 (Modification 3)
In the first embodiment, in the correction parameter acquisition process (FIG. 2), processing is performed to increase only the symmetry of the confusion matrix regardless of the level of the correct identification rate of each class. may be considered. For example, when the confusion matrix is normalized in step S105, the average correct answer rate (as described above, the percentage of test data whose correct answer is class "X", the identification result (predicted class) is "X" is the average value for all classes, and the lower right numerical value of the second confusion matrix table in FIGS. Save parameters. Then, when the counter variable CT reaches or exceeds the maximum search count LIMIT in step S109, the stored correction parameter may be determined as the final correction parameter in step S114. By doing so, even if the value of the maximum value MD of the difference absolute value oscillates or diverges, the correction unit 12 acquires the correction parameter when the average accuracy rate is the highest. can do.

（変形例４）
識別器１１に与えられる入力データ（学習データ及び試験データ）によっては、補正パラメータ取得処理（図２）において、混同行列の対称性の高さと平均正解率の高さとがトレードオフの関係になることがある。そのため、補正パラメータ取得処理（図２）において、補正パラメータを１つに決定してしまうのではなく、ステップＳ１１４において、補正パラメータの候補をいくつかユーザに提示して、ユーザに補正パラメータを選択してもらってもよい。 (Modification 4)
Depending on the input data (learning data and test data) given to the discriminator 11, in the correction parameter acquisition process (FIG. 2), there is a trade-off relationship between the high symmetry of the confusion matrix and the high average accuracy rate. There is Therefore, in the correction parameter acquisition process (FIG. 2), instead of deciding on one correction parameter, several correction parameter candidates are presented to the user in step S114, and the user selects the correction parameter. You can have it.

そのような処理を行うために、制御部１０は、混同行列の対称性が高い（差分絶対値の最大値ＭＤの値が差分基準値（例えば０．１）以下である）時の補正パラメータや、平均正解率が高い（平均正解率が正解率基準値（例えば９０％）以上である）時の補正パラメータを、補正パラメータの候補として記憶部２０に保存しておき、ステップＳ１１４では、これら保存した補正パラメータを用いて作成した混同行列及び平均正解率を、出力部３２を介してユーザに提示する。そして、制御部１０は、ユーザから操作入力部３４を介して補正パラメータを選択してもらい、ユーザが選択した補正パラメータを、補正部１２が取得するようにする。 In order to perform such processing, the control unit 10 has a correction parameter when the confusion matrix has high symmetry (the maximum value MD of the difference absolute value is equal to or less than the difference reference value (for example, 0.1)), , the correction parameters when the average accuracy rate is high (the average accuracy rate is equal to or higher than the accuracy rate reference value (for example, 90%)) are stored in the storage unit 20 as correction parameter candidates. The confusion matrix and average accuracy rate created using the corrected parameters are presented to the user via the output unit 32 . Then, the control unit 10 causes the user to select a correction parameter via the operation input unit 34, and causes the correction unit 12 to acquire the correction parameter selected by the user.

混同行列の対称性の高さと平均正解率の高さとがトレードオフの関係になる場合であっても、変形例４に係る識別装置１００では、ユーザに補正パラメータの候補の中から採用する補正パラメータを選択してもらうことにより、妥当な補正パラメータを用いて識別を行うことができる。 Even if there is a trade-off relationship between the high symmetry of the confusion matrix and the high average accuracy rate, the identification device 100 according to Modification 4 allows the user to select the correction parameter to be adopted from among the correction parameter candidates. is selected, identification can be performed using appropriate correction parameters.

なお、上述の実施形態及び変形例では、学習データと試験データは別のデータであるものとして説明したが、学習データの全部又は一部を試験データとして用いて上述の識別器学習処理（図３）及び補正パラメータ取得処理（図２）を行ってもよい。 In the above-described embodiment and modified example, the learning data and the test data are described as different data. ) and correction parameter acquisition processing (FIG. 2) may be performed.

なお、上述の実施形態及び変形例では、識別器１１を実現するプログラムを制御部１０が実行することにより、制御部１０は識別器１１としても機能することとしていたが、これに限られない。識別装置１００は、制御部１０とは別に（例えば、ＧＰＵ（ＧｒａｐｈｉｃｓＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）や、専用のＩＣ（ＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）等の）識別器１１の機能を実現するデバイスを備えてもよい。 In the above-described embodiment and modification, the control unit 10 executes the program that implements the discriminator 11 so that the control unit 10 also functions as the discriminator 11. However, the present invention is not limited to this. The identification device 100 may include a device (for example, a GPU (Graphics Processing Unit), a dedicated IC (Integrated Circuit), etc.) that realizes the function of the identification device 11 , separately from the control unit 10 .

また、上述の実施形態及び変形例は適宜組み合わせることができる。例えば、変形例２と変形例３とを組み合わせ、混同行列の対称性を平均正解率よりも重要視する処理とする場合には、ステップＳ１０９でカウンタ変数ＣＴが最大検索回数ＬＩＭＩＴ以上になった時に、ステップＳ１１４で、差分絶対値の最大値ＭＤの値がそれまでで最も小さくなった時の補正パラメータを識別処理で用いる補正パラメータとして確定させるが、そのような補正パラメータが複数ある場合（複数の補正パラメータにおいて、差分絶対値の最大値ＭＤの値が最小になる場合）には、それぞれの補正パラメータにおける平均正解率を比較し、平均正解率が最も高くなる補正パラメータを識別処理で用いる補正パラメータとして確定させる処理にすればよい。 Also, the above-described embodiments and modifications can be combined as appropriate. For example, in the case of combining Modification 2 and Modification 3 to make the symmetry of the confusion matrix more important than the average correct answer rate, when the counter variable CT becomes equal to or greater than the maximum number of searches LIMIT in step S109, , in step S114, the correction parameter at which the value of the maximum value MD of the absolute difference is the smallest is established as the correction parameter to be used in the identification process. When the maximum value MD of the difference absolute value is the minimum in the correction parameters), the average accuracy rate for each correction parameter is compared, and the correction parameter with the highest average accuracy rate is used in the identification process. It is sufficient to perform processing to determine as .

なお、識別装置１００の各機能は、通常のＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）等のコンピュータによっても実施することができる。具体的には、上記実施形態では、識別装置１００が行う識別処理のプログラムが、記憶部２０のＲＯＭに予め記憶されているものとして説明した。しかし、プログラムを、フレキシブルディスク、ＣＤ－ＲＯＭ（ＣｏｍｐａｃｔＤｉｓｃＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）、ＭＯ（Ｍａｇｎｅｔｏ－ＯｐｔｉｃａｌＤｉｓｃ）、メモリカード、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリ等のコンピュータ読み取り可能な記録媒体に格納して配布し、そのプログラムをコンピュータに読み込んでインストールすることにより、上述の各機能を実現することができるコンピュータを構成してもよい。 Each function of the identification device 100 can also be performed by a computer such as a normal PC (Personal Computer). Specifically, in the above embodiment, the identification processing program performed by the identification device 100 is pre-stored in the ROM of the storage unit 20 . However, the program may be stored in a computer-readable storage medium such as a flexible disk, CD-ROM (Compact Disc Read Only Memory), DVD (Digital Versatile Disc), MO (Magneto-Optical Disc), memory card, USB (Universal Serial Bus) memory, etc. By storing and distributing the program in a recording medium, and reading and installing the program in the computer, a computer capable of realizing each of the functions described above may be configured.

以上、本発明の好ましい実施形態について説明したが、本発明は係る特定の実施形態に限定されるものではなく、本発明には、特許請求の範囲に記載された発明とその均等の範囲が含まれる。以下に、本願出願の当初の特許請求の範囲に記載された発明を付記する。 Although the preferred embodiments of the present invention have been described above, the present invention is not limited to such specific embodiments, and the present invention includes the invention described in the claims and their equivalents. be The invention described in the original claims of the present application is appended below.

（付記１）
入力データの属するクラスを識別する各クラスの予測値を出力する識別器と、
前記識別器の出力する各クラスの予測値をクラス毎の補正パラメータを用いて補正した補正予測値で識別したクラスに基づいて作成した混同行列の対称性を高める前記補正パラメータを取得する補正部と、
前記識別器の出力する予測値を、前記補正部が取得した補正パラメータを用いて補正して、最終的な識別結果を取得する結果取得部と、
を備える識別装置。 (Appendix 1)
a classifier that identifies the class to which the input data belongs and outputs a predicted value for each class;
a correction unit that acquires the correction parameter that enhances the symmetry of the confusion matrix created based on the class identified by the corrected predicted value obtained by correcting the predicted value of each class output from the classifier using the correction parameter for each class; ,
a result acquisition unit that acquires a final identification result by correcting the predicted value output by the discriminator using the correction parameter acquired by the correction unit;
identification device.

（付記２）
前記補正部は、前記混同行列の各クラスの正解データ数が１になるように前記混同行列を正規化した時の、前記正規化後の混同行列の各クラスの予測率合計が、できるだけ１に近づく前記補正パラメータを取得する、
付記１に記載の識別装置。 (Appendix 2)
The correcting unit reduces the total prediction rate of each class of the confusion matrix after normalization to 1 as much as possible when the confusion matrix is normalized so that the number of correct data in each class of the confusion matrix is 1. obtaining the correction parameters that approach;
The identification device according to appendix 1.

（付記３）
前記補正部は、
前記補正パラメータによって定まる単調非減少関数であって、０以上１以下の入力値を０以上１以下の出力値に変換する前記単調非減少関数を用いて前記識別器の出力する予測値を補正して補正予測値を取得し、
前記補正予測値で識別したクラスに基づいて前記混同行列を作成し、
前記作成した前記混同行列の各クラスの正解データ数が１になるように前記混同行列を正規化した時の、前記正規化後の混同行列の各クラスの予測率合計が、できるだけ１に近づく前記補正パラメータを取得する、
付記２に記載の識別装置。 (Appendix 3)
The correction unit is
correcting the predicted value output from the discriminator using the monotonically non-decreasing function determined by the correction parameter, which converts an input value of 0 or more and 1 or less to an output value of 0 or more and 1 or less; to get the corrected predictions, and
creating the confusion matrix based on the classes identified by the corrected predicted values;
When the confusion matrix is normalized so that the number of correct data in each class of the created confusion matrix is 1, the total prediction rate of each class of the confusion matrix after normalization approaches 1 as much as possible get the correction parameters,
The identification device according to appendix 2.

（付記４）
前記単調非減少関数は、前記識別器の出力する予測値を、前記補正パラメータでべき乗する演算を行う関数である、
付記３に記載の識別装置。 (Appendix 4)
The monotone non-decreasing function is a function that performs an operation of exponentiating the predicted value output by the discriminator by the correction parameter,
The identification device according to appendix 3.

（付記５）
前記補正部は、前記各クラスの正解率を平均した平均正解率を取得し、前記混同行列の対称性を高める前記補正パラメータの中で、前記平均正解率が最大となる前記補正パラメータを取得する、
付記１から４のいずれか１つに記載の識別装置。 (Appendix 5)
The correction unit acquires an average accuracy rate by averaging the accuracy rate of each class, and acquires the correction parameter that maximizes the average accuracy rate among the correction parameters that increase the symmetry of the confusion matrix. ,
5. Identification device according to any one of appendices 1 to 4.

（付記６）
識別器の出力する各クラスの予測値をクラス毎の補正パラメータを用いて補正した補正予測値で識別したクラスに基づいて作成した混同行列の対称性を高める前記補正パラメータを取得する補正パラメータ取得ステップと、
前記識別器の出力する予測値を、前記補正パラメータ取得ステップで取得した補正パラメータを用いて補正して、最終的な識別結果を取得する結果取得ステップと、
を含む識別方法。 (Appendix 6)
A correction parameter acquisition step of acquiring the correction parameter that enhances the symmetry of the confusion matrix created based on the class identified by the corrected predicted value obtained by correcting the predicted value of each class output by the discriminator using the correction parameter for each class and,
a result obtaining step of obtaining a final identification result by correcting the predicted value output by the discriminator using the correction parameter obtained in the correction parameter obtaining step;
identification methods, including

（付記７）
識別器を備える識別装置のコンピュータに、
前記識別器の出力する各クラスの予測値をクラス毎の補正パラメータを用いて補正した補正予測値で識別したクラスに基づいて作成した混同行列の対称性を高める前記補正パラメータを取得する補正パラメータ取得ステップ、及び、
前記識別器の出力する予測値を、前記補正パラメータ取得ステップで取得した補正パラメータを用いて補正して、最終的な識別結果を取得する結果取得ステップ、
を実行させるためのプログラム。 (Appendix 7)
In the computer of the identification device equipped with the identifier,
Acquisition of correction parameter for enhancing symmetry of a confusion matrix created based on the class identified by the corrected predicted value obtained by correcting the predicted value of each class output from the classifier using a correction parameter for each class step, and
A result obtaining step of obtaining a final identification result by correcting the predicted value output by the classifier using the correction parameter obtained in the correction parameter obtaining step;
program to run the

１０…制御部、１１…識別器、１２…補正部、１３…結果取得部、２０…記憶部、３１…データ入力部、３２…出力部、３３…通信部、３４…操作入力部、１００…識別装置 DESCRIPTION OF SYMBOLS 10... Control part 11... Discriminator 12... Correction part 13... Result acquisition part 20... Storage part 31... Data input part 32... Output part 33... Communication part 34... Operation input part 100... identification device

Claims

a discriminator that outputs a predicted value for each class that discriminates each of a plurality of classes to which unknown data to be input belongs;
a correction unit that corrects the predicted value of each class output by the discriminator using a correction parameter of each class;
A creation unit that creates a confusion matrix based on the class identified by the predicted value corrected by the correction unit;
a modifying unit that modifies the correction parameter so that the deviation of the total number of samples of each class for each correct label of the classifier in the confusion matrix generated by the generating unit is smaller among the plurality of classes; ,
a result acquisition unit that acquires a final identification result by correcting the predicted value output by the classifier using the correction parameter corrected by the correction unit;
An identification device comprising:

The correction unit reduces the total prediction rate of each class of the confusion matrix after normalization to 1 as much as possible when the confusion matrix is normalized so that the number of correct data in each class of the confusion matrix is 1. characterized by modifying the correction parameters to approximate
2. Identification device according to claim 1.

The correction unit is
correcting the predicted value output from the discriminator using the monotonically non-decreasing function determined by the correction parameter, which converts an input value of 0 or more and 1 or less to an output value of 0 or more and 1 or less; ,
The correction unit
When the confusion matrix created by the creation unit is normalized so that the number of correct data in each class of the confusion matrix is 1, the total prediction rate of each class of the confusion matrix after normalization is as much as possible characterized by modifying the correction parameter to approach 1,
3. Identification device according to claim 2.

The monotone non-decreasing function is a function that performs an operation to raise the predicted value output by the classifier to the power of the correction parameter,
4. Identification device according to claim 3.

The correcting unit obtains an average accuracy rate by averaging the accuracy rate of each class , and adjusts the bias of the total number of samples of each class for each correct label of the classifier in the confusion matrix between the plurality of classes. Obtaining the correction parameter that maximizes the average accuracy rate among the correction parameters to be made smaller by
Identification device according to any one of claims 1 to 4.

a correction step of correcting the predicted value of each class output by a discriminator that outputs a predicted value of each class for identifying each of a plurality of classes to which unknown data to be input belongs, using a correction parameter of each class;
A creation step of creating a confusion matrix based on the classes identified by the predicted values corrected in the correction step;
A correction step of correcting the correction parameter so that the deviation of the total number of samples of each class for each correct label of the classifier in the confusion matrix created in the creation step is smaller among the plurality of classes. and,
a result obtaining step of obtaining a final identification result by correcting the predicted value output by the discriminator using the correction parameter corrected in the correcting step;
A method of identification comprising:

A computer of an identification device equipped with a classifier that outputs a predicted value for each class that identifies each of a plurality of classes to which unknown data that is input belongs,
correction means for correcting the predicted value of each class output by the discriminator using a correction parameter of each class;
creating means for creating a confusion matrix based on the classes identified by the predicted values corrected by the correcting means;
Correction means for correcting the correction parameter so that the deviation of the total number of samples of each class for each correct label of the classifier in the confusion matrix created by the creation means is smaller among the plurality of classes;
result acquisition means for acquiring a final identification result by correcting the predicted value output by the classifier using the correction parameter corrected by the correction means;
A program to function as