JP7290596B2

JP7290596B2 - Determination device, determination method and determination program

Info

Publication number: JP7290596B2
Application number: JP2020057093A
Authority: JP
Inventors: 晋作清本
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2020-03-27
Filing date: 2020-03-27
Publication date: 2023-06-13
Anticipated expiration: 2040-03-27
Also published as: JP2021157502A

Description

The present invention relates to a determination device, a determination method, and a determination program that use machine learning.

Attacks using adversarial samples are a known threat to deep learning inference. An attacker adds a small perturbation to input data and feeds the result into a trained model in order to deliberately induce misclassification. In supervised learning, the decision boundary is formed from the data given at training time, i.e., the training data. Because the decision boundary depends on the training data, it may deviate from the true spatial structure of the data. Adversarial samples exploit this deviation, so a small perturbation is enough to induce misclassification.

One countermeasure against attacks using adversarial samples is a training technique called adversarial learning (adversarial training). In adversarial learning, perturbations are deliberately added to the training data to produce a classification model that is robust against adversarial samples. For example, the algorithm proposed in Non-Patent Document 1 is known to be highly robust against this attack.

A. Madry et al., "Towards Deep Learning Models Resistant to Adversarial Attacks," ICLR 2018.

However, adversarial learning requires training data that correspond to adversarial samples as input, so it is not necessarily appropriate for the attack methods actually encountered. It also requires modifying the processing of existing learning algorithms, which increases the computational load.

An object of the present invention is to provide a determination device, a determination method, and a determination program that can easily make a trained model resistant to attacks that induce erroneous determinations.

A determination device according to the present invention includes: a conversion unit that performs conversion processing on input data to generate a plurality of mutually different pieces of converted data; a learning unit that generates separate trained models, one from each of the mutually different pieces of converted data obtained by applying the conversion processing to training data for machine learning; an input unit that inputs the mutually different pieces of converted data obtained by applying the conversion processing to determination target data into the respectively corresponding trained models and obtains a plurality of individual determination results; and a determination unit that outputs an overall determination result based on the distribution of the plurality of individual determination results.

The determination unit may output the overall determination result by majority vote over the individual determination results.

The conversion unit may transform a one-dimensional vector serving as the input data by each of a plurality of matrices.

Each matrix may be an identity matrix in which some of the 0s are replaced with 1s.

In a determination method according to the present invention, a computer executes: a conversion step of performing conversion processing on input data to generate a plurality of mutually different pieces of converted data; a learning step of generating separate trained models, one from each of the mutually different pieces of converted data obtained by applying the conversion processing to training data for machine learning; an input step of inputting the mutually different pieces of converted data obtained by applying the conversion processing to determination target data into the respectively corresponding trained models and obtaining a plurality of individual determination results; and a determination step of outputting an overall determination result based on the distribution of the plurality of individual determination results.

A determination program according to the present invention causes a computer to function as the determination device.

According to the present invention, a trained model can easily be made resistant to attacks that induce erroneous determinations.

FIG. 1 is a diagram showing the functional configuration of the determination device in the embodiment.
FIG. 2 is a diagram illustrating the matrices in the embodiment.
FIG. 3 is a diagram showing the flow of the determination method in the embodiment.

An example of an embodiment of the present invention is described below.
FIG. 1 is a diagram showing the functional configuration of the determination device 1 according to this embodiment.
The determination device 1 is an information processing device (computer) such as a server or a personal computer and includes, in addition to a control unit 10 and a storage unit 20, input/output devices for various data, communication devices, and the like.

The control unit 10 controls the determination device 1 as a whole and implements each function of this embodiment by reading and executing, as appropriate, the various programs stored in the storage unit 20. The control unit 10 may be a CPU.

The storage unit 20 is a storage area for the various programs and data that cause the hardware to function as the determination device 1, and may be a ROM, RAM, flash memory, hard disk drive (HDD), or the like. Specifically, the storage unit 20 stores the program (determination program) that causes the control unit 10 to execute each function of this embodiment, the learning models, and the like.

The control unit 10 includes a conversion unit 11, a learning unit 12, an input unit 13, and a determination unit 14.
Using these functional units, the control unit 10 outputs a determination result for input data by means of machine learning.

The conversion unit 11 performs conversion processing on input data, that is, on training data for machine learning or on determination target data, and generates a plurality of mutually different pieces of converted data.
The input data may be a one-dimensional vector. For example, the conversion unit 11 transforms a one-dimensional vector a serving as the input data by each of a plurality of matrices H_i, as a'_i = H_i · a.
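The transformation a'_i = H_i · a can be illustrated with a minimal sketch. The function names `transform` and `transform_all` and the 3x3 example matrices are hypothetical and chosen only for illustration; the source does not specify an implementation.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def transform(matrix: Matrix, vector: Vector) -> Vector:
    """Return the matrix-vector product H_i . a as a new list."""
    if any(len(row) != len(vector) for row in matrix):
        raise ValueError("matrix width must match vector length")
    return [sum(h * x for h, x in zip(row, vector)) for row in matrix]


def transform_all(matrices: List[Matrix], vector: Vector) -> List[Vector]:
    """Produce one converted vector a'_i per matrix H_i."""
    return [transform(h, vector) for h in matrices]


# Two hypothetical matrices (illustrative values, not taken from the source).
H_1 = [[1, 1, 0], [0, 1, 0], [0, 0, 1]]
H_2 = [[1, 0, 0], [0, 1, 1], [0, 0, 1]]
a = [2.0, 3.0, 5.0]

# One input vector yields one converted vector per matrix.
transformed = transform_all([H_1, H_2], a)
```

Each model in the ensemble would receive only the converted vector produced by its own matrix.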

FIG. 2 is a diagram illustrating the matrices H_i in this embodiment.
Each matrix H_i may be, for example, an identity matrix in which some of the 0s are replaced with 1s.
Other modifications, such as replacing 1s with 0s or swapping rows, may also be applied, but the entries of the identity matrix are preferably retained. Retaining them preserves the features of the original input data after conversion, and the inclusion of non-linear terms further strengthens resistance to attacks.
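One way to build such matrices is sketched below. The source does not say how the 0s to be replaced are chosen, so the seeded random selection, the function name `make_matrix`, and the parameters `extra_ones` and `seed` are all assumptions made for illustration.

```python
import random
from typing import List

Matrix = List[List[int]]


def make_matrix(n: int, extra_ones: int, seed: int) -> Matrix:
    """Return an n x n identity matrix with `extra_ones` off-diagonal 0s set to 1."""
    off_diagonal = [(r, c) for r in range(n) for c in range(n) if r != c]
    if not 0 <= extra_ones <= len(off_diagonal):
        raise ValueError("extra_ones is out of range for this matrix size")
    # Assumption: a seeded generator, so the same rule can be rebuilt at inference time.
    rng = random.Random(seed)
    matrix = [[1 if r == c else 0 for c in range(n)] for r in range(n)]
    for r, c in rng.sample(off_diagonal, extra_ones):
        matrix[r][c] = 1  # the diagonal (identity) entries are left untouched
    return matrix


# One hypothetical matrix per learning model.
matrices = [make_matrix(4, extra_ones=3, seed=i) for i in range(3)]
```

Different seeds do not guarantee distinct matrices, so a real system would need to check that the generated conversion rules actually differ.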

The conversion processing is not limited to matrix transformation. A predetermined conversion rule is defined for each learning model (AI), and the same conversion processing is applied to both the training data and the determination target data.

The learning unit 12 generates separate, independent trained models, for example DNNs (Deep Neural Networks), one from each of the mutually different pieces of converted data obtained by applying the conversion processing of the conversion unit 11 to the training data for machine learning.
The learning algorithm may be common to the plurality of learning models or may differ among them.

The input unit 13 inputs the mutually different pieces of converted data, obtained by applying the conversion processing of the conversion unit 11 to the determination target data, into the respectively corresponding trained models and obtains a plurality of individual determination results.

The determination unit 14 outputs an overall determination result based on the distribution of the plurality of individual determination results.
In doing so, the determination unit 14 may output the overall determination result by majority vote over the individual determination results. For example, when deciding between two classes, the class with more individual determination results is taken as the overall determination result; when deciding among three or more classes, the class with the most individual determination results is taken.
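The majority decision can be sketched in a few lines. The function name `majority_vote` is hypothetical, and the tie-breaking behaviour (the label seen first wins) is an assumption, since the source does not say how ties are resolved.

```python
from collections import Counter
from typing import Hashable, Sequence


def majority_vote(individual_results: Sequence[Hashable]) -> Hashable:
    """Return the class label that appears most often among the individual results."""
    if not individual_results:
        raise ValueError("at least one individual determination result is required")
    counts = Counter(individual_results)
    # Assumption: on a tie, most_common keeps the label that was encountered first.
    return counts.most_common(1)[0][0]


two_class = majority_vote(["benign", "malicious", "benign"])
multi_class = majority_vote([0, 1, 1, 2, 1])
```

Using an odd number of models avoids ties in the two-class case.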

FIG. 3 is a diagram showing the flow of the determination method in this embodiment.
First, for a plurality of (for example, three or more) learning models AI_i, the conversion unit 11 converts the same training data X (a set of vectors) using mutually different conversion rules (for example, the matrices H_i) and generates a plurality of pieces of converted data X'_i (1).

Next, the learning unit 12 uses the converted data X'_i to perform independent machine learning for each learning model AI_i (2).
This yields a plurality of mutually different trained models AI_i that have learned from the same training data.

After that, when determination target data a is input to the determination device 1, the conversion unit 11 generates a plurality of pieces of converted data a'_i using the same conversion rules as at training time (3).

Next, the input unit 13 inputs each piece of generated converted data a'_i into the corresponding trained model AI_i and obtains a plurality of individual determination results (4).

Finally, the determination unit 14 outputs an overall determination result based on these individual determination results, for example by majority vote (5).
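Steps (1) to (5) can be put together in a small end-to-end sketch. A nearest-centroid classifier stands in for each learning model AI_i purely to keep the example self-contained; the source mentions DNNs. The names `CentroidModel`, `train_ensemble`, and `determine`, and all data values, are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Sequence

Vector = List[float]
Matrix = List[List[float]]


def transform(h: Matrix, a: Vector) -> Vector:
    """Conversion rule for one model: a'_i = H_i . a."""
    return [sum(w * x for w, x in zip(row, a)) for row in h]


class CentroidModel:
    """Stand-in for one learning model AI_i (nearest class centroid, not a DNN)."""

    def fit(self, xs: Sequence[Vector], ys: Sequence[int]) -> "CentroidModel":
        grouped: Dict[int, List[Vector]] = {}
        for x, y in zip(xs, ys):
            grouped.setdefault(y, []).append(x)
        # Per-class mean of every feature.
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in grouped.items()
        }
        return self

    def predict(self, x: Vector) -> int:
        def sq_dist(c: Vector) -> float:
            return sum((p - q) ** 2 for p, q in zip(x, c))

        return min(self.centroids, key=lambda label: sq_dist(self.centroids[label]))


def train_ensemble(matrices: List[Matrix], xs: Sequence[Vector], ys: Sequence[int]) -> List[CentroidModel]:
    # Steps (1)-(2): convert the same training data with each H_i, train one model each.
    return [CentroidModel().fit([transform(h, x) for x in xs], ys) for h in matrices]


def determine(matrices: List[Matrix], models: List[CentroidModel], a: Vector) -> int:
    # Steps (3)-(4): apply the same rules to the target data and query each model.
    votes = [m.predict(transform(h, a)) for h, m in zip(matrices, models)]
    # Step (5): majority vote over the individual determination results.
    return Counter(votes).most_common(1)[0][0]


matrices = [
    [[1, 1, 0], [0, 1, 0], [0, 0, 1]],
    [[1, 0, 0], [0, 1, 1], [0, 0, 1]],
    [[1, 0, 1], [0, 1, 0], [0, 0, 1]],
]
X = [[0.0, 0.1, 0.0], [0.2, 0.0, 0.1], [1.0, 0.9, 1.0], [0.8, 1.0, 0.9]]
y = [0, 0, 1, 1]
models = train_ensemble(matrices, X, y)
result = determine(matrices, models, [0.9, 1.0, 1.0])
```

The pairing of each matrix with its own model is the essential point: a model must only ever see data converted by the rule it was trained with.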

According to this embodiment, the determination device 1 generates separate trained models, one from each of the mutually different pieces of converted data obtained by applying the conversion processing to the training data for machine learning, and inputs the mutually different pieces of converted data obtained by applying the same conversion processing to the determination target data into the respectively corresponding trained models. The determination device 1 then outputs an overall determination result based on the distribution of the plurality of individual determination results.

Consequently, even if an adversarial sample causes an erroneous determination in some of the trained models, it is not easy to cause erroneous determinations in the majority of the independent trained models, so the determination device 1 can output a highly reliable determination result by combining the individual determination results. The determination device 1 can therefore easily be made resistant to attacks that induce erroneous determinations by a trained model.
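The intuition behind this robustness claim can be illustrated numerically for the two-class case. The sketch below assumes that a given adversarial sample fools each model independently with the same probability; this independence is a simplifying assumption of the example, not something the source states or guarantees, and an attacker who targets all models jointly could violate it. The function name `majority_fooled_probability` is hypothetical.

```python
from math import comb


def majority_fooled_probability(n_models: int, p_fooled: float) -> float:
    """Probability that more than half of n independent models are fooled.

    Assumption: each model is fooled independently with probability p_fooled.
    """
    threshold = n_models // 2 + 1  # smallest count that forms a strict majority
    return sum(
        comb(n_models, k) * p_fooled**k * (1 - p_fooled) ** (n_models - k)
        for k in range(threshold, n_models + 1)
    )


# Hypothetical numbers: three models, each fooled with probability 0.1.
p_three = majority_fooled_probability(3, 0.1)
```

Under these assumptions, three models that are each fooled 10% of the time yield a wrong majority about 2.8% of the time.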

By outputting the overall determination result through a majority vote over the individual determination results, the determination device 1 can easily make a highly reliable determination.

By transforming a one-dimensional vector serving as the input data by each of a plurality of matrices, the determination device 1 keeps the load of the conversion processing low. The determination device 1 can therefore easily increase resistance to attacks with only lightweight preprocessing.

Each matrix used for the conversion processing may be an identity matrix in which some of the 0s are replaced with 1s. This allows the determination device 1 to increase resistance to attacks by adding non-linear terms without impairing the features of the input data.

Although an embodiment of the present invention has been described above, the present invention is not limited to that embodiment. The effects described in the embodiment are merely a list of the most preferable effects arising from the present invention, and the effects of the present invention are not limited to those described in the embodiment.

The determination method performed by the determination device 1 is implemented in software. In that case, the programs constituting the software are installed on an information processing device (computer). These programs may be distributed to users on removable media such as CD-ROMs, or by being downloaded to users' computers over a network. They may also be provided to users' computers as a Web service over a network without being downloaded.

1 Determination device
10 Control unit
11 Conversion unit
12 Learning unit
13 Input unit
14 Determination unit
20 Storage unit

Claims

1. A determination device comprising:
a conversion unit that performs conversion processing on input data to generate a plurality of mutually different pieces of converted data;
a learning unit that generates separate trained models, one from each of the mutually different pieces of converted data obtained by applying the conversion processing to training data for machine learning;
an input unit that inputs the mutually different pieces of converted data obtained by applying the conversion processing to determination target data into the respectively corresponding trained models and obtains a plurality of individual determination results; and
a determination unit that outputs an overall determination result based on the distribution of the plurality of individual determination results,
wherein the conversion unit transforms a one-dimensional vector serving as the input data by each of a plurality of matrices.

2. The determination device according to claim 1, wherein the determination unit outputs the overall determination result by majority vote over the individual determination results.

3. The determination device according to claim 1, wherein each matrix is an identity matrix in which some of the 0s are replaced with 1s.

4. A determination method in which a computer executes:
a conversion step of performing conversion processing on input data to generate a plurality of mutually different pieces of converted data;
a learning step of generating separate trained models, one from each of the mutually different pieces of converted data obtained by applying the conversion processing to training data for machine learning;
an input step of inputting the mutually different pieces of converted data obtained by applying the conversion processing to determination target data into the respectively corresponding trained models and obtaining a plurality of individual determination results; and
a determination step of outputting an overall determination result based on the distribution of the plurality of individual determination results,
wherein, in the conversion step, a one-dimensional vector serving as the input data is transformed by each of a plurality of matrices.

5. A determination program for causing a computer to function as the determination device according to any one of claims 1 to 3.