JP6835559B2

JP6835559B2 - Privacy protection data provision system

Info

Publication number: JP6835559B2
Application number: JP2016239460A
Authority: JP
Inventors: 雄一清; 拓史奥村; 大須賀　昭彦; 昭彦大須賀
Original assignee: THE UNIVERSITY OF ELECTRO-COMUNICATINS; Mitsubishi Research Institute Inc
Current assignee: THE UNIVERSITY OF ELECTRO-COMUNICATINS; Mitsubishi Research Institute Inc
Priority date: 2016-12-09
Filing date: 2016-12-09
Publication date: 2021-02-24
Anticipated expiration: 2036-12-09
Also published as: JP2018097467A

Description

本発明は、プライバシ保護データ提供システムに関する。 The present invention relates to the privacy protection data providing system.

近年、個人データなどのプライバシ保護が必要なデータを公開する際に、差分プライバシと称される処理を施して、個々のデータのプライバシを確保した上で、適正なデータ解析が実行できるようにしたものが提案されている。 In recent years, when publishing data that requires privacy protection, such as personal data, a process called differential privacy has been performed to ensure the privacy of individual data, and then appropriate data analysis can be performed. Things have been proposed.

データに対して差分プライバシの処理を施す際には、プライバシの保護レベルが、「ε」で示されるプライバシ指標で示される。プライバシ指標「ε」の値が０に近づくほど、データの保護レベルが高く、プライバシ指標「ε」の値が大きいほど、データの保護レベルが低くなる。 When processing differential privacy on data, the privacy protection level is indicated by the privacy index indicated by "ε". The closer the value of the privacy index "ε" is to 0, the higher the data protection level, and the larger the value of the privacy index "ε", the lower the data protection level.

具体的には、あるデータベースＤを匿名化して差分プライバシの処理を施す匿名学習アルゴリズムＡが存在するとき、この匿名学習アルゴリズムＡは、確率的要素を含むアルゴリズムになる。すなわち、データベースＤを、確率的要素を含む匿名学習アルゴリズムＡで匿名化したときには、確率的要素を含むために、処理を施す毎に異なる匿名化済データｓ１，ｓ２，・・・，ｓｎが得られる。ここで、データベースＤと、そのデータベースＤから１レコードだけ異なるデータとしたデータベースＤ′とを用意し、それぞれのデータベースＤ，Ｄ′の集合Ｓの特定のデータｓｉ（データｓｉはデータｓ１〜ｓｎのいずれか）になる確率の比が、プライバシ指標「ε」を使ったｅｘｐ（ε）以下になるとき、この匿名学習アルゴリズムＡは、差分プライバシを満たすアルゴリズムになる。 Specifically, when there is an anonymous learning algorithm A that anonymizes a certain database D and performs differential privacy processing, the anonymous learning algorithm A becomes an algorithm including a probabilistic element. That is, when the database D is anonymized by the anonymous learning algorithm A including the stochastic element, different anonymized data s1, s2, ..., Sn are obtained each time the processing is performed because the database D contains the stochastic element. Be done. Here, a database D and a database D'with data different from the database D by one record are prepared, and specific data si of the set S of the respective databases D and D'(data si is the data s1 to sn. When the ratio of the probability of becoming (any) is less than or equal to exp (ε) using the privacy index “ε”, this anonymous learning algorithm A becomes an algorithm that satisfies the differential privacy.

この差分プライバシを満たす点を、より分かりやすく述べると、例えば、多数の個人情報からなる特定のデータベースＤに、ある任意の一人のデータを追加（又は削除）したものを、データベースＤ′とする。ここで、データベースＤを匿名学習アルゴリズムＡで差分プライバシの処理を施して匿名化した結果と、データベースＤ′を匿名学習アルゴリズムＡで差分プライバシの処理を施して匿名化した結果とが、ほとんど変わらないとき（つまり上述した閾値ｅｘｐ（ε）を超えないとき）、プライバシが守られた状態で、データベースＤが公開されたと言える。 To describe the point of satisfying this differential privacy more clearly, for example, a database D'in which the data of an arbitrary person is added (or deleted) to a specific database D composed of a large amount of personal information is referred to as a database D'. Here, there is almost no difference between the result of anonymizing the database D by performing differential privacy processing by the anonymous learning algorithm A and the result of anonymizing the database D'by applying the differential privacy processing by the anonymous learning algorithm A. When (that is, when the above-mentioned threshold exp (ε) is not exceeded), it can be said that the database D is open to the public while the privacy is maintained.

これは、データベースＤを構成する各データで特定される個人から見たとき、一人一人のデータの有無に関わらず、結果がほぼ同じであるため、プライバシが守られた状態と見なせることになる。言い換えると、データベースＤとデータベースＤ′のいずれであっても、結果が同じになることを意味している。
特許文献１には、差分プライバシを満たして、データを集計する手法の一例についての記載がある。 This can be regarded as a state in which privacy is protected because the results are almost the same regardless of the presence or absence of each person's data when viewed from the individual specified by each data constituting the database D. In other words, it means that the result will be the same regardless of whether it is database D or database D'.
Patent Document 1 describes an example of a method of aggregating data by satisfying differential privacy.

特開２０１６−１２０７４号公報Japanese Unexamined Patent Publication No. 2016-12704

上述したように、差分プライバシの処理を施す匿名学習アルゴリズムを作成することで、データの匿名化が可能であるが、実際には、どのようなデータベース構成であっても、確率の比がｅｘｐ（ε）以下になる条件を満たして、かつニューラルモデルの精度が高くなるような機械学習を行う匿名学習アルゴリズムの作成は難しいという問題があった。 As described above, data can be anonymized by creating an anonymous learning algorithm that performs differential privacy processing, but in reality, the probability ratio is exp (regardless of the database configuration). ε) There is a problem that it is difficult to create an anonymous learning algorithm that performs machine learning that satisfies the following conditions and increases the accuracy of the neural model.

本発明は、匿名化された深層学習モデルを形成する際に、どのようなデータであっても、精度の高い好適な匿名化された深層学習モデルが得られるプライバシ保護データ提供システムを提供することを目的とする。 The present invention, in forming the anonymized deep learning model, whatever the data and provides privacy protection data providing system that precise preferred anonymized deep learning model is obtained The purpose is.

本発明の一側面のプライバシ保護データ提供システムは、データベース内の生データに対して、深層学習アルゴリズムを適用して深層学習モデルを得る深層学習処理部と、深層学習処理部で得られた深層学習モデルに対して、差分プライバシに基づく匿名化処理を施して、匿名モデルを得る匿名化処理部とを備えたプライバシ保護データ提供システムである。
ここで、匿名化処理部は、深層学習モデルに含まれる重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 The privacy protection data providing system of one aspect of the present invention is a deep learning processing unit that applies a deep learning algorithm to raw data in a database to obtain a deep learning model, and a deep learning processing unit obtained by the deep learning processing unit. It is a privacy protection data providing system provided with an anonymization processing unit that obtains an anonymization model by performing anonymization processing based on differential privacy on the model.
Here, the anonymization processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter included in the deep learning model, and each parameter to which an error is given based on the Laplace distribution. However, when the range of the threshold value indicated by the maximum value and the minimum value is exceeded, the limit is limited to the range of the threshold value.

また、本発明の他の側面のプライバシ保護データ提供システムは、データベース内の生データに対して、差分プライバシに基づく匿名化処理を施しながら、深層学習アルゴリズムを適用して深層学習済の匿名モデルを得る深層学習処理部を備えたプライバシ保護データ提供システムである。
ここで、深層学習処理部は、深層学習モデルを得る演算時に使用する重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 Further, the privacy-protected data providing system of another aspect of the present invention applies a deep learning algorithm to anonymize the raw data in the database based on differential privacy to obtain a deep-learned anonymous model. It is a privacy protection data providing system equipped with a deep learning processing unit to obtain.
Here, the deep learning processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter used in the calculation for obtaining the deep learning model, and also gives an error based on the Laplace distribution. When each parameter exceeds the threshold range indicated by the maximum value and the minimum value, the parameter is limited to the threshold range.

本発明によれば、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことで、誤差を与えてデータの匿名化を行っても、データの変動範囲を適正な範囲に制限することができ、適切な匿名化ができるようになる。その結果、匿名化による深層学習モデルの精度低下を軽減できるようになる。 According to the present invention, when each parameter to which an error is given based on the Laplace distribution exceeds the threshold range indicated by the maximum value and the minimum value, the error is given by limiting to the threshold range. Even if the data is anonymized, the fluctuation range of the data can be limited to an appropriate range, and the appropriate anonymization can be performed. As a result, it becomes possible to reduce the decrease in accuracy of the deep learning model due to anonymization.

本発明の第１の実施の形態例による処理システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the processing system according to 1st Embodiment of this invention. 本発明の第１の実施の形態例による匿名化処理部内で、ラプラス分布に基づいた誤差を与える構成例を示すブロック図である。It is a block diagram which shows the structural example which gives an error based on a Laplace distribution in the anonymization processing part by the 1st Embodiment of this invention. 本発明の第１の実施の形態例による処理の流れの例を示すフローチャートである。It is a flowchart which shows the example of the processing flow by the example of 1st Embodiment of this invention. 本発明の第１の実施の形態例による深層学習の概要を示す説明図である。It is explanatory drawing which shows the outline of the deep learning by the example of 1st Embodiment of this invention. 本発明の第１の実施の形態例による実験例を示す説明図である。It is explanatory drawing which shows the experimental example by the 1st Embodiment of this invention. 本発明の第２の実施の形態例による処理システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the processing system according to the 2nd Embodiment of this invention. 本発明の第２の実施の形態例による処理の流れの例を示すフローチャートである。It is a flowchart which shows the example of the processing flow by the 2nd Embodiment example of this invention. 本発明の第２の実施の形態例による実験例を示す説明図である。It is explanatory drawing which shows the experimental example by the 2nd Embodiment of this invention. 本発明の各実施の形態例による誤差の付与と閾値への制限例（例１）の概略を示す説明図である。It is explanatory drawing which shows the outline of the example of giving an error and limiting to a threshold value (Example 1) by each embodiment of this invention. 本発明の各実施の形態例による誤差の付与と閾値への制限例（例２）の概略を示す説明図である。It is explanatory drawing which shows the outline of the example (Example 2) of giving an error and limiting to a threshold value by each embodiment of this invention.

＜１．第１の実施の形態例＞
以下、本発明の第１の実施の形態例を、図１〜図５を参照して説明する。 <1. Example of the first embodiment>
Hereinafter, examples of the first embodiment of the present invention will be described with reference to FIGS. 1 to 5.

［システム全体の構成］
図１は、第１の実施の形態例のプライバシ保護データ提供システムの構成を示す。
データベース１には、個人情報が含まれる多数の生データが蓄積され、データベース１に蓄積された生データが、深層学習処理部２に供給される。深層学習処理部２は、予め用意された深層学習アルゴリズムを適用した演算を行い、生データを深層学習した深層学習モデル３を得る。 [System-wide configuration]
FIG. 1 shows the configuration of the privacy protection data providing system of the first embodiment.
A large amount of raw data including personal information is accumulated in the database 1, and the raw data accumulated in the database 1 is supplied to the deep learning processing unit 2. The deep learning processing unit 2 performs an operation applying a deep learning algorithm prepared in advance to obtain a deep learning model 3 in which raw data is deep learned.

そして、深層学習処理部２で得た深層学習モデル３が、匿名化処理部１０に供給される。匿名化処理部１０は、供給された深層学習モデル３に対して、差分プライバシに基づく匿名化処理を施して、匿名化済みの深層学習モデル４（以下、「匿名化モデル４」と称する）を得る。 Then, the deep learning model 3 obtained by the deep learning processing unit 2 is supplied to the anonymization processing unit 10. The anonymization processing unit 10 performs anonymization processing based on differential privacy on the supplied deep learning model 3 to obtain an anonymized deep learning model 4 (hereinafter referred to as “anonymization model 4”). obtain.

匿名化処理部１０が、差分プライバシに基づいて匿名化モデル４を得る際には、深層学習モデル３に含まれる重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいて誤差を与えて、差分プライバシの処理を施す。但し、それぞれのパラメータ値にラプラス分布に基づいた誤差を与える際には、その誤差として、最大値及び最小値を示す閾値で制限するようにした。
ラプラス分布に基づいた誤差を与えるということは、誤差を与えたパラメータ値が、確率的要素を含む値になり、結果的に匿名化が行われた匿名化モデル４が得られることになる。 When the anonymization processing unit 10 obtains the anonymization model 4 based on the differential privacy, an error is added to each parameter value based on the Laplace distribution with respect to the weight parameter and the bias parameter included in the deep learning model 3. Give and perform differential privacy processing. However, when giving an error based on the Laplace distribution to each parameter value, the error is limited by the threshold value indicating the maximum value and the minimum value.
Giving an error based on the Laplace distribution means that the parameter value to which the error is given becomes a value including a stochastic element, and as a result, an anonymization model 4 in which anonymization is performed is obtained.

［ε−差分プライバシの処理構成］
図２は、匿名化処理部１０の機能を示すブロック図である。
図２に示すように、匿名化処理部１０は、データ入力部１１、ε入力部１２、パラメータ構造決定部１３、パラメータ初期値決定部１４、閾値決定部１５、閾値超え判定部１６及び閾値計算部１７を備える。更に、匿名化処理部１０は、匿名化演算部１８及びデータ出力部１９を備える。 [Ε-Differential privacy processing configuration]
FIG. 2 is a block diagram showing the function of the anonymization processing unit 10.
As shown in FIG. 2, the anonymization processing unit 10 includes a data input unit 11, an ε input unit 12, a parameter structure determination unit 13, a parameter initial value determination unit 14, a threshold value determination unit 15, a threshold value exceeding determination unit 16, and a threshold value calculation. A unit 17 is provided. Further, the anonymization processing unit 10 includes an anonymization calculation unit 18 and a data output unit 19.

データ入力部１１には、深層学習モデルのデータが入力され、このデータが匿名化演算部１８に供給される。ε入力部１２には、差分プライバシの処理を行う際の指標「ε」が入力され、指標「ε」が、匿名化演算部１８に供給される。 The data of the deep learning model is input to the data input unit 11, and this data is supplied to the anonymization calculation unit 18. An index "ε" for performing differential privacy processing is input to the ε input unit 12, and the index "ε" is supplied to the anonymization calculation unit 18.

パラメータ構造決定部１３は、深層学習モデル３のパラメータ構造を決める機能を有し、このパラメータ構造決定部１３で決定された深層学習モデル３のパラメータ構造が、匿名化演算部１８に供給される。なお、パラメータ構造決定部１３で決定されるパラメータ構造には、少なくとも重みパラメータとバイアスパラメータが含まれる。そして、匿名化演算部１８は、これら重みパラメータとバイアスパラメータに誤差を与える処理を行う。 The parameter structure determination unit 13 has a function of determining the parameter structure of the deep learning model 3, and the parameter structure of the deep learning model 3 determined by the parameter structure determination unit 13 is supplied to the anonymization calculation unit 18. The parameter structure determined by the parameter structure determination unit 13 includes at least a weight parameter and a bias parameter. Then, the anonymization calculation unit 18 performs a process of giving an error to these weight parameters and bias parameters.

パラメータ初期値決定部１４は、上述した重みパラメータとバイアスパラメータのパラメータ初期値を決定する。このパラメータ初期値は、匿名化演算部１８に供給され、匿名化演算部１８は、このパラメータ初期値を用いて、パラメータ構造決定部１３で決定されるパラメータ構造の初期値を決定する。 The parameter initial value determination unit 14 determines the parameter initial values of the weight parameter and the bias parameter described above. This parameter initial value is supplied to the anonymization calculation unit 18, and the anonymization calculation unit 18 determines the initial value of the parameter structure determined by the parameter structure determination unit 13 using this parameter initial value.

閾値決定部１５は、ラプラス分布に基づいて得た誤差を設定する際の最大値と最小値を制限するための閾値を決定する。この閾値決定部１５における閾値の決定の際には、後述する閾値計算部１７での計算結果が利用される。
閾値超え判定部１６は、匿名化演算部１８が演算を行う際に、パラメータ構造決定部１３で決定した誤差値が、閾値決定部１５で決定した閾値（最大値又は最小値）を超えたか否かを判定する。 The threshold value determination unit 15 determines a threshold value for limiting the maximum value and the minimum value when setting the error obtained based on the Laplace distribution. When determining the threshold value in the threshold value determination unit 15, the calculation result in the threshold value calculation unit 17 described later is used.
In the threshold value exceeding determination unit 16, whether or not the error value determined by the parameter structure determination unit 13 exceeds the threshold value (maximum value or minimum value) determined by the threshold value determination unit 15 when the anonymization calculation unit 18 performs the calculation. Is determined.

閾値計算部１７は、閾値を設定するための計算を行い、計算結果を匿名化演算部１８に供給する。
匿名化演算部１８は、閾値超え判定部１６での判定結果が、閾値を超えていた場合には閾値を誤差値とする処理を行う。匿名化演算部１８で演算した結果は、データ出力部１９から出力される。 The threshold value calculation unit 17 performs a calculation for setting the threshold value, and supplies the calculation result to the anonymization calculation unit 18.
The anonymization calculation unit 18 performs a process of setting the threshold value as an error value when the determination result in the threshold value exceeding determination unit 16 exceeds the threshold value. The result of calculation by the anonymization calculation unit 18 is output from the data output unit 19.

［全体の処理の流れ］
図３は、第１の実施の形態例のプライバシ保護データ提供システムでの処理の流れを示すフローチャートである。
まず、深層学習処理部２は、データベース１から生データを取得する（ステップＳ１１）。そして、深層学習処理部２は、取得した生データに対して、予め用意された深層学習アルゴリズムを適用して深層学習を行い（ステップＳ１２）、深層学習処理の結果として、深層学習済モデルを取得する（ステップＳ１３）。 [Overall processing flow]
FIG. 3 is a flowchart showing a processing flow in the privacy protection data providing system of the first embodiment.
First, the deep learning processing unit 2 acquires raw data from the database 1 (step S11). Then, the deep learning processing unit 2 applies a deep learning algorithm prepared in advance to the acquired raw data to perform deep learning (step S12), and as a result of the deep learning processing, acquires a deep learning model. (Step S13).

次に、ステップＳ１３で取得した深層学習済モデルに対して、匿名化処理部１０が、匿名化処理を行う（ステップＳ１４）。この匿名化処理を行う際には、閾値による制限を設定した上で、ラプラス分布に基づく誤差の付与を行う。
なお、ステップＳ１４において、匿名化処理の制限に使用される閾値は、匿名化処理部１０における、重みパラメータの変動量の最大値及び最小値を示す閾値と、バイアスパラメータの変動量の最大値及び最小値を示す閾値である。これらの閾値の生成処理（ステップＳ２０）の詳細については数式を用いて後述する。
そして、匿名化処理部１０によるステップＳ１４での匿名化処理の実行で、匿名化モデルを取得し（ステップＳ１５）、得られた匿名化モデルをデータ出力部１９から出力する。 Next, the anonymization processing unit 10 performs anonymization processing on the deep learning model acquired in step S13 (step S14). When performing this anonymization process, an error is added based on the Laplace distribution after setting a limit by a threshold value.
In step S14, the threshold values used to limit the anonymization process are the threshold value indicating the maximum and minimum values of the fluctuation amount of the weight parameter in the anonymization processing unit 10, the maximum value of the fluctuation amount of the bias parameter, and It is a threshold value indicating the minimum value. Details of these threshold generation processes (step S20) will be described later using mathematical formulas.
Then, by executing the anonymization process in step S14 by the anonymization processing unit 10, an anonymization model is acquired (step S15), and the obtained anonymization model is output from the data output unit 19.

［深層学習の詳細］
次に、ここまで説明したステップＳ１２〜Ｓ１５の各処理の詳細について説明する。
まず、図４を参照して、深層学習が行われる例について説明する。
図４において、Ｈ^（ｌ）は、深層学習の１番目の層を示す。図４はＬ＝３の例であり、全体でＬ＋１個の層を持っている。入力層はＨ^（０）、出力層はＨ^（Ｌ）である。それぞれの層は、複数（又は１つ）のノードを有する。ノードＮ_ｉ ^（ｌ）は、層Ｈ^（ｌ）のｉ番目のノードを表し、ｎ^（ｌ）は層Ｈ^（ｌ）におけるノードの個数を表す。層Ｈ^（ｌ）には、ノードＮ_１ ^（ｌ），Ｎ_２ ^（ｌ），・・・，Ｎ_ｎ（ｌ） ^（ｌ）がある。 [Details of deep learning]
Next, the details of each process of steps S12 to S15 described so far will be described.
First, an example in which deep learning is performed will be described with reference to FIG.
In FIG. 4, H ^(l) represents the first layer of deep learning. FIG. 4 is an example of L = 3, and has L + 1 layers as a whole. The input layer is H ⁽⁰⁾ and the output layer is H ^(L) . Each layer has a plurality of (or one) nodes. Node _N ^{i (l)} represents the i-th node in layer ^{^{H (l), n (l}} ) is the number of nodes in layer ^{H (l).} The layer H ^(l) has nodes N ₁ ^(l) , N ₂ ^(l) , ..., N _{n (l)} ^(l) .

また、図４において、ｗ_ｉｊ ^（ｌ）は、ノードＮ_ｉ ^{（ｌ−１）}とノードＮ_ｊ ^（ｌ）の間の重みパラメータを表す。ｂ_ｊ ^（ｌ）は、ノードＮ_ｊ ^（ｌ）へのバイアスパラメータを表す。Ｆ^（ｌ）は、層Ｈ^（ｌ）の活性化関数を表す。ｘ_ｉ ^（ｌ）はノードＮ_ｉ ^（ｌ）への入力を表し、ｙ_ｉ ^（ｌ）はノードＮ_ｉ ^（ｌ）からの出力を表す。
これらの入出力の値は、以下の式で計算される。 Further, in FIG. 4, _wij ^(l) represents a weight parameter between the node N _i ^(l-1) and the node N _j ^(l). b _j ^(l) represents a bias parameter to node N _j ^(l). F ^(l) represents the activation function of layer H ^(l). x _i ^(l) represents the input to the node _Ni ^(l) _{, and y i} ^(l) represents the output from the node _Ni ^(l).
These input / output values are calculated by the following formula.

ここで、ｔ_ｉは、ノードＮ_ｉ ^（Ｌ）の目標出力値を表し、Ｍは誤差関数を表す。誤差関数Ｍは、入力としてｙ_ｉ ^（Ｌ）及びｔ_ｉを取り、その誤差の値を返す。
学習データは、いくつかのバッチと呼ばれるまとまりに分割される。以下のプロセスは各バッチに対して行われる。 Here, _{t i} represents the target output value of the node _N ^{i (L),} M represents an error function. Error function M takes _y ^{i (L)} and _{t i} as input, returns the value of the error.
The training data is divided into groups called batches. The following process is performed for each batch.

バッチ内の各レコードに対して、深層学習アルゴリズムにより、ｙ_ｉ ^（Ｌ）を計算する（ｉ＝１，・・・，ｎ^（Ｌ））。
次に、深層学習アルゴリズムにより、各ノードＮ_ｉ ^（ｌ）における誤差信号（δ_ｉ ^（ｌ）とおく）を計算する。ｌ＝Ｌのとき、δ_ｉ ^（Ｌ）は以下の［数２］式のように計算される。 For each record in the batch, y _i ^(L) is calculated by the deep learning algorithm (i = 1, ..., n ^(L) ).
Next, the deep learning algorithm to calculate each node _N ⁱ (put and δ _i ^(l)) an error signal in ^(l). When l = L, δ _i ^(L) is calculated by the following equation [Equation 2].

ｌ＝１，・・・・，Ｌ−１に対しては、δ_ｉ ^（ｌ）は以下の［数３］式のように計算される。 For l = 1, ..., L-1, δ _i ^(l) is calculated by the following equation [Equation 3].

そして、深層学習アルゴリズムにより、δ_ｉ ^（ｌ）をバッチ内の各レコードに対して計算し、その総和を新たにδ_ｉ ^（ｌ）とおく。
次に、変動量Δｗ_ｉｊ ^（ｌ）を、以下のように定義する。 _{Then, δ i} ^(l) is calculated for each record in the batch by the deep learning algorithm, and the sum is newly set as δ _i ^(l) .
Next, the variation amount [Delta] w _ij a ^(l), defined as follows.

最後に、深層学習アルゴリズムにより、各重みパラメータｗ_ｉｊ ^（ｌ） for ｌ＝１，・・・，Ｌ，ｉ＝１，・・・，ｎ^{（ｌ−１）}， and ｊ＝１，・・・，ｎ^（ｌ）を、以下の［数５］式のように更新する。 Finally, according to the deep learning algorithm, each weight parameter _wij ^(l) for l = 1, ..., L, i = 1, ..., n ^(l-1) , and j = 1, ... , N ^(l) is updated as shown in the following equation [Equation 5].

ここで、学習率α、正則項λは、事前に決定しておく。
バイアスパラメータに関しては、以下のように更新する。 Here, the learning rate α and the regular term λ are determined in advance.
The bias parameters are updated as follows.

ここで、Δｂ_ｊ ^（ｌ）＝δ_ｊ ^（ｌ）である。
この［数１］式から［数６］式のプロセスを、全てのバッチに対して行う。
また、このプロセスを複数回繰り返す。この繰り返し回数をエポック数と呼ぶ。エポック数は、深層学習を行う前に事前引用文献、又は学習を進めながら決定する。 Here, Δb _j ^(l) = δ _j ^(l) .
The process of the formulas [Equation 1] to [Equation 6] is performed for all batches.
Also, this process is repeated multiple times. This number of repetitions is called the epoch number. The number of epochs is determined by pre-cited references or while proceeding with learning before deep learning.

［ε−差分プライバシの詳細］
次に、ε−差分プライバシについて説明する。
例えば、データベースＤとデータベースＤ′は、最大で１レコードだけ異なるとする。ランダム機構Ａは、出力の全ての集合Ｙについて、以下の［数７］式の条件が成り立つとき、ε−差分プライバシを実現する。 [Ε-Details of differential privacy]
Next, ε-difference privacy will be described.
For example, database D and database D'are different by a maximum of one record. The random mechanism A realizes ε-difference privacy when the following condition of Eq. [Equation 7] is satisfied for all sets Y of outputs.

データベースＤとデータベースＤ′とを、１レコードだけ異なるデータベースであると考える。入力のデータベースとして理論上可能性のある全てのデータベースの集合をＱとおく。このとき、ｆを、ｆ：Ｑ→Ｒである関数とする。ここで、全てのデータベースＤ及びデータベースＤ′に対して以下の［数８］式が成立するとき、Δｆをｆのグローバルセンシティビティ（global sensitivity）、つまりｆの値が取り得る範囲と定義する。 Consider database D and database D'as databases that differ by one record. Let Q be the set of all databases that are theoretically possible as input databases. At this time, let f be a function such that f: Q → R. Here, when the following [Equation 8] equation holds for all databases D and database D', Δf is defined as the global sensitivity of f, that is, the range in which the value of f can be taken.

次に、ラプラスメカニズムと呼ばれる、ε−差分プライバシを満たす匿名化のメカニズムを説明する。
Lap(v)を、平均０、スケールがｖであるラプラス分布に基づいてランダムな誤差を出力する関数であるとする。このとき、ある関数ｆに対して、ランダムメカニズムＡが、ｆ（Ｄ）＋Lap（Δｆ／ε）を出力するとき、ランダムメカニズムＡは、ε−差分プライバシを満たす。 Next, an anonymization mechanism that satisfies ε-differential privacy, called the Laplace mechanism, will be described.
Let Lap (v) be a function that outputs a random error based on a Laplace distribution with a mean of 0 and a scale of v. At this time, when the random mechanism A outputs f (D) + Lap (Δf / ε) for a certain function f, the random mechanism A satisfies ε-difference privacy.

ここでは、誤差ｂを与える対象の変数が、１つのデータの有無によって変動し得る値の幅の最大値をｄとおく。ここでの最大値ｄは、実際の値ではなく、匿名化前のデータベースとして想定し得る値の幅から算出する。そして、誤差ｂ＝ｄ／εとする。つまり、最大値ｄの値が大きく、εが小さいほど、誤差ｂの値が大きくなり、与えられる誤差が大きくなる。 Here, let d be the maximum value of the range of values in which the variable to be given the error b can fluctuate depending on the presence or absence of one data. The maximum value d here is calculated from the range of values that can be assumed as a database before anonymization, not the actual value. Then, the error b = d / ε is set. That is, the larger the value of the maximum value d and the smaller the ε, the larger the value of the error b and the larger the given error.

なお、深層学習の重みパラメータやバイアスパラメータは複数存在する。これらパラメータの集合に対してε−差分プライバシを満たすこともできるが、本実施の形態では、個々のパラメータに対して個別にε−差分プライバシを満たすようにする。
このように個々のパラメータに対して個別にε−差分プライバシを満たすようにする場合には、ランダム機構Ａは、各パラメータにおける出力の全ての集合Ｙについて、以下の式が成り立ち、個々のパラメータに対して個別にε−差分プライバシを満たすことになる。なお、データベースＤとデータベースＤ′は、最大で１レコードだけ異なる。 There are a plurality of weight parameters and bias parameters for deep learning. Although it is possible to satisfy the ε-differential privacy for the set of these parameters, in the present embodiment, the ε-differential privacy is individually satisfied for each parameter.
In this way, when the ε-difference privacy is individually satisfied for each parameter, the random mechanism A holds the following equation for all sets Y of outputs in each parameter, and the following equation holds for each parameter. On the other hand, the ε-differential privacy is satisfied individually. Note that database D and database D'are different by a maximum of one record.

［各パラメータの閾値設定例］
次に、重みパラメータｗ_ｉｊ ^（ｌ）とバイアスパラメータｂ_ｊ ^（ｌ）に対して値の閾値を設定する処理について説明する。なお、この処理は、図３のステップＳ２０の処理に相当する。
この処理は、１レコードだけ異なるときに変わりうる値の、理論上の最大値（グローバルセンシティビティ）を減少させることで、パラメータに与える誤差を減少させるために行われる。これにより、深層学習モデルの精度低下を軽減させる、つまり精度の向上を図ることができる。 [Example of threshold setting for each parameter]
Next, a description will be given of a process for setting a threshold value for the weight parameter _w ^{ij (l)} and the bias parameter _b ^{j (l).} This process corresponds to the process of step S20 in FIG.
This process is performed to reduce the error given to the parameter by reducing the theoretical maximum value (global sensitivity) of the value that can change when only one record is different. As a result, it is possible to reduce the decrease in accuracy of the deep learning model, that is, to improve the accuracy.

ここでは、重みパラメータｗ_ｉｊ ^（ｌ）の最大値をｗ_ｍａｘ、最小値をｗ_ｍｉｎとする。また、バイアスパラメータｂ_ｊ ^（ｌ）の最大値をｂ_ｍａｘ、最小値をｂ_ｍｉｎとする。
また、本実施の形態では、深層学習への入力値（学習データ）にも閾値を設定する。この入力値の閾値は、ここでは［０，１］とする。ここでの閾値[０，１]とは、最小値を“０”とし、最大値を“１”として、“０”以上“１”以下に制限することを意味する。 Here, the maximum value of the weight parameter _wij ^(l) _{is w max} , and the minimum value is w _min . Further, the maximum value of the bias parameter b _j ^(l) _{is b max} , and the minimum value is b _min .
Further, in the present embodiment, a threshold value is also set for the input value (learning data) for deep learning. The threshold value of this input value is set to [0,1] here. The threshold value [0,1] here means that the minimum value is "0" and the maximum value is "1", and the threshold value is limited to "0" or more and "1" or less.

本実施の形態では、匿名化処理部１０は、深層学習を行った後、学習済重みパラメータｗ_ｉｊ ^（ｌ）に対して誤差を与える。つまり、深層学習時の全てのｉ，ｊ，ｌ（図３参照）に対して、ｗ_ｉｊ ^（ｌ）＋Lap（ｗ_ｍａｘ−ｗ_ｍｉｎ／ε）を計算する。この計算結果を、ｒ_ｉｊ ^（ｌ）とおく。もし、計算結果ｒ_ｉｊ ^（ｌ）の値が、最大値ｗ_ｍａｘを超えた場合、重みパラメータｗ_ｉｊ ^（ｌ）の値を最大値（閾値）ｗ_ｍａｘに修正する。
同様に、もし計算結果ｒ_ｉｊ ^（ｌ）の値が、最小値ｗ_ｍｉｎを下回った場合、重みパラメータｗ_ｉｊ ^（ｌ）の値を最小値（閾値）ｗ_ｍｉｎに修正する。 In the present embodiment, the anonymization processing unit 10 gives an error to _{the learned weight parameter wij} ^{(l) after performing deep learning.} _{That is, wij} ^(l) + Lap (w _max −w _min / ε) is calculated for all i, j, l (see FIG. 3) during deep learning. This calculation result is referred to as _rij ^(l) . If the value of the calculation result _rij ^(l) _{exceeds the maximum value w max} , the value of the weight parameter _wij ^(l) is corrected to the maximum value (threshold value) w _max.
Similarly, if the value of the _{calculation result rij} ^(l) is less than the _{minimum value w min} _{, the value of the weight parameter wij} ^(l) is corrected to the minimum value (threshold value) w _min.

また、この最大値及び最小値で制限する処理を、バイアスパラメータｂ_ｊ ^（ｌ）に対しても行う。つまり、バイアスパラメータｂ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｂ_ｍａｘ，ｍａｘ（ｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｂ_ｍａｘ−ｂ_ｍｉｎ）／ε）））に設定する。 Further, the process of limiting this maximum value and the minimum value is also performed with respect to the bias parameters b _{j ^(l).} That is, the calculation result of the bias parameter b _j ^(l) is set to min (b _max , max (b _min , b _j ^(l) + Lap ((b _max −b _min ) / ε)))).

［閾値を設定したときにε−差分プライバシを満たすことの説明］
次に、閾値（最大値、最小値）で誤差を制限したときのパラメータが、ε−差分プライバシを満たしたものであることを説明する。
上述したように、本実施の形態では、深層学習時の重みパラメータｗ_ｉｊ ^（ｌ）やバイアスパラメータｂ_ｊ ^（ｌ）（図４参照）として、重みパラメータｗ_ｉｊ ^（ｌ）の理論上の最大幅（グローバルセンシティビティ）は（ｗ_ｍａｘ−ｗ_ｍｉｎ）であり、バイアスパラメータｂ_ｊ ^（ｌ）の理論上の最大幅（グローバルセンシティビティ）は（ｂ_ｍａｘ−ｂ_ｍｉｎ）である。次に説明するように、学習済み重みパラメータｗ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｗ_ｍａｘ，ｍａｘ（ｗ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｗ_ｍａｘ−ｗ_ｍｉｎ）／ε）））に設定し、学習済みバイアスパラメータｂ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｂ_ｍａｘ，ｍａｘ（ｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｂ_ｍａｘ−ｂ_ｍｉｎ）／ε）））に設定することで、ε−差分プライバシを満たすことができる。 [Explanation of satisfying ε-differential privacy when threshold is set]
Next, it will be described that the parameters when the error is limited by the threshold value (maximum value, minimum value) satisfy the ε-difference privacy.
As described above, in this embodiment, as the weight parameters _w ^ij during deep learning ^(l) and the bias parameter _b ^{j (l)} (see FIG. 4), the maximum width of the theoretical weight parameter _w ^{ij (l)} (Global sensitivity) is (w _max −w _min ), and the theoretical maximum width (global sensitivity _{) of the bias parameter b j} ^(l _{) is (b max} −b _min ). As will be described next, _{the calculation result of the learned weight parameter w j} ^(l) is calculated as min (w _max , max (w _min , b _j ^(l) + Lap ((w _max −w _min ) / ε))). And set the calculation result of the learned bias parameter b _j ^(l) _{to min (b max} , max (b _min , b _j ^(l) + Lap ((b _max −b _min ) / ε)))). Therefore, the ε-differential privacy can be satisfied.

ランダムメカニズムＡが、ｍｉｎ（ｆ_ｍｉｎ，ｍａｘ（ｆ_ｍａｘ，ｆ（Ｄ）＋Lap（Δｆ／ε）））を出力するとき、ランダムメカニズムＡはε−差分プライバシを実現する。ここで、ｆ_ｍａｘ及びｆ_ｍｉｎは、ｆ（Ｄ）が取り得る理論上の最大値と最小値である。
ここで、データベースＤと、そのデータベースＤに対して１レコードだけ異なるデータベースＤ′をおく。
また、Ｆ（Ｄ）＝ｆ（Ｄ）＋Lap（Δｆ／ε）とおく。Ｆ（Ｄ）の値が［ｆ_ｍｉｎ，ｆ_ｍａｘ］の範囲に入るとき、［数７］式が成立する。 When the random mechanism A _{outputs min (f min} , max (f _max , f (D) + Lap (Δf / ε))), the random mechanism A realizes ε-differential privacy. Here, f _max and f _min are the theoretical maximum and minimum values that f (D) can take.
Here, a database D and a database D'that differs from the database D by one record are placed.
Further, it is set as F (D) = f (D) + Lap (Δf / ε). When the value of F (D) _{falls within the range of [f min} , f _max ], the equation [Equation 7] is established.

次に、Ｆ（Ｄ）の値がｆ_ｍｉｎを下回る場合を考える。このとき、Ａ（Ｄ）の出力値はｆ_ｍｉｎになる。Ａ（Ｄ）の出力がｆ_ｍｉｎになる確率は、次の［数９］式で表される。 Next, consider the case where the value of F (D) is _{less than f min.} At this time, the output value of A (D) becomes _{f min.} The probability that the output of A (D) _{will be f min} is expressed by the following equation [Equation 9].

［数９］式において、Lap（ｖ，ｕ）は、スケールパラメータがｖであり、平均との差がｕである、ラプラス分布の確率密度関数の値を表す。
同様に、Ａ（Ｄ′）の出力値がｆ_ｍｉｎとなる確率は、次の［数１０］式で表される。 In the equation [Equation 9], Lap (v, u) represents the value of the probability density function of the Laplace distribution in which the scale parameter is v and the difference from the average is u.
Similarly, the probability that the output value of A (D') is f _min is expressed by the following equation [Equation 10].

［数９］式の値と、［数１０］式の値の比は、最大で［数１１］式で表される。 The ratio of the value of the formula [Equation 9] to the value of the formula [Equation 10] is expressed by the formula [Equation 11] at the maximum.

ここで、｜ｆ（Ｄ）−ｆ（Ｄ′）｜≦Δｆであるから、［数１１］式の値は、ｅｘｐ（ε）以下である。したがって、ε−差分プライバシを満たす。 Here, since | f (D) −f (D ′) | ≦ Δf, the value of the equation [Equation 11] is exp (ε) or less. Therefore, it satisfies ε-difference privacy.

次に、Ｆ（Ｄ）の値がｆ_ｍａｘ以上となる場合を考える。このとき、Ａ（Ｄ）の出力値はｆ_ｍａｘに制限される。Ａ（Ｄ）の出力がｆ_ｍａｘとなる確率は、次の［数１２］式で表される。 Next, consider the case where the value of F (D) is f _{max or more.} At this time, the output value of A (D) is limited to _{f max.} The probability that the output of A (D) _{becomes f max} is expressed by the following equation [Equation 12].

同様に、Ａ（Ｄ′）の出力値がｆ_ｍａｘとなる確率は、次の［数１３］式で表される。 Similarly, the probability that the output value of A (D') _{becomes f max} is expressed by the following equation [Equation 13].

［数１２］式の値と、［数１３］式の値の比は、最大で［数１４］式で表される。 The ratio of the value of the formula [Equation 12] to the value of the formula [Equation 13] is expressed by the formula [Equation 14] at the maximum.

ここで、｜ｆ（Ｄ）−ｆ（Ｄ′）｜≦Δｆであるから、［数１４］式の値は、ｅｘｐ（ε）以下である。したがって、ε−差分プライバシを満たす。
このように誤差を最大値と最小値の閾値に制限することがε−差分プライバシを満たすことは、全てのパラメータについて成立する。したがって、本実施の形態のように各パラメータの誤差を閾値で制限することで、ε−差分プライバシが成り立つ。 Here, since | f (D) −f (D ′) | ≦ Δf, the value of the equation [Equation 14] is exp (ε) or less. Therefore, it satisfies ε-difference privacy.
Limiting the error to the maximum and minimum thresholds in this way satisfies the ε-differential privacy holds for all parameters. Therefore, by limiting the error of each parameter with a threshold value as in the present embodiment, ε-differential privacy is established.

図９は、ここまで数式を用いて説明した、誤差を最大値と最小値の閾値に制限する処理の概略を示すものである。図９に示すように、例えばあるパラメータが取り得る値の範囲が“０”以上“１”以下であり、ある時点でのパラメータ値が０．８であるとする（グローバルセンシティビティは、最大値“１”と最小値“０”の差）。そして、このパラメータ値“０．８”に誤差を付与して、誤差付与済のパラメータ値が“１．１”になったとき、パラメータ値を閾値の範囲の上限値である“１”に制限する処理が行われる。
なお、この図９に示す例は、パラメータを閾値で制限する概略を非常に簡略化して示すものであり、実際の閾値に制限する処理は、ここまで数式を参照して説明した様々な条件を考慮して行われるものである。 FIG. 9 shows an outline of the process of limiting the error to the threshold value of the maximum value and the minimum value, which has been described by using the mathematical formula so far. As shown in FIG. 9, for example, it is assumed that the range of values that a certain parameter can take is "0" or more and "1" or less, and the parameter value at a certain point in time is 0.8 (global sensitivity is the maximum value). Difference between "1" and minimum value "0"). Then, an error is added to this parameter value "0.8", and when the parameter value to which the error has been added becomes "1.1", the parameter value is limited to "1" which is the upper limit value of the threshold range. Processing is performed.
In addition, the example shown in FIG. 9 shows the outline of limiting the parameter by the threshold value in a very simplified manner, and the process of limiting the parameter to the actual threshold value includes various conditions described with reference to the mathematical formulas so far. It is done in consideration.

［実データで評価した例］
図５は、本実施の形態の処理を、評価用のデータセットに対して実行した場合の例を示す。ここでは、評価用のデータセットとして、プライバシ保護データマイニングの分野で広く利用されている、［アダルトデータセット（Adult data set）］を利用する。［アダルトデータセット］は、１５種類の属性（年齢、性別、人種、年収、など）から構成されており、欠損値を含むレコードを除外して、４５，２２２レコードから成る。年収の属性は、各レコードの人物の年収が、５万ドルを超えているか否かの２値を取る。
そして、年収を除く１４の属性から、年収が５万ドルを超えているか否かを予測する深層学習システムを構築する。 [Example of evaluation using actual data]
FIG. 5 shows an example when the processing of the present embodiment is executed on the data set for evaluation. Here, [Adult data set], which is widely used in the field of privacy protection data mining, is used as a data set for evaluation. The [Adult Data Set] is composed of 15 types of attributes (age, gender, race, annual income, etc.), and is composed of 45,222 records excluding records including missing values. The annual income attribute takes two values as to whether or not the annual income of a person on each record exceeds $ 50,000.
Then, from 14 attributes excluding the annual income, a deep learning system that predicts whether or not the annual income exceeds 50,000 dollars will be constructed.

まず、差分プライバシを満たすような匿名化を行わない、生データに対して事前実験を行い、深層学習モデルの精度が高くなるような深層学習アルゴリズムの構造を決定した。学習率は０．０１、バッチサイズは５０、エポック数は５００、正則項は０．００１、中間層の数は４（入力層、出力層を含めると、全部で５層）が良い結果を出した。 First, we conducted a preliminary experiment on raw data without anonymization to satisfy the differential privacy, and determined the structure of the deep learning algorithm so that the accuracy of the deep learning model would be high. Good results are obtained when the learning rate is 0.01, the batch size is 50, the number of epochs is 500, the regular term is 0.001, and the number of intermediate layers is 4 (5 layers in total including the input layer and output layer). did.

ここでは、１０分割交差検定を行って、差分プライバシを満たす匿名化を行うと共に、その匿名化を行う際に、誤差の最大値と最小値を閾値に制限する処理を行った場合の匿名化モデルの精度を計測した。この例では、精度を評価する手法として、手法［ａｃｃｕｒａｃｙ］と手法［ｆ−ｍｅａｓｕｒｅ］を用いた。１０分割交差検定は、データセットを９：１の比率で２つに分け、比率９の方のデータをトレーニングデータとし、比率１のデータをテストデータとする。すなわち、比率９のトレーニングデータを使って学習を行い、比率１のテストデータから、給料を除く１４種類の属性を入力として学習済みの深層学習モデルに投入して、給料を予測する処理を行う。そして、その予測結果と、実際の値を比較して評価を行う。この評価を１０回行うようにして、各レコードが一度ずつテストデータに含まれるようにする。 Here, a 10-fold cross-validation test is performed to perform anonymization that satisfies the differential privacy, and when the anonymization is performed, an anonymization model is performed when the maximum and minimum values of the error are limited to the threshold value. The accuracy of was measured. In this example, the method [accuracy] and the method [f-meter] were used as methods for evaluating the accuracy. In the 10-fold cross-validation test, the data set is divided into two at a ratio of 9: 1, the data of the ratio 9 is used as training data, and the data of ratio 1 is used as test data. That is, learning is performed using the training data of the ratio 9, and 14 types of attributes excluding the salary are input into the trained deep learning model from the test data of the ratio 1 to perform a process of predicting the salary. Then, the prediction result is compared with the actual value for evaluation. This evaluation is performed 10 times so that each record is included in the test data once.

手法［ａｃｃｕｒａｃｙ］と手法［ｆ−ｍｅａｓｕｒｅ］の２つの評価指標の値（図５の縦軸）は、いずれも０から１までの値であり、１に近いほど精度が高いことを示す。図５の横軸はデータセットの数（バッチサイズ）を示し、図５Ａ、図５Ｂ、図５Ｃは、それぞれε＝１、ε＝１０、ε＝１００の場合を示す。
例えば、図５Ｃに示す例では、手法［ａｃｃｕｒａｃｙ］での評価指標値が０．８５、手法［ｆ−ｍｅａｓｕｒｅ］の評価指標値が０．７９となり、いずれも良好な精度が確保されていることが分かる。 The values of the two evaluation indexes (vertical axis in FIG. 5) of the method [accuracy] and the method [f-mere] are both values from 0 to 1, and the closer to 1 the higher the accuracy. The horizontal axis of FIG. 5 indicates the number of data sets (batch size), and FIGS. 5A, 5B, and 5C show the cases of ε = 1, ε = 10, and ε = 100, respectively.
For example, in the example shown in FIG. 5C, the evaluation index value of the method [accuracy] is 0.85, and the evaluation index value of the method [f-mease] is 0.79, and good accuracy is ensured in both cases. I understand.

＜２．第２の実施の形態例＞
次に、本発明の第２の実施の形態例を、図６〜図８を参照して説明する。この第２の実施の形態例を説明する図６〜図８において、第１の実施の形態例で説明した図１〜図５と同一の構成及び処理については同一符号を付し、詳細な説明を省略する。 <2. Example of the second embodiment>
Next, an example of the second embodiment of the present invention will be described with reference to FIGS. 6 to 8. In FIGS. 6 to 8 for explaining the second embodiment, the same components and processes as those in FIGS. 1 to 5 described in the first embodiment are designated by the same reference numerals and will be described in detail. Is omitted.

［システム全体の構成］
図６は、第２の実施の形態例のプライバシ保護データ提供システムの構成を示す。
データベース１には、個人情報が含まれる多数の生データが蓄積され、データベース１に蓄積された生データが、深層学習処理部２０に供給される。深層学習処理部２０は、予め用意された深層学習アルゴリズムを適用した演算を行うと同時に、深層学習の演算時に、差分プライバシに基づく匿名化処理を施して、匿名化済みの深層学習モデルである、匿名化モデル４を得る。 [System-wide configuration]
FIG. 6 shows the configuration of the privacy protection data providing system of the second embodiment.
A large amount of raw data including personal information is accumulated in the database 1, and the raw data accumulated in the database 1 is supplied to the deep learning processing unit 20. The deep learning processing unit 20 is an anonymized deep learning model by performing an operation applying a deep learning algorithm prepared in advance and at the same time performing an anonymization process based on differential privacy at the time of the deep learning operation. Obtain anonymization model 4.

深層学習処理部２０が、差分プライバシに基づいて匿名化モデル４を得る際には、深層学習アルゴリズムで使用する重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値の変動量にラプラス分布に基づいて誤差を与えて、差分プライバシの処理を施す。但し、それぞれのパラメータ値の変動量にラプラス分布に基づいた誤差を与える際には、その誤差として、最大値及び最小値を示す閾値で制限するようにした。
ラプラス分布に基づいた誤差を与えるということは、誤差を与えたパラメータ値が、確率的要素を含む値になり、結果的に匿名化が行われた匿名化モデル４が得られることになる。
深層学習処理部２０が深層学習時に差分プライバシに基づいて匿名化モデル４を得るための誤差の生成は、図２に示した匿名化処理部１０での処理と同様の構成で実現される。 When the deep learning processing unit 20 obtains the anonymization model 4 based on the differential privacy, the weight parameter and the bias parameter used in the deep learning algorithm are based on the Laplace distribution based on the fluctuation amount of each parameter value. Give an error and perform differential privacy processing. However, when giving an error based on the Laplace distribution to the fluctuation amount of each parameter value, the error is limited by the threshold value indicating the maximum value and the minimum value.
Giving an error based on the Laplace distribution means that the parameter value to which the error is given becomes a value including a stochastic element, and as a result, an anonymization model 4 in which anonymization is performed is obtained.
The generation of an error for the deep learning processing unit 20 to obtain the anonymization model 4 based on the differential privacy during deep learning is realized by the same configuration as the processing in the anonymization processing unit 10 shown in FIG.

［全体の処理の流れ］
図７は、第２の実施の形態例のプライバシ保護データ提供システムでの処理の流れを示すフローチャートである。
まず、深層学習処理部２０は、データベース１から生データを取得する（ステップＳ３１）。そして、深層学習処理部２０は、取得した生データのパラメータの変動量に対して、グローバルセンシティビティによる制限を設定したラプラス分布に基づく誤差の付与を行いながら、予め用意された深層学習アルゴリズムを適用して深層学習を行う（ステップＳ３２）。このときには、深層学習を行いながら逐次的に、パラメータの変動量のグローバルセンシティビティを計算する。パラメータの変動量のグローバルセンシティビティを計算することで、グローバルセンシティビティとプライバシ指標「ε」からラプラス分布が決まり、ラプラス分布で誤差を与えることで、匿名化が行われる。そして、深層学習処理の結果として、匿名化モデルを取得し（ステップＳ３３）、得られた匿名化モデルをデータ出力部１９から出力する。 [Overall processing flow]
FIG. 7 is a flowchart showing a processing flow in the privacy protection data providing system of the second embodiment.
First, the deep learning processing unit 20 acquires raw data from the database 1 (step S31). Then, the deep learning processing unit 20 applies a deep learning algorithm prepared in advance while adding an error based on the Laplace distribution in which the limit by the global sensitivity is set to the fluctuation amount of the parameters of the acquired raw data. Then, deep learning is performed (step S32). At this time, the global sensitivity of the fluctuation amount of the parameter is calculated sequentially while performing deep learning. By calculating the global sensitivity of the fluctuation amount of the parameter, the Laplace distribution is determined from the global sensitivity and the privacy index "ε", and anonymization is performed by giving an error in the Laplace distribution. Then, as a result of the deep learning process, an anonymization model is acquired (step S33), and the obtained anonymization model is output from the data output unit 19.

ステップＳ３２において、匿名化処理の制限に使用される閾値は、深層学習処理部２０における、重みパラメータの変動量の最大値及び最小値を示す閾値と、バイアスパラメータの変動量の最大値及び最小値を示す閾値である。 In step S32, the threshold values used for limiting the anonymization process are the threshold value indicating the maximum and minimum values of the fluctuation amount of the weight parameter in the deep learning processing unit 20, and the maximum and minimum values of the fluctuation amount of the bias parameter. It is a threshold value indicating.

［深層学習の詳細］
次に、ここまで説明したステップＳ３１〜Ｓ３３の各処理の詳細について説明する。
本実施の形態例では、活性化関数と誤差関数を事前に決めて、匿名化された深層学習を行う。
例えば、ｆ（ｘ）＝ｍａｘ（０；ｘ）で定義されるＲｅＬＵが、深層学習の最終層を除く活性化関数として広く利用されている。
深層学習の利用目的として、カテゴリ分類の場合、最終層の活性化関数（Ｆ（Ｌ））
としてソフトマックス関数が、また、誤差関数としてクロスエントロピー誤差関数が広く利用されている。
ソフトマックス関数は、次の［数１５］式のように定義される。 [Details of deep learning]
Next, the details of each process of steps S31 to S33 described so far will be described.
In the embodiment of the present embodiment, the activation function and the error function are determined in advance, and anonymized deep learning is performed.
For example, ReLU defined by f (x) = max (0; x) is widely used as an activation function excluding the final layer of deep learning.
For the purpose of using deep learning, in the case of categorization, the activation function of the final layer (F (L))
The softmax function is widely used as an error function, and the cross entropy error function is widely used as an error function.
The softmax function is defined as the following equation [Equation 15].

また、クロスエントロピー誤差関数は、次の［数１６］式のように定義される。 Further, the cross entropy error function is defined as the following equation [Equation 16].

ここでは、匿名化された深層学習を行う場合、深層学習を行う最終層を除く各層は、活性化関数ReLUを、最終層の活性化関数としてソフトマックス関数を、誤差関数としてクロスエントロピー誤差関数を利用する。
最終層の活性化関数がソフトマックス関数であり、かつ、誤差関数がクロスエントロピー誤差関数の場合、誤差信号δ_ｊ（Ｌ） for ｊ＝１，・・・，ｎ^(L)の値は、次の［数１７］式に示すように計算される。 Here, when performing anonymized deep learning, each layer except the final layer for deep learning uses the activation function ReLU, the softmax function as the activation function of the final layer, and the cross entropy error function as the error function. Use.
When the activation function of the final layer is the softmax function and the error function is the cross entropy error function, the values of the error signals δ _j (L) for j = 1, ···, n ^(L) are as follows. It is calculated as shown in the formula [Equation 17] of.

［数１７］式において、ｙ_ｊ ^（Ｌ）はノードＮ_ｊ ^（Ｌ）の出力値を表し、ｔ_ｊ ^（Ｌ）はノードＮ_j ^（Ｌ）の目標出力値を表す。
最終層以外の層において活性化関数ReLUを使った場合、最終層以外の各ノードの誤差信号δ_ｊ ^（ｌ）＝１，・・・，Ｌ−１は次の［数１８］式で計算される。 In the equation [Equation 17], y _j ^(L) represents the output value of the node N _j ^(L) _{, and t j} ^(L) represents the target output value of the node N _j ^(L).
When the activation function ReLU is used in a layer other than the final layer, the error signals δ _j ^(l) = 1, ···, L-1 of each node other than the final layer are calculated by the following equation [Equation 18]. To.

ｘ_ｊ ^（１）の値として取り得る範囲は、［ｂ_ｊ ^（１）＋Σ_ｉｍｉｎ（ｗ_ｉ，ｊ ^（１），０），ｂ_ｊ ^（１）＋Σ_ｉｍａｘ（ｗ_ｉ，ｊ ^（１），０）］である。また、ｘ_ｊ ^（２）の値として取り得る範囲は、［ｂ_ｊ ^（２）＋Σ_ｉ（ｂ_ｉ ^（１）＋Σ_ｋｍａｘ（ｗ_ｋ，ｉ ^（１），０））ｍｉｎ（ｗ_ｉ，ｊ ^（２），０），ｂ_ｊ ^（２）＋Σ_ｉ（ｂ_ｉ ^（１）＋Σ_ｋｍａｘ（ｗ_ｋ，ｉ ^（１），０））ｍａｘ（ｗ_ｉ，ｊ ^（２），０）］となる。深層学習では、ｘ_ｊ ^（ｌ） for ｌ＝１，・・・，Ｌは、次の［数１９］式で計算される。 The range that can be taken as the value of _{x j} ⁽¹⁾ _{is [b j} ⁽¹⁾ + Σ _i min (wi _{, j} ⁽¹⁾ , 0), b _j ⁽¹⁾ + Σ _i max (wi _{, j} ^(1)). , 0)]. _Also, the possible range as the value of ^{x j (2)} _{^{is, [b j (2) +}} Σ i (b i (1) + Σ k max (w k, i (1), 0)) min (w i, j ^{_{^{(2), 0), b}}} j (2) + Σ i (b i (1) + Σ k max (w k, i (1), 0)) max (w i, j (2), a 0)] .. In deep learning, x _j ^(l) for l = 1, ..., L is calculated by the following equation [Equation 19].

ここで、ｍｉｎ（ｙ_ｉ ^（０））＝０であり、ｍａｘ（ｙ_ｉ ^（０））＝１である。これは、深層学習の第１層目への入力値を０以上１以下の範囲に正規化しているためである。また、最終層以外の層では、活性化関数ReLUを使っているので、ｌ＝１，・・・，Ｌ−１において、ｙ_ｊ ^（ｌ）は、次の［数２０］式によって計算される。 Here, min (y _i ⁽⁰⁾ ) = 0 and max (y _i ⁽⁰⁾ ) = 1. This is because the input value to the first layer of deep learning is normalized to the range of 0 or more and 1 or less. Further, since the activation function ReLU is used in the layers other than the final layer, y _j ^(l) is calculated by the following equation [Equation 20] at l = 1, ..., L-1. ..

これによって、ｍａｘ（ｙ_ｊ ^（ｌ））の値は、常に０以上であることがわかる。
次に、誤差信号δ_ｊ ^（ｌ）の取り得る値の範囲を計算する。深層学習モデルの出力値の範囲は、−１から１までであるので、次の［数２１］式のように定義される。 From this, it can be seen that the value of _{max (y j} ^{(l)) is always 0 or more.}
Next, the range of possible values of the error signal δ _j ^{(l) is calculated.} Since the range of the output value of the deep learning model is from -1 to 1, it is defined as the following equation [Equation 21].

また、ｌ＝１，・・・，Ｌ−１について、次の［数２２］式で示される。ここで、全てのｊとｌについて、ｍｉｎ（δ_ｊ ^（ｌ））であり、ｍａｘ（δ_ｊ ^（ｌ））≧０である。 Further, l = 1, ..., L-1 is expressed by the following equation [Equation 22]. Here, for all j and l, min (δ _j ^(l) ) and max (δ _j ^(l) ) ≧ 0.

最終的には、次の［数２３］式が得られる。 Finally, the following equation [Equation 23] is obtained.

ｂ_ｊ ^（ｌ）については、次の［数２４］式で示される。 b _j ^(l) is expressed by the following equation [Equation 24].

また、ｌ＝１，・・・，Ｌ−１について、次の［数２５］式で示される。 Further, l = 1, ..., L-1 is expressed by the following equation [Equation 25].

既に述べたように、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）と、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）に基づいて、重みパラメータとバイアスパラメータを、［数５］式と［数６］式により更新する。つまり、データ入力ごとに毎回、重みパラメータとバイアスパラメータを更新する。
ここで本実施の形態例では、このときの変動量にラプラス分布に基づく誤差を与える。重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）と、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）についても、値の閾値を設定する。 As already mentioned, the variation amount [Delta] w _ij of the weighting parameters _^(l), based on the bias parameters of the variation amount [Delta] b j _^(l), the weighting parameters and the bias parameter, [Expression 5] equation [6] where Update by. That is, the weight and bias parameters are updated each time data is entered.
Here, in the example of the present embodiment, an error based on the Laplace distribution is given to the fluctuation amount at this time. A variation amount [Delta] w _ij of the weighting parameters ^(l), for a bias parameter variation amount Δb _j ^(l), sets the threshold value.

ここでは、Δｗ_ｍａｘとΔｗ_ｍｉｎを、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）の最大値と最小値とする。また、Δｂ_ｍａｘとΔｂ_ｍｉｎを、バイアスパラメータΔｂ_ｊ ^（ｌ）の最大値と最小値とする。 Here, Δw _max and Δw _min are the maximum and minimum values of the fluctuation amount Δw _ij ^{(l) of the weight parameter.} Further, Δb _max and Δb _min are set as the maximum and minimum values of the bias parameters Δb _j ^(l).

また、深層学習のエポック数をＥとおく。各バッチに対して学習を行う際に、それぞれのｗ_ｉｊ ^（ｌ）とｂ_ｊ ^（ｌ）に対して、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）を、ｍｉｎ（Δｗ_ｍａｘ，ｍｉｎ（Δｗ_ｍａｘ，ｗ_ｉｊ ^（ｌ）＋Lap（（Δｗ_ｍａｘ−Δｗ_ｍｉｎ）・Ｅ／ε）））に設定する。また、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）をｍｉｎ（Δｂ_ｍａｘ，ｍａｘ（Δｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（Δｂ_ｍａｘ−Δｂ_ｍｉｎ）・Ｅ／ε）））に設定する。 Also, let E be the number of epochs for deep learning. When performing learning for each batch, for each _w ^{ij (l)} and _b ^{j (l),} the variation amount [Delta] w _ij of the weighting parameters ^{_{(l), min (Δw max}} , min (Δw max, w _ij ^(l) + Lap ((Δw _max −Δw _min ) ・ E / ε)))). Further, the fluctuation amount Δb _j ^(l) of the bias parameter is set to min (Δb _max , max (Δb _min , b _j ^(l) + Lap ((Δb _{max −} _{Δb min} ) · E / ε))).

［閾値を設定したときにε−差分プライバシを満たすことの説明］
次に、深層学習を行う際に、パラメータを閾値（最大値,最小値）で誤差を制限した匿名モデルが、ε−差分プライバシを満たしたものであることを説明する。
各重みパラメータとバイアスパラメータは、［数５］式と［数６］式に基づいて更新される。［数５］式と［数６］式において、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）とバイアスパラメータの変動量Δｂ_ｊ ^（ｌ）は学習の入力値に依存して変わるが、それ以外の値は入力値に依存しない。したがって、第１の実施の形態で、閾値を設定したときにε−差分プライバシを満たすことを証明した場合と同様に、Δｗ_ｉｊ ^（ｌ）をｍｉｎ（Δｗ_ｍａｘ，ｍａｘ（Δｗ_ｍｉｎ，ｗ_ｉｊ ^（ｌ）＋Lap（（Δｗ_ｍａｘ−Δｗ_ｍｉｎ）・Ｅ／ε）））に設定し、また、Δｂ_ｊ ^（ｌ）をｍｉｎ（Δｂ_ｍａｘ，ｍａｘ（Δｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（Δｂ_ｍａｘ−Δｂ_ｍｉｎ）・Ｅ／ε)))に設定することで、各エポックのイテレーションは、パラメータベース（ε／Ｅ）−差分プライバシを満たす。
全体でＥエポックあるので、次に説明する証明より、最終的にε−差分プライバシを満たす。 [Explanation of satisfying ε-differential privacy when threshold is set]
Next, it will be described that the anonymous model in which the error is limited by the threshold value (maximum value, minimum value) of the parameters when performing deep learning satisfies the ε-differential privacy.
Each weight parameter and bias parameter is updated based on the [Equation 5] and [Equation 6] equations. In Equation 5 equation [6] where it variation [Delta] w _ij of the weighting parameters _^(l) and the bias parameter variation amount [Delta] b j _^(l) will vary depending on the input value of the learning, other values Does not depend on the input value. Thus, in the first embodiment, as in the case of proving that satisfying ε- difference privacy when setting the threshold value, [Delta] w _ij ^(l) a _{min (Δw max, max (Δw} min, w ij ( ^l) + Lap ((Δw _max −Δw _min ) · E / ε))), and set Δb _j ^(l) to min (Δb _max , max (Δb _min , b _j ^(l)) + Lap ((Δb _max)) By setting −Δb _min ) · E / ε)))), the iteration of each epoch satisfies the parameter base (ε / E) − differential privacy.
Since there is an E epoch as a whole, the ε-differential privacy is finally satisfied from the proof described below.

ランダムメカニズムＡが、ｄ個のランダムメカニズムＡ_１，・・・，Ａ_ｄから成り立っており、これを１回ずつ続けて実施するものとする。ここでは、ｉ≧２において、Ａ_ｉは入力としてＡ_ｉ−１の出力値を取る。Ａ_ｄの出力値が、Ａの出力値となる。
各Ａ_ｉは、パラメータベースε_ｉ−差分プライバシを満たすものとする。このとき、Ａはパラメータベース（Σ_ｉ＝１ ^ｄε_ｉ）の差分プライバシを実現する。 Random mechanism A is, d number of random mechanism A _{1, ···,} which consists of A _d, shall be carried out continues to this once. Here, when i ≧ 2, A _i takes the output value _{of A i-1} as an input. _{The output value of Ad} becomes the output value of A.
Each A _i shall satisfy the parameter base ε _i − differential privacy. At this time, A _{realizes a parameter-based (Σ i = 1} ^d ε _i ) differential privacy.

ランダムメカニズムＡは、ｄ個のランダムメカニズムＡ_１，・・・，Ａ_ｄから成り立っており、これを１回ずつ続けて実施するものとする。ｉ≧２において、Ａ_ｉは入力としてＡ_ｉ−１の出力値を取る。Ａ_ｄの出力値が、Ａの出力値となる。ここで、各Ａ_ｉは、ε_ｉ−差分プライバシを満たすものとする。このとき、ランダムメカニズムＡは（Σ_ｉ＝１ ^ｄε_ｉ）−差分プライバシを実現する。
この処理は各パラメータに対して実行されるので、ここでのランダムメカニズムＡは、パラメータベース（Σ_ｉ＝１ ^ｄε_ｉ）−差分プライバシを実現する。 Random mechanism A is, d number of random mechanism A _{1, ···,} which consists of A _d, shall be carried out continues to this once. When i ≧ 2, A _i takes the output value _{of A i-1} as an input. _{The output value of Ad} becomes the output value of A. Here, it is assumed that each A _i satisfies ε _i − differential privacy. At this time, the random mechanism A _{realizes (Σ i = 1} ^d ε _i ) -differential privacy.
Since this process is executed for each parameter, the random mechanism A here _{realizes parameter base (Σ i = 1} ^d ε _i ) -differential privacy.

図１０は、第２の実施の形態例での、誤差を最大値と最小値の閾値に制限する処理の概略を示すものである。図１０に示すように、例えばあるパラメータの変動量として取り得る最大の範囲が“０”以上“１”以下であり、ある時点での変動量が０．６であるとする。そして、学習しながら逐次的に算出された閾値の範囲が、“０．３”以上“０．７”以下であるとする（この場合のグローバルセンシティビティは、０．７―０．３＝０．４）。この閾値の範囲（グローバルセンシティビティ）とプライバシ指標「ε」からラプラス分布が決まる。ラプラス分布で誤差を与える処理が行われる。なお、グローバルセンシティビティ（Δｆ）は、既に説明した［数８］式で計算されるものである。
ここで、図１０に示すように、パラメータの変動量“０．５”に誤差を付与して、誤差付与済のパラメータの変動量が“０．１”になったとき、その時点での閾値の範囲の下限値である“０．３”に制限する処理が行われる。ラプラス分布はグローバルセンシティビティとプライバシ指標「ε」から計算されるため、グローバルセンシティビティの値を小さく（つまり閾値の幅を小さく）することで、ラプラス分布の誤差を小さくすることができ、深層学習の精度の向上につながる。
この図１０に示す例についても、図９の例と同様に、パラメータの変動量を閾値で制限する概略を非常に簡略化して示すものであり、実際の閾値に制限する処理は、ここまで数式を参照して説明した様々な条件を考慮して行われるものである。
また、第２の実施の形態例の場合でも、グローバルセンシティビティ（Δｆ）が、パラメータの変動量として取り得る最大の範囲と一致する場合には、図９に示す状態で閾値の制限が行われることになる。 FIG. 10 shows an outline of the process of limiting the error to the threshold values of the maximum value and the minimum value in the second embodiment. As shown in FIG. 10, for example, it is assumed that the maximum range that can be taken as the fluctuation amount of a certain parameter is “0” or more and “1” or less, and the fluctuation amount at a certain point in time is 0.6. Then, it is assumed that the range of the threshold values calculated sequentially while learning is "0.3" or more and "0.7" or less (the global sensitivity in this case is 0.7-0.3 = 0). .4). The Laplace distribution is determined from this threshold range (global sensitivity) and the privacy index "ε". Processing that gives an error in the Laplace distribution is performed. The global sensitivity (Δf) is calculated by the equation [Equation 8] already described.
Here, as shown in FIG. 10, when an error is added to the parameter fluctuation amount “0.5” and the error-added parameter fluctuation amount becomes “0.1”, the threshold value at that time is reached. The process of limiting to "0.3", which is the lower limit of the range of, is performed. Since the Laplace distribution is calculated from the global sensitivity and privacy index "ε", the error of the Laplace distribution can be reduced by reducing the value of global sensitivity (that is, reducing the threshold width), and deep learning. It leads to the improvement of the accuracy of.
Similar to the example of FIG. 9, the example shown in FIG. 10 also shows the outline of limiting the fluctuation amount of the parameter by the threshold value in a very simplified manner. It is carried out in consideration of various conditions explained with reference to.
Further, even in the case of the second embodiment, if the global sensitivity (Δf) matches the maximum range that can be taken as the fluctuation amount of the parameter, the threshold value is limited in the state shown in FIG. It will be.

［実データで評価した例］
図８は、本実施の形態の処理を、評価用のデータセットに対して実行した場合の例を示す。この図８の例は、第１の実施の形態で説明した図５での評価と同じ条件で行ったものである。
図８の横軸はデータセットの数（バッチサイズ）を示し、図８Ａ、図８Ｂ、図８Ｃは、それぞれε＝１、ε＝１０、ε＝１００の場合を示す。
図８Ａ、図８Ｂ、図８Ｃに示すように、いずれの場合でも良好な精度が確保されていることが分かる。ここで、図５（第１の実施の形態例）と、図８（第２の実施の形態例）とを比較すると分かるように、εの値が小さいときは、第１の実施の形態例の方が、高い精度が得られる。一方、εの値が大きいときは、第２の実施の形態例の方が、高い精度が得られる。但し、この結果は使用するデータセットによって変わるものであり、いずれの実施の形態を適用するのが好ましいかは、使用するデータセットによって異なる。 [Example of evaluation using actual data]
FIG. 8 shows an example in which the processing of the present embodiment is executed on the data set for evaluation. The example of FIG. 8 is performed under the same conditions as the evaluation of FIG. 5 described in the first embodiment.
The horizontal axis of FIG. 8 indicates the number of data sets (batch size), and FIGS. 8A, 8B, and 8C show the cases of ε = 1, ε = 10, and ε = 100, respectively.
As shown in FIGS. 8A, 8B, and 8C, it can be seen that good accuracy is ensured in all cases. Here, as can be seen by comparing FIG. 5 (example of the first embodiment) and FIG. 8 (example of the second embodiment), when the value of ε is small, the example of the first embodiment Higher accuracy can be obtained. On the other hand, when the value of ε is large, higher accuracy can be obtained in the second embodiment. However, this result varies depending on the data set used, and which embodiment is preferably applied depends on the data set used.

なお、図５及び図８に示す評価例では、予測した年収が５万ドル以下で、実際の年収が５万ドル以下である場合の回数をＴＮ、予測した年収が５万ドル以下で、実際の年収が５万ドルを超えている場合の回数をＦＮとした。また、予測した年収が５万ドルを超えていて、実際に５万ドルを超えている場合の回数をＴＰ、予測した年収が５万ドルを超えていて、実際の年収が５万ドル以下である場合の回数をＦＰとした。
このとき、手法［ａｃｃｕｒａｃｙ］では、［数２６］式での評価を行う。また、手法［ｆ−ｍｅａｓｕｒｅ］では、［数２７］式での評価を行う。 In the evaluation examples shown in FIGS. 5 and 8, the number of times when the predicted annual income is 50,000 dollars or less and the actual annual income is 50,000 dollars or less is TN, and the predicted annual income is 50,000 dollars or less. The number of times when the annual income exceeds 50,000 dollars is defined as FN. In addition, the number of times when the predicted annual income exceeds 50,000 dollars and actually exceeds 50,000 dollars is TP, the predicted annual income exceeds 50,000 dollars, and the actual annual income is 50,000 dollars or less. The number of times in a certain case was defined as FP.
At this time, in the method [accuracy], the evaluation is performed by the equation [Equation 26]. Further, in the method [f-meter], the evaluation is performed by the equation [Equation 27].

以上説明したように、本発明の各実施の形態によると、ラプラス分布に基づいた誤差を与えて匿名化を行う際に、その誤差の最大値と最小値を閾値で制限するようにしたことで、匿名化を行う際に与える誤差を一定の範囲に制限することができ、誤差が少ない適切な匿名化を行うことができる。その結果、深層学習モデルの精度低下を軽減できるようになる。 As described above, according to each embodiment of the present invention, when anonymization is performed by giving an error based on the Laplace distribution, the maximum value and the minimum value of the error are limited by a threshold value. , The error given when performing anonymization can be limited to a certain range, and appropriate anonymization with less error can be performed. As a result, it becomes possible to reduce the decrease in accuracy of the deep learning model.

なお、ここまで説明した数式は、本発明の各実施の形態を適用する場合の好適な一例を示したものであり、本発明は、これらの数式で説明した処理に限定されるものではない。 The mathematical formulas described so far show suitable examples when applying each embodiment of the present invention, and the present invention is not limited to the processes described in these mathematical formulas.

１…データベース（生データ）、２…深層学習処理部、３…深層学習モデル、４…匿名化モデル（匿名化済の深層学習モデル）、１０…匿名化処理部（閾値制限付き差分プライバシ適用）、１１…データ入力部、１２…ε入力部、１３…パラメータ構造決定部、１４…パラメータ初期値決定部、１５…閾値決定部、１６…閾値超え判定部、１７…閾値計算部、１８…匿名化演算部、１９…データ出力部、２０…機械学習処理部（差分プライバシ適用） 1 ... database (raw data), 2 ... deep learning processing unit, 3 ... deep learning model, 4 ... anonymized model (anonymized deep learning model), 10 ... anonymized processing unit (differential privacy application with threshold limitation) , 11 ... data input unit, 12 ... ε input unit, 13 ... parameter structure determination unit, 14 ... parameter initial value determination unit, 15 ... threshold determination unit, 16 ... threshold exceeding determination unit, 17 ... threshold calculation unit, 18 ... anonymous Chemical calculation unit, 19 ... Data output unit, 20 ... Machine learning processing unit (differential privacy applied)

Claims

A deep learning processing unit that applies a deep learning algorithm to the raw data in the database to obtain a deep learning model,
A privacy protection data providing system including an anonymization processing unit that obtains an anonymization model by performing anonymization processing based on differential privacy on the deep learning model obtained by the deep learning processing unit.
The anonymization processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter included in the deep learning model, and each parameter to which the error is given based on the Laplace distribution , A privacy protection data providing system, characterized in that when the range of the thresholds indicated by the maximum value and the minimum value is exceeded, the range of the thresholds is limited.

It is a privacy protection data providing system equipped with a deep learning processing unit that applies a deep learning algorithm to obtain an anonymous model that has been deep learned while performing anonymization processing based on differential privacy on the raw data in the database.
The deep learning processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter used in the calculation for obtaining the deep learning model, and gives an error based on the Laplace distribution. A privacy protection data providing system characterized in that when a parameter exceeds a range of thresholds indicated by a maximum value and a minimum value, the parameter is limited to the range of the threshold value.

When the deep learning processing unit obtains a deep learning model, it sequentially calculates global sensitivities and performs a process of acquiring the Laplace distribution based on the calculated global sensitivities.
The privacy protection data providing system according to claim 2, wherein an error is given based on the sequentially acquired Laplace distribution.