JP2018097467A

JP2018097467A - Privacy protection data providing system and privacy protection data providing method

Info

Publication number: JP2018097467A
Application number: JP2016239460A
Authority: JP
Inventors: 雄一清; Yuichi Sei; 拓史奥村; Takushi Okumura; 大須賀　昭彦; Akihiko Osuga; 昭彦大須賀
Original assignee: Mitsubishi Research Institute Inc; University of Electro Communications NUC
Current assignee: Mitsubishi Research Institute Inc; University of Electro Communications NUC
Priority date: 2016-12-09
Filing date: 2016-12-09
Publication date: 2018-06-21
Anticipated expiration: 2036-12-09
Also published as: JP6835559B2

Abstract

PROBLEM TO BE SOLVED: To acquire an accurate, suitable and anonymous deep layer learning model, regardless of types of data, when acquiring the anonymous deep layer learning model.SOLUTION: An error based on a Laplace distribution is given to a parameter value in a deep layer learning model in which a deep layer learning has been performed and when each parameter to which the error has been given on the basis of the Laplace distribution exceeds a range of thresholds indicated as a maximum value and a minimum value, each parameter is caused to be limited within the range of the thresholds for anonymization. Or an error based on the Laplace distribution is given to a parameter value used in a calculation at the time of the calculation to obtain the deep layer learning model, and when each parameter to which the error has been given on the basis of the Laplace distribution exceeds the range of the thresholds indicated as the maximum value and the minimum value, it is caused to be limited within the range of the thresholds for anonymization.SELECTED DRAWING: Figure 3

Description

本発明は、プライバシ保護データ提供システム及びプライバシ保護データ提供方法に関する。 The present invention relates to a privacy protection data providing system and a privacy protection data providing method.

近年、個人データなどのプライバシ保護が必要なデータを公開する際に、差分プライバシと称される処理を施して、個々のデータのプライバシを確保した上で、適正なデータ解析が実行できるようにしたものが提案されている。 In recent years, when publishing data that requires privacy protection, such as personal data, a process called differential privacy has been applied to ensure the privacy of individual data and to enable appropriate data analysis. Things have been proposed.

データに対して差分プライバシの処理を施す際には、プライバシの保護レベルが、「ε」で示されるプライバシ指標で示される。プライバシ指標「ε」の値が０に近づくほど、データの保護レベルが高く、プライバシ指標「ε」の値が大きいほど、データの保護レベルが低くなる。 When differential privacy processing is performed on data, the privacy protection level is indicated by a privacy index indicated by “ε”. The closer the value of the privacy index “ε” is to 0, the higher the data protection level is. The larger the value of the privacy index “ε” is, the lower the data protection level is.

具体的には、あるデータベースＤを匿名化して差分プライバシの処理を施す匿名学習アルゴリズムＡが存在するとき、この匿名学習アルゴリズムＡは、確率的要素を含むアルゴリズムになる。すなわち、データベースＤを、確率的要素を含む匿名学習アルゴリズムＡで匿名化したときには、確率的要素を含むために、処理を施す毎に異なる匿名化済データｓ１，ｓ２，・・・，ｓｎが得られる。ここで、データベースＤと、そのデータベースＤから１レコードだけ異なるデータとしたデータベースＤ′とを用意し、それぞれのデータベースＤ，Ｄ′の集合Ｓの特定のデータｓｉ（データｓｉはデータｓ１〜ｓｎのいずれか）になる確率の比が、プライバシ指標「ε」を使ったｅｘｐ（ε）以下になるとき、この匿名学習アルゴリズムＡは、差分プライバシを満たすアルゴリズムになる。 Specifically, when there is an anonymous learning algorithm A that anonymizes a certain database D and performs differential privacy processing, the anonymous learning algorithm A is an algorithm including a stochastic element. That is, when the database D is anonymized by the anonymous learning algorithm A including the probabilistic element, the anonymized data s1, s2,. It is done. Here, a database D and a database D ′ that is different from the database D by one record are prepared, and specific data si of the set S of the databases D and D ′ (the data si is the data s1 to sn). The anonymous learning algorithm A is an algorithm that satisfies the difference privacy when the ratio of the probability of becoming any one is equal to or less than exp (ε) using the privacy index “ε”.

この差分プライバシを満たす点を、より分かりやすく述べると、例えば、多数の個人情報からなる特定のデータベースＤに、ある任意の一人のデータを追加（又は削除）したものを、データベースＤ′とする。ここで、データベースＤを匿名学習アルゴリズムＡで差分プライバシの処理を施して匿名化した結果と、データベースＤ′を匿名学習アルゴリズムＡで差分プライバシの処理を施して匿名化した結果とが、ほとんど変わらないとき（つまり上述した閾値ｅｘｐ（ε）を超えないとき）、プライバシが守られた状態で、データベースＤが公開されたと言える。 The points satisfying this differential privacy will be described in a more easy-to-understand manner. For example, a database D ′ is obtained by adding (or deleting) data of an arbitrary person to a specific database D composed of a large number of personal information. Here, the result of anonymizing the database D with the anonymous learning algorithm A and the anonymization of the database D is almost the same as the result of anonymizing the database D ′ with the anonymous learning algorithm A and the difference privacy processing. Sometimes (that is, when the above-described threshold exp (ε) is not exceeded), it can be said that the database D has been released in a state where privacy is protected.

これは、データベースＤを構成する各データで特定される個人から見たとき、一人一人のデータの有無に関わらず、結果がほぼ同じであるため、プライバシが守られた状態と見なせることになる。言い換えると、データベースＤとデータベースＤ′のいずれであっても、結果が同じになることを意味している。
特許文献１には、差分プライバシを満たして、データを集計する手法の一例についての記載がある。 This can be regarded as a state in which privacy is protected because the results are almost the same regardless of the presence or absence of each person's data when viewed from the individual specified by each data constituting the database D. In other words, it means that the result is the same for either database D or database D ′.
Patent Document 1 describes an example of a method for totaling data while satisfying differential privacy.

特開２０１６−１２０７４号公報Japanese Patent Laid-Open No. 2006-12074

上述したように、差分プライバシの処理を施す匿名学習アルゴリズムを作成することで、データの匿名化が可能であるが、実際には、どのようなデータベース構成であっても、確率の比がｅｘｐ（ε）以下になる条件を満たして、かつニューラルモデルの精度が高くなるような機械学習を行う匿名学習アルゴリズムの作成は難しいという問題があった。 As described above, it is possible to anonymize data by creating an anonymous learning algorithm that performs differential privacy processing. Actually, however, the ratio of probabilities is exp ( There is a problem that it is difficult to create an anonymous learning algorithm that performs machine learning that satisfies the condition of [epsilon]) and that increases the accuracy of the neural model.

本発明は、匿名化された深層学習モデルを形成する際に、どのようなデータであっても、精度の高い好適な匿名化された深層学習モデルが得られるプライバシ保護データ提供システム及びプライバシ保護データ提供方法を提供することを目的とする。 The present invention provides a privacy protection data providing system and privacy protection data that can provide a highly accurate and suitable anonymized deep learning model for any data when forming an anonymized deep learning model An object is to provide a providing method.

本発明の一側面のプライバシ保護データ提供システムは、データベース内の生データに対して、深層学習アルゴリズムを適用して深層学習モデルを得る深層学習処理部と、深層学習処理部で得られた深層学習モデルに対して、差分プライバシに基づく匿名化処理を施して、匿名モデルを得る匿名化処理部とを備えたプライバシ保護データ提供システムである。
ここで、匿名化処理部は、深層学習モデルに含まれる重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 A privacy protection data providing system according to an aspect of the present invention includes a deep learning processing unit that obtains a deep learning model by applying a deep learning algorithm to raw data in a database, and deep learning obtained by the deep learning processing unit. A privacy protection data providing system including an anonymization processing unit that obtains an anonymous model by performing anonymization processing based on differential privacy for a model.
Here, the anonymization processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter included in the deep learning model, and each parameter that gives an error based on the Laplace distribution. However, when the threshold value range indicated by the maximum value and the minimum value is exceeded, the threshold value range is limited.

また、本発明の他の側面のプライバシ保護データ提供システムは、データベース内の生データに対して、差分プライバシに基づく匿名化処理を施しながら、深層学習アルゴリズムを適用して深層学習済の匿名モデルを得る深層学習処理部を備えたプライバシ保護データ提供システムである。
ここで、深層学習処理部は、深層学習モデルを得る演算時に使用する重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 In addition, the privacy protection data providing system according to another aspect of the present invention applies a deep learning algorithm to a deeply learned anonymous model while performing anonymization processing based on differential privacy for raw data in a database. A privacy protection data providing system including a deep learning processing unit.
Here, the deep learning processing unit gives an error based on the Laplace distribution to each parameter value and gives an error based on the Laplace distribution to the weight parameter and the bias parameter used in the calculation for obtaining the deep learning model. When each parameter exceeds the threshold range indicated by the maximum value and the minimum value, the parameter is limited to the threshold range.

本発明の一側面のプライバシ保護データ提供方法は、データベース内の生データに対して、深層学習アルゴリズムを適用して深層学習モデルを得る深層学習処理手順と、記深層学習処理手順で得られた深層学習モデルに対して、差分プライバシに基づく匿名化処理を施す匿名化処理手順と、を含む。
ここで、匿名化処理手順は、深層学習モデルに含まれる重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 The privacy protection data providing method according to one aspect of the present invention includes a deep learning processing procedure for obtaining a deep learning model by applying a deep learning algorithm to raw data in a database, and a deep layer obtained by the deep learning processing procedure. An anonymization processing procedure for performing an anonymization processing based on differential privacy for the learning model.
Here, the anonymization processing procedure gives each parameter value an error based on the Laplace distribution for each of the weight parameter and the bias parameter included in the deep learning model, and each parameter that gives the error based on the Laplace distribution. However, when the threshold value range indicated by the maximum value and the minimum value is exceeded, the threshold value range is limited.

本発明の他の側面のプライバシ保護データ提供方法は、データベース内の生データに対して、差分プライバシに基づく匿名化処理を施しながら、深層学習アルゴリズムを適用して深層学習済の匿名モデルを得る深層学習処理手順を含む。
ここで、深層学習処理手順は、深層学習モデルを得る演算時に使用する重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいた誤差を与えると共に、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことを特徴とする。 The privacy protection data providing method according to another aspect of the present invention provides a deep learning anonymous model by applying a deep learning algorithm to a raw data in a database while applying anonymization processing based on differential privacy. Includes a learning procedure.
Here, in the deep learning processing procedure, an error based on the Laplace distribution is given to each parameter value and an error is given based on the Laplace distribution to the weight parameter and the bias parameter used at the time of obtaining the deep learning model. When each parameter exceeds the threshold range indicated by the maximum value and the minimum value, the parameter is limited to the threshold range.

本発明によれば、ラプラス分布に基づいて誤差を与えた各パラメータが、最大値及び最小値で示される閾値の範囲を超えたとき、閾値の範囲に制限するようにしたことで、誤差を与えてデータの匿名化を行っても、データの変動範囲を適正な範囲に制限することができ、適切な匿名化ができるようになる。その結果、匿名化による深層学習モデルの精度低下を軽減できるようになる。 According to the present invention, when each parameter that gives an error based on the Laplace distribution exceeds the threshold range indicated by the maximum value and the minimum value, the error is given by limiting to the threshold range. Even if data anonymization is performed, the data fluctuation range can be limited to an appropriate range, and appropriate anonymization can be performed. As a result, the accuracy degradation of the deep learning model due to anonymization can be reduced.

本発明の第１の実施の形態例による処理システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the processing system by the 1st Example of this invention. 本発明の第１の実施の形態例による匿名化処理部内で、ラプラス分布に基づいた誤差を与える構成例を示すブロック図である。It is a block diagram which shows the structural example which gives the error based on the Laplace distribution within the anonymization process part by the 1st Example of this invention. 本発明の第１の実施の形態例による処理の流れの例を示すフローチャートである。It is a flowchart which shows the example of the flow of the process by the 1st Example of this invention. 本発明の第１の実施の形態例による深層学習の概要を示す説明図である。It is explanatory drawing which shows the outline | summary of the deep learning by the 1st Example of this invention. 本発明の第１の実施の形態例による実験例を示す説明図である。It is explanatory drawing which shows the experiment example by the 1st Example of this invention. 本発明の第２の実施の形態例による処理システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the processing system by the 2nd Example of this invention. 本発明の第２の実施の形態例による処理の流れの例を示すフローチャートである。It is a flowchart which shows the example of the flow of the process by the 2nd Example of this invention. 本発明の第２の実施の形態例による実験例を示す説明図である。It is explanatory drawing which shows the experiment example by the 2nd Embodiment of this invention. 本発明の各実施の形態例による誤差の付与と閾値への制限例（例１）の概略を示す説明図である。It is explanatory drawing which shows the outline of the example (Example 1) of the provision of the error by each Example of this invention, and the restriction | limiting to a threshold value. 本発明の各実施の形態例による誤差の付与と閾値への制限例（例２）の概略を示す説明図である。It is explanatory drawing which shows the outline of the restriction | limiting example (Example 2) of the provision of the error by each embodiment of this invention, and a threshold value.

＜１．第１の実施の形態例＞
以下、本発明の第１の実施の形態例を、図１〜図５を参照して説明する。 <1. First Embodiment>
Hereinafter, a first embodiment of the present invention will be described with reference to FIGS.

［システム全体の構成］
図１は、第１の実施の形態例のプライバシ保護データ提供システムの構成を示す。
データベース１には、個人情報が含まれる多数の生データが蓄積され、データベース１に蓄積された生データが、深層学習処理部２に供給される。深層学習処理部２は、予め用意された深層学習アルゴリズムを適用した演算を行い、生データを深層学習した深層学習モデル３を得る。 [Entire system configuration]
FIG. 1 shows a configuration of a privacy protection data providing system according to the first embodiment.
A large amount of raw data including personal information is stored in the database 1, and the raw data stored in the database 1 is supplied to the deep learning processing unit 2. The deep learning processing unit 2 performs a calculation using a deep learning algorithm prepared in advance, and obtains a deep learning model 3 obtained by deep learning of raw data.

そして、深層学習処理部２で得た深層学習モデル３が、匿名化処理部１０に供給される。匿名化処理部１０は、供給された深層学習モデル３に対して、差分プライバシに基づく匿名化処理を施して、匿名化済みの深層学習モデル４（以下、「匿名化モデル４」と称する）を得る。 Then, the deep learning model 3 obtained by the deep learning processing unit 2 is supplied to the anonymization processing unit 10. The anonymization processing unit 10 performs anonymization processing based on differential privacy on the supplied deep learning model 3 and anonymized deep learning model 4 (hereinafter referred to as “anonymization model 4”). obtain.

匿名化処理部１０が、差分プライバシに基づいて匿名化モデル４を得る際には、深層学習モデル３に含まれる重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値にラプラス分布に基づいて誤差を与えて、差分プライバシの処理を施す。但し、それぞれのパラメータ値にラプラス分布に基づいた誤差を与える際には、その誤差として、最大値及び最小値を示す閾値で制限するようにした。
ラプラス分布に基づいた誤差を与えるということは、誤差を与えたパラメータ値が、確率的要素を含む値になり、結果的に匿名化が行われた匿名化モデル４が得られることになる。 When the anonymization processing unit 10 obtains the anonymization model 4 based on differential privacy, an error based on the Laplace distribution is added to each parameter value for the weight parameter and the bias parameter included in the deep learning model 3. Given, the processing of differential privacy is performed. However, when an error based on the Laplace distribution is given to each parameter value, the error is limited by a threshold value indicating the maximum value and the minimum value.
Giving an error based on the Laplace distribution means that the parameter value giving the error becomes a value including a stochastic element, and as a result, the anonymization model 4 in which anonymization is performed is obtained.

［ε−差分プライバシの処理構成］
図２は、匿名化処理部１０の機能を示すブロック図である。
図２に示すように、匿名化処理部１０は、データ入力部１１、ε入力部１２、パラメータ構造決定部１３、パラメータ初期値決定部１４、閾値決定部１５、閾値超え判定部１６及び閾値計算部１７を備える。更に、匿名化処理部１０は、匿名化演算部１８及びデータ出力部１９を備える。 [Ε-differential privacy processing configuration]
FIG. 2 is a block diagram illustrating functions of the anonymization processing unit 10.
As shown in FIG. 2, the anonymization processing unit 10 includes a data input unit 11, an ε input unit 12, a parameter structure determination unit 13, a parameter initial value determination unit 14, a threshold value determination unit 15, a threshold value excess determination unit 16, and a threshold value calculation. The unit 17 is provided. Furthermore, the anonymization processing unit 10 includes an anonymization calculation unit 18 and a data output unit 19.

データ入力部１１には、深層学習モデルのデータが入力され、このデータが匿名化演算部１８に供給される。ε入力部１２には、差分プライバシの処理を行う際の指標「ε」が入力され、指標「ε」が、匿名化演算部１８に供給される。 Data of the deep learning model is input to the data input unit 11, and this data is supplied to the anonymization calculation unit 18. The index “ε” used when the differential privacy processing is performed is input to the ε input unit 12, and the index “ε” is supplied to the anonymization calculation unit 18.

パラメータ構造決定部１３は、深層学習モデル３のパラメータ構造を決める機能を有し、このパラメータ構造決定部１３で決定された深層学習モデル３のパラメータ構造が、匿名化演算部１８に供給される。なお、パラメータ構造決定部１３で決定されるパラメータ構造には、少なくとも重みパラメータとバイアスパラメータが含まれる。そして、匿名化演算部１８は、これら重みパラメータとバイアスパラメータに誤差を与える処理を行う。 The parameter structure determination unit 13 has a function of determining the parameter structure of the deep learning model 3, and the parameter structure of the deep learning model 3 determined by the parameter structure determination unit 13 is supplied to the anonymization calculation unit 18. The parameter structure determined by the parameter structure determination unit 13 includes at least a weight parameter and a bias parameter. Then, the anonymization calculation unit 18 performs a process of giving an error to these weight parameters and bias parameters.

パラメータ初期値決定部１４は、上述した重みパラメータとバイアスパラメータのパラメータ初期値を決定する。このパラメータ初期値は、匿名化演算部１８に供給され、匿名化演算部１８は、このパラメータ初期値を用いて、パラメータ構造決定部１３で決定されるパラメータ構造の初期値を決定する。 The parameter initial value determination unit 14 determines parameter initial values of the above-described weight parameter and bias parameter. The parameter initial value is supplied to the anonymization calculation unit 18, and the anonymization calculation unit 18 determines the initial value of the parameter structure determined by the parameter structure determination unit 13 using the parameter initial value.

閾値決定部１５は、ラプラス分布に基づいて得た誤差を設定する際の最大値と最小値を制限するための閾値を決定する。この閾値決定部１５における閾値の決定の際には、後述する閾値計算部１７での計算結果が利用される。
閾値超え判定部１６は、匿名化演算部１８が演算を行う際に、パラメータ構造決定部１３で決定した誤差値が、閾値決定部１５で決定した閾値（最大値又は最小値）を超えたか否かを判定する。 The threshold value determination unit 15 determines a threshold value for limiting the maximum value and the minimum value when setting the error obtained based on the Laplace distribution. When the threshold value is determined by the threshold value determination unit 15, the calculation result of the threshold value calculation unit 17 described later is used.
Whether the error value determined by the parameter structure determination unit 13 exceeds the threshold (maximum value or minimum value) determined by the threshold determination unit 15 when the anonymization calculation unit 18 performs the calculation. Determine whether.

閾値計算部１７は、閾値を設定するための計算を行い、計算結果を匿名化演算部１８に供給する。
匿名化演算部１８は、閾値超え判定部１６での判定結果が、閾値を超えていた場合には閾値を誤差値とする処理を行う。匿名化演算部１８で演算した結果は、データ出力部１９から出力される。 The threshold calculation unit 17 performs calculation for setting the threshold and supplies the calculation result to the anonymization calculation unit 18.
The anonymization calculation unit 18 performs processing using the threshold value as an error value when the determination result in the threshold value excess determination unit 16 exceeds the threshold value. The result calculated by the anonymization calculation unit 18 is output from the data output unit 19.

［全体の処理の流れ］
図３は、第１の実施の形態例のプライバシ保護データ提供システムでの処理の流れを示すフローチャートである。
まず、深層学習処理部２は、データベース１から生データを取得する（ステップＳ１１）。そして、深層学習処理部２は、取得した生データに対して、予め用意された深層学習アルゴリズムを適用して深層学習を行い（ステップＳ１２）、深層学習処理の結果として、深層学習済モデルを取得する（ステップＳ１３）。 [Overall process flow]
FIG. 3 is a flowchart showing a flow of processing in the privacy protection data providing system of the first exemplary embodiment.
First, the deep learning processing unit 2 acquires raw data from the database 1 (step S11). Then, the deep learning processing unit 2 performs deep learning on the acquired raw data by applying a deep learning algorithm prepared in advance (step S12), and acquires a deep learned model as a result of the deep learning processing. (Step S13).

次に、ステップＳ１３で取得した深層学習済モデルに対して、匿名化処理部１０が、匿名化処理を行う（ステップＳ１４）。この匿名化処理を行う際には、閾値による制限を設定した上で、ラプラス分布に基づく誤差の付与を行う。
なお、ステップＳ１４において、匿名化処理の制限に使用される閾値は、匿名化処理部１０における、重みパラメータの変動量の最大値及び最小値を示す閾値と、バイアスパラメータの変動量の最大値及び最小値を示す閾値である。これらの閾値の生成処理（ステップＳ２０）の詳細については数式を用いて後述する。
そして、匿名化処理部１０によるステップＳ１４での匿名化処理の実行で、匿名化モデルを取得し（ステップＳ１５）、得られた匿名化モデルをデータ出力部１９から出力する。 Next, the anonymization processing unit 10 performs anonymization processing on the deeply learned model acquired in step S13 (step S14). When this anonymization process is performed, an error is given based on the Laplace distribution after setting a limit based on a threshold.
In step S14, the threshold value used for the restriction of the anonymization process is the threshold value indicating the maximum value and the minimum value of the variation amount of the weight parameter in the anonymization processing unit 10, the maximum value of the variation amount of the bias parameter, and This is a threshold value indicating the minimum value. Details of the threshold generation processing (step S20) will be described later using mathematical expressions.
And anonymization model is acquired by execution of the anonymization process by step S14 by the anonymization process part 10 (step S15), and the obtained anonymization model is output from the data output part 19. FIG.

［深層学習の詳細］
次に、ここまで説明したステップＳ１２〜Ｓ１５の各処理の詳細について説明する。
まず、図４を参照して、深層学習が行われる例について説明する。
図４において、Ｈ^（ｌ）は、深層学習の１番目の層を示す。図４はＬ＝３の例であり、全体でＬ＋１個の層を持っている。入力層はＨ^（０）、出力層はＨ^（Ｌ）である。それぞれの層は、複数（又は１つ）のノードを有する。ノードＮ_ｉ ^（ｌ）は、層Ｈ^（ｌ）のｉ番目のノードを表し、ｎ^（ｌ）は層Ｈ^（ｌ）におけるノードの個数を表す。層Ｈ^（ｌ）には、ノードＮ_１ ^（ｌ），Ｎ_２ ^（ｌ），・・・，Ｎ_ｎ（ｌ） ^（ｌ）がある。 [Details of deep learning]
Next, details of each processing of steps S12 to S15 described so far will be described.
First, an example in which deep learning is performed will be described with reference to FIG.
In FIG. 4, H ^(l) indicates the first layer of deep learning. FIG. 4 shows an example of L = 3, and has L + 1 layers as a whole. The input layer is H ⁽⁰⁾ and the output layer is H ^(L) . Each layer has a plurality (or one) of nodes. Node _N ^{i (l)} represents the i-th node in layer ^{^{H (l), n (l}} ) is the number of nodes in layer ^{H (l).} In the layer H ^(l), there are nodes N ₁ ^(l) , N ₂ ^(l) ,..., N _{n (l)} ^(l) .

また、図４において、ｗ_ｉｊ ^（ｌ）は、ノードＮ_ｉ ^{（ｌ−１）}とノードＮ_ｊ ^（ｌ）の間の重みパラメータを表す。ｂ_ｊ ^（ｌ）は、ノードＮ_ｊ ^（ｌ）へのバイアスパラメータを表す。Ｆ^（ｌ）は、層Ｈ^（ｌ）の活性化関数を表す。ｘ_ｉ ^（ｌ）はノードＮ_ｉ ^（ｌ）への入力を表し、ｙ_ｉ ^（ｌ）はノードＮ_ｉ ^（ｌ）からの出力を表す。
これらの入出力の値は、以下の式で計算される。 In FIG. 4, w _ij ^(l) represents a weight parameter between the node N _i ^(l−1) and the node N _j ^(l) . b _j ^(l) represents a bias parameter to the node N _j ^(l) . F ^(l) represents the activation function of the layer H ^(l) . x _i ^(l) represents the input to node N _i ^(l) and y _i ^(l) represents the output from node N _i ^(l) .
These input / output values are calculated by the following equations.

ここで、ｔ_ｉは、ノードＮ_ｉ ^（Ｌ）の目標出力値を表し、Ｍは誤差関数を表す。誤差関数Ｍは、入力としてｙ_ｉ ^（Ｌ）及びｔ_ｉを取り、その誤差の値を返す。
学習データは、いくつかのバッチと呼ばれるまとまりに分割される。以下のプロセスは各バッチに対して行われる。 Here, t _i represents a target output value of the node N _i ^(L) , and M represents an error function. The error function M takes y _i ^(L) and t _i as inputs and returns the error value.
The learning data is divided into batches called batches. The following process is performed for each batch.

バッチ内の各レコードに対して、深層学習アルゴリズムにより、ｙ_ｉ ^（Ｌ）を計算する（ｉ＝１，・・・，ｎ^（Ｌ））。
次に、深層学習アルゴリズムにより、各ノードＮ_ｉ ^（ｌ）における誤差信号（δ_ｉ ^（ｌ）とおく）を計算する。ｌ＝Ｌのとき、δ_ｉ ^（Ｌ）は以下の［数２］式のように計算される。 For each record in the batch, y _i ^(L) is calculated by a deep learning algorithm (i = 1,..., N ^(L) ).
Next, an error signal (denoted as δ _i ^(l) ^{) at} each node N _i ^(l ) is calculated by a deep learning algorithm. When l = L, δ _i ^(L) is calculated as in the following [Equation 2].

ｌ＝１，・・・・，Ｌ−１に対しては、δ_ｉ ^（ｌ）は以下の［数３］式のように計算される。 For l = 1,..., L−1, δ _i ^(l) is calculated as in the following [Equation 3].

そして、深層学習アルゴリズムにより、δ_ｉ ^（ｌ）をバッチ内の各レコードに対して計算し、その総和を新たにδ_ｉ ^（ｌ）とおく。
次に、変動量Δｗ_ｉｊ ^（ｌ）を、以下のように定義する。 Then, δ _i ^(l) is calculated for each record in the batch by the deep learning algorithm, and the sum is newly set as δ _i ^(l) .
Next, the fluctuation amount Δw _ij ^(l) is defined as follows.

最後に、深層学習アルゴリズムにより、各重みパラメータｗ_ｉｊ ^（ｌ） for ｌ＝１，・・・，Ｌ，ｉ＝１，・・・，ｎ^{（ｌ−１）}， and ｊ＝１，・・・，ｎ^（ｌ）を、以下の［数５］式のように更新する。 Finally, each weight parameter w _ij ^(l) for l = 1,..., L, i = 1,..., N ^(l−1) , and j = 1,. , N ^(l) is updated as in the following [Equation 5].

ここで、学習率α、正則項λは、事前に決定しておく。
バイアスパラメータに関しては、以下のように更新する。 Here, the learning rate α and the regular term λ are determined in advance.
The bias parameter is updated as follows.

ここで、Δｂ_ｊ ^（ｌ）＝δ_ｊ ^（ｌ）である。
この［数１］式から［数６］式のプロセスを、全てのバッチに対して行う。
また、このプロセスを複数回繰り返す。この繰り返し回数をエポック数と呼ぶ。エポック数は、深層学習を行う前に事前引用文献、又は学習を進めながら決定する。 Here, Δb _j ^(l) = δ _j ^(l) .
The processes of [Expression 1] to [Expression 6] are performed for all batches.
This process is repeated several times. This number of repetitions is called the epoch number. The number of epochs is determined prior to deep learning, with prior citations or while learning is in progress.

［ε−差分プライバシの詳細］
次に、ε−差分プライバシについて説明する。
例えば、データベースＤとデータベースＤ′は、最大で１レコードだけ異なるとする。ランダム機構Ａは、出力の全ての集合Ｙについて、以下の［数７］式の条件が成り立つとき、ε−差分プライバシを実現する。 [Details of ε-differential privacy]
Next, ε-differential privacy will be described.
For example, it is assumed that the database D and the database D ′ differ by a maximum of one record. The random mechanism A realizes ε-difference privacy for all sets Y of outputs when the condition of the following [Equation 7] is satisfied.

データベースＤとデータベースＤ′とを、１レコードだけ異なるデータベースであると考える。入力のデータベースとして理論上可能性のある全てのデータベースの集合をＱとおく。このとき、ｆを、ｆ：Ｑ→Ｒである関数とする。ここで、全てのデータベースＤ及びデータベースＤ′に対して以下の［数８］式が成立するとき、Δｆをｆのグローバルセンシティビティ（global sensitivity）、つまりｆの値が取り得る範囲と定義する。 The database D and the database D ′ are considered to be different databases by one record. Let Q be the set of all theoretically possible databases as the input database. At this time, let f be a function of f: Q → R. Here, when the following [Equation 8] is established for all the databases D and D ′, Δf is defined as the global sensitivity of f, that is, the range that the value of f can take.

次に、ラプラスメカニズムと呼ばれる、ε−差分プライバシを満たす匿名化のメカニズムを説明する。
Lap(v)を、平均０、スケールがｖであるラプラス分布に基づいてランダムな誤差を出力する関数であるとする。このとき、ある関数ｆに対して、ランダムメカニズムＡが、ｆ（Ｄ）＋Lap（Δｆ／ε）を出力するとき、ランダムメカニズムＡは、ε−差分プライバシを満たす。 Next, an anonymization mechanism that satisfies the ε-differential privacy, called a Laplace mechanism, will be described.
Let Lap (v) be a function that outputs a random error based on a Laplace distribution with an average of 0 and a scale of v. At this time, when the random mechanism A outputs f (D) + Lap (Δf / ε) for a certain function f, the random mechanism A satisfies ε−differential privacy.

ここでは、誤差ｂを与える対象の変数が、１つのデータの有無によって変動し得る値の幅の最大値をｄとおく。ここでの最大値ｄは、実際の値ではなく、匿名化前のデータベースとして想定し得る値の幅から算出する。そして、誤差ｂ＝ｄ／εとする。つまり、最大値ｄの値が大きく、εが小さいほど、誤差ｂの値が大きくなり、与えられる誤差が大きくなる。 Here, the maximum value of the range of values that can vary depending on the presence or absence of one piece of data for the error b is set as d. The maximum value d here is not an actual value, but is calculated from a range of values that can be assumed as a database before anonymization. Then, an error b = d / ε is set. That is, the larger the maximum value d and the smaller ε, the larger the error b and the larger the given error.

なお、深層学習の重みパラメータやバイアスパラメータは複数存在する。これらパラメータの集合に対してε−差分プライバシを満たすこともできるが、本実施の形態では、個々のパラメータに対して個別にε−差分プライバシを満たすようにする。
このように個々のパラメータに対して個別にε−差分プライバシを満たすようにする場合には、ランダム機構Ａは、各パラメータにおける出力の全ての集合Ｙについて、以下の式が成り立ち、個々のパラメータに対して個別にε−差分プライバシを満たすことになる。なお、データベースＤとデータベースＤ′は、最大で１レコードだけ異なる。 There are a plurality of deep learning weight parameters and bias parameters. Although the ε-differential privacy can be satisfied for the set of parameters, in the present embodiment, the ε-differential privacy is individually satisfied for each parameter.
In this way, when satisfying ε-difference privacy individually for each parameter, the random mechanism A has the following formula for all sets Y of outputs in each parameter, and On the other hand, ε-differential privacy is satisfied individually. Note that the database D and the database D ′ differ by a maximum of one record.

［各パラメータの閾値設定例］
次に、重みパラメータｗ_ｉｊ ^（ｌ）とバイアスパラメータｂ_ｊ ^（ｌ）に対して値の閾値を設定する処理について説明する。なお、この処理は、図３のステップＳ２０の処理に相当する。
この処理は、１レコードだけ異なるときに変わりうる値の、理論上の最大値（グローバルセンシティビティ）を減少させることで、パラメータに与える誤差を減少させるために行われる。これにより、深層学習モデルの精度低下を軽減させる、つまり精度の向上を図ることができる。 [Threshold setting example for each parameter]
Next, processing for setting a threshold value for the weight parameter w _ij ^(l) and the bias parameter b _j ^(l) will be described. This process corresponds to the process of step S20 in FIG.
This processing is performed in order to reduce the error given to the parameter by reducing the theoretical maximum value (global sensitivity) of the value that can change when only one record differs. As a result, it is possible to reduce a decrease in accuracy of the deep learning model, that is, to improve accuracy.

ここでは、重みパラメータｗ_ｉｊ ^（ｌ）の最大値をｗ_ｍａｘ、最小値をｗ_ｍｉｎとする。また、バイアスパラメータｂ_ｊ ^（ｌ）の最大値をｂ_ｍａｘ、最小値をｂ_ｍｉｎとする。
また、本実施の形態では、深層学習への入力値（学習データ）にも閾値を設定する。この入力値の閾値は、ここでは［０，１］とする。ここでの閾値[０，１]とは、最小値を“０”とし、最大値を“１”として、“０”以上“１”以下に制限することを意味する。 Here, the maximum value of the weight parameter w _ij ^(l) is set to w _max and the minimum value is set to w _min . In addition, the maximum value of the bias parameter b _j ^(l) is b _max , and the minimum value is b _min .
In this embodiment, a threshold is also set for an input value (learning data) for deep learning. Here, the threshold value of the input value is [0, 1]. Here, the threshold value [0, 1] means that the minimum value is “0” and the maximum value is “1”, so that the threshold value is limited to “0” or more and “1” or less.

本実施の形態では、匿名化処理部１０は、深層学習を行った後、学習済重みパラメータｗ_ｉｊ ^（ｌ）に対して誤差を与える。つまり、深層学習時の全てのｉ，ｊ，ｌ（図３参照）に対して、ｗ_ｉｊ ^（ｌ）＋Lap（ｗ_ｍａｘ−ｗ_ｍｉｎ／ε）を計算する。この計算結果を、ｒ_ｉｊ ^（ｌ）とおく。もし、計算結果ｒ_ｉｊ ^（ｌ）の値が、最大値ｗ_ｍａｘを超えた場合、重みパラメータｗ_ｉｊ ^（ｌ）の値を最大値（閾値）ｗ_ｍａｘに修正する。
同様に、もし計算結果ｒ_ｉｊ ^（ｌ）の値が、最小値ｗ_ｍｉｎを下回った場合、重みパラメータｗ_ｉｊ ^（ｌ）の値を最小値（閾値）ｗ_ｍｉｎに修正する。 In the present embodiment, the anonymization processing unit 10 gives an error to the learned weight parameter w _ij ^(l) after performing deep learning. That is, w _ij ^(l) + Lap (w _max −w _min / ε) is calculated for all i, j, and l (see FIG. 3) during deep learning. Let this calculation result be r _ij ^(l) . If the value of the calculation result r _ij ^(l) exceeds the maximum value w _max , the value of the weight parameter w _ij ^(l) is corrected to the maximum value (threshold value) w _max .
Similarly, if the value of the calculation result r _ij ^(l) falls below the minimum value w _min , the value of the weight parameter w _ij ^(l) is corrected to the minimum value (threshold value) w _min .

また、この最大値及び最小値で制限する処理を、バイアスパラメータｂ_ｊ ^（ｌ）に対しても行う。つまり、バイアスパラメータｂ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｂ_ｍａｘ，ｍａｘ（ｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｂ_ｍａｘ−ｂ_ｍｉｎ）／ε）））に設定する。 Further, the process of limiting the maximum value and the minimum value is also performed on the bias parameter b _j ^(l) . That is, the calculation result of the bias parameter b _j ^(l) is set to min (b _max , max (b _min , b _j ^(l) + Lap ((b _max −b _min ) / ε))).

［閾値を設定したときにε−差分プライバシを満たすことの説明］
次に、閾値（最大値、最小値）で誤差を制限したときのパラメータが、ε−差分プライバシを満たしたものであることを説明する。
上述したように、本実施の形態では、深層学習時の重みパラメータｗ_ｉｊ ^（ｌ）やバイアスパラメータｂ_ｊ ^（ｌ）（図４参照）として、重みパラメータｗ_ｉｊ ^（ｌ）の理論上の最大幅（グローバルセンシティビティ）は（ｗ_ｍａｘ−ｗ_ｍｉｎ）であり、バイアスパラメータｂ_ｊ ^（ｌ）の理論上の最大幅（グローバルセンシティビティ）は（ｂ_ｍａｘ−ｂ_ｍｉｎ）である。次に説明するように、学習済み重みパラメータｗ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｗ_ｍａｘ，ｍａｘ（ｗ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｗ_ｍａｘ−ｗ_ｍｉｎ）／ε）））に設定し、学習済みバイアスパラメータｂ_ｊ ^（ｌ）の計算結果を、ｍｉｎ（ｂ_ｍａｘ，ｍａｘ（ｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（ｂ_ｍａｘ−ｂ_ｍｉｎ）／ε）））に設定することで、ε−差分プライバシを満たすことができる。 [Explanation of satisfying ε-differential privacy when a threshold is set]
Next, it will be described that the parameter when the error is limited by the threshold (maximum value, minimum value) satisfies ε-difference privacy.
As described above, in the present embodiment, the theoretical maximum width of the weight parameter w _ij ^(l) is used as the weight parameter w _ij ^(l) and the bias parameter b _j ^(l) (see FIG. 4 ⁾ during deep learning. (Global sensitivity) is (w _max −w _min ), and the theoretical maximum width (global sensitivity ⁾ of the bias parameter b _j ^(l ) is (b _max −b _min ). As will be described next, the calculation result of the learned weight parameter w _j ^{(l) is} expressed as min (w _max , max (w _min , b _j ^(l) + Lap ((w _max −w _min ) / ε))). And the calculation result of the learned bias parameter b _j ^(l) is set to min (b _max , max (b _min , b _j ^(l) + Lap ((b _max −b _min ) / ε))). Thus, ε-differential privacy can be satisfied.

ランダムメカニズムＡが、ｍｉｎ（ｆ_ｍｉｎ，ｍａｘ（ｆ_ｍａｘ，ｆ（Ｄ）＋Lap（Δｆ／ε）））を出力するとき、ランダムメカニズムＡはε−差分プライバシを実現する。ここで、ｆ_ｍａｘ及びｆ_ｍｉｎは、ｆ（Ｄ）が取り得る理論上の最大値と最小値である。
ここで、データベースＤと、そのデータベースＤに対して１レコードだけ異なるデータベースＤ′をおく。
また、Ｆ（Ｄ）＝ｆ（Ｄ）＋Lap（Δｆ／ε）とおく。Ｆ（Ｄ）の値が［ｆ_ｍｉｎ，ｆ_ｍａｘ］の範囲に入るとき、［数７］式が成立する。 When the random mechanism A outputs min (f _min , max (f _max , f (D) + Lap (Δf / ε))), the random mechanism A realizes ε-differential privacy. Here, f _max and f _min are the theoretical maximum and minimum values that f (D) can take.
Here, a database D and a database D ′ different from the database D by one record are set.
Further, F (D) = f (D) + Lap (Δf / ε) is set. When the value of F (D) falls within the range of [f _min , f _max ], [Formula 7] is established.

次に、Ｆ（Ｄ）の値がｆ_ｍｉｎを下回る場合を考える。このとき、Ａ（Ｄ）の出力値はｆ_ｍｉｎになる。Ａ（Ｄ）の出力がｆ_ｍｉｎになる確率は、次の［数９］式で表される。 Next, consider a case where the value of F (D) is _less than f _min . At this time, the output value of A (D) is f _min . The probability that the output of A (D) will be f _min is expressed by the following [Equation 9].

［数９］式において、Lap（ｖ，ｕ）は、スケールパラメータがｖであり、平均との差がｕである、ラプラス分布の確率密度関数の値を表す。
同様に、Ａ（Ｄ′）の出力値がｆ_ｍｉｎとなる確率は、次の［数１０］式で表される。 In the formula [9], Lap (v, u) represents the value of the probability density function of the Laplace distribution in which the scale parameter is v and the difference from the average is u.
Similarly, the probability that the output value of A (D ′) is f _min is expressed by the following [Equation 10].

［数９］式の値と、［数１０］式の値の比は、最大で［数１１］式で表される。 The ratio of the value of [Formula 9] and the value of [Formula 10] is represented by [Formula 11] at the maximum.

ここで、｜ｆ（Ｄ）−ｆ（Ｄ′）｜≦Δｆであるから、［数１１］式の値は、ｅｘｐ（ε）以下である。したがって、ε−差分プライバシを満たす。 Here, since | f (D) −f (D ′) | ≦ Δf, the value of the equation [11] is not more than exp (ε). Therefore, ε-differential privacy is satisfied.

次に、Ｆ（Ｄ）の値がｆ_ｍａｘ以上となる場合を考える。このとき、Ａ（Ｄ）の出力値はｆ_ｍａｘに制限される。Ａ（Ｄ）の出力がｆ_ｍａｘとなる確率は、次の［数１２］式で表される。 Next, consider a case where the value of F (D) is greater than or equal to f _max . At this time, the output value of A (D) is limited to f _max . The probability that the output of A (D) will be f _max is expressed by the following [Equation 12].

同様に、Ａ（Ｄ′）の出力値がｆ_ｍａｘとなる確率は、次の［数１３］式で表される。 Similarly, the probability that the output value of A (D ′) is f _max is expressed by the following [Equation 13].

［数１２］式の値と、［数１３］式の値の比は、最大で［数１４］式で表される。 The ratio of the value of [Expression 12] and the value of [Expression 13] is expressed by [Expression 14] at the maximum.

ここで、｜ｆ（Ｄ）−ｆ（Ｄ′）｜≦Δｆであるから、［数１４］式の値は、ｅｘｐ（ε）以下である。したがって、ε−差分プライバシを満たす。
このように誤差を最大値と最小値の閾値に制限することがε−差分プライバシを満たすことは、全てのパラメータについて成立する。したがって、本実施の形態のように各パラメータの誤差を閾値で制限することで、ε−差分プライバシが成り立つ。 Here, since | f (D) −f (D ′) | ≦ Δf, the value of the equation (14) is not more than exp (ε). Therefore, ε-differential privacy is satisfied.
In this way, limiting the error to the threshold value of the maximum value and the minimum value satisfies the ε-difference privacy for all parameters. Therefore, ε-difference privacy is established by limiting the error of each parameter with a threshold as in the present embodiment.

図９は、ここまで数式を用いて説明した、誤差を最大値と最小値の閾値に制限する処理の概略を示すものである。図９に示すように、例えばあるパラメータが取り得る値の範囲が“０”以上“１”以下であり、ある時点でのパラメータ値が０．８であるとする（グローバルセンシティビティは、最大値“１”と最小値“０”の差）。そして、このパラメータ値“０．８”に誤差を付与して、誤差付与済のパラメータ値が“１．１”になったとき、パラメータ値を閾値の範囲の上限値である“１”に制限する処理が行われる。
なお、この図９に示す例は、パラメータを閾値で制限する概略を非常に簡略化して示すものであり、実際の閾値に制限する処理は、ここまで数式を参照して説明した様々な条件を考慮して行われるものである。 FIG. 9 shows an outline of the processing for limiting the error to the threshold value of the maximum value and the minimum value, which has been described so far by using mathematical expressions. As shown in FIG. 9, for example, it is assumed that the range of values that a certain parameter can take is “0” or more and “1” or less, and the parameter value at a certain time point is 0.8 (the global sensitivity is the maximum value). Difference between “1” and minimum value “0”). When an error is given to the parameter value “0.8” and the parameter value with the error given becomes “1.1”, the parameter value is limited to “1” which is the upper limit value of the threshold range. Processing is performed.
Note that the example shown in FIG. 9 shows a very simplified outline of limiting parameters with thresholds, and the process of limiting to actual thresholds is based on the various conditions described above with reference to mathematical expressions. It is done with consideration.

［実データで評価した例］
図５は、本実施の形態の処理を、評価用のデータセットに対して実行した場合の例を示す。ここでは、評価用のデータセットとして、プライバシ保護データマイニングの分野で広く利用されている、［アダルトデータセット（Adult data set）］を利用する。［アダルトデータセット］は、１５種類の属性（年齢、性別、人種、年収、など）から構成されており、欠損値を含むレコードを除外して、４５，２２２レコードから成る。年収の属性は、各レコードの人物の年収が、５万ドルを超えているか否かの２値を取る。
そして、年収を除く１４の属性から、年収が５万ドルを超えているか否かを予測する深層学習システムを構築する。 [Examples evaluated with actual data]
FIG. 5 shows an example when the processing of the present embodiment is performed on a data set for evaluation. Here, as an evaluation data set, [Adult data set], which is widely used in the field of privacy protection data mining, is used. [Adult data set] is composed of 15 types of attributes (age, gender, race, annual income, etc.), and is composed of 45,222 records excluding records containing missing values. The attribute of annual income takes a binary value indicating whether the annual income of the person in each record exceeds 50,000 dollars.
Then, a deep learning system that predicts whether the annual income exceeds $ 50,000 is constructed from the 14 attributes excluding the annual income.

まず、差分プライバシを満たすような匿名化を行わない、生データに対して事前実験を行い、深層学習モデルの精度が高くなるような深層学習アルゴリズムの構造を決定した。学習率は０．０１、バッチサイズは５０、エポック数は５００、正則項は０．００１、中間層の数は４（入力層、出力層を含めると、全部で５層）が良い結果を出した。 First, a preliminary experiment was performed on the raw data without anonymization satisfying the differential privacy, and the structure of the deep learning algorithm was determined so that the accuracy of the deep learning model was increased. The learning rate is 0.01, the batch size is 50, the epoch number is 500, the regular term is 0.001, and the number of intermediate layers is 4 (5 layers in total including the input and output layers). did.

ここでは、１０分割交差検定を行って、差分プライバシを満たす匿名化を行うと共に、その匿名化を行う際に、誤差の最大値と最小値を閾値に制限する処理を行った場合の匿名化モデルの精度を計測した。この例では、精度を評価する手法として、手法［ａｃｃｕｒａｃｙ］と手法［ｆ−ｍｅａｓｕｒｅ］を用いた。１０分割交差検定は、データセットを９：１の比率で２つに分け、比率９の方のデータをトレーニングデータとし、比率１のデータをテストデータとする。すなわち、比率９のトレーニングデータを使って学習を行い、比率１のテストデータから、給料を除く１４種類の属性を入力として学習済みの深層学習モデルに投入して、給料を予測する処理を行う。そして、その予測結果と、実際の値を比較して評価を行う。この評価を１０回行うようにして、各レコードが一度ずつテストデータに含まれるようにする。 Here, an anonymization model in the case of performing an anonymization satisfying the difference privacy by performing a 10-fold cross-validation and performing a process of limiting the maximum value and the minimum value of the error to a threshold value when performing the anonymization. The accuracy of was measured. In this example, the method [accuracy] and the method [f-measure] are used as methods for evaluating accuracy. In the 10-fold cross-validation, the data set is divided into two at a ratio of 9: 1, the data of the ratio 9 is used as training data, and the data of ratio 1 is used as test data. That is, learning is performed using the training data of the ratio 9, and 14 types of attributes excluding the salary are input to the learned deep learning model from the test data of the ratio 1, and the salary is predicted. Then, the evaluation result is compared with the actual value for evaluation. This evaluation is performed 10 times so that each record is included in the test data once.

手法［ａｃｃｕｒａｃｙ］と手法［ｆ−ｍｅａｓｕｒｅ］の２つの評価指標の値（図５の縦軸）は、いずれも０から１までの値であり、１に近いほど精度が高いことを示す。図５の横軸はデータセットの数（バッチサイズ）を示し、図５Ａ、図５Ｂ、図５Ｃは、それぞれε＝１、ε＝１０、ε＝１００の場合を示す。
例えば、図５Ｃに示す例では、手法［ａｃｃｕｒａｃｙ］での評価指標値が０．８５、手法［ｆ−ｍｅａｓｕｒｅ］の評価指標値が０．７９となり、いずれも良好な精度が確保されていることが分かる。 The two evaluation index values (the vertical axis in FIG. 5) of the method [accuracy] and the method [f-measure] are values from 0 to 1, and the closer to 1, the higher the accuracy. The horizontal axis of FIG. 5 shows the number of data sets (batch size), and FIGS. 5A, 5B, and 5C show cases where ε = 1, ε = 10, and ε = 100, respectively.
For example, in the example illustrated in FIG. 5C, the evaluation index value in the method [accuracy] is 0.85, and the evaluation index value in the method [f-measure] is 0.79, both of which ensure good accuracy. I understand.

＜２．第２の実施の形態例＞
次に、本発明の第２の実施の形態例を、図６〜図８を参照して説明する。この第２の実施の形態例を説明する図６〜図８において、第１の実施の形態例で説明した図１〜図５と同一の構成及び処理については同一符号を付し、詳細な説明を省略する。 <2. Second Embodiment>
Next, a second embodiment of the present invention will be described with reference to FIGS. 6 to 8 for explaining the second embodiment, the same components and processes as those in FIGS. 1 to 5 explained in the first embodiment are denoted by the same reference numerals, and detailed description will be given. Is omitted.

［システム全体の構成］
図６は、第２の実施の形態例のプライバシ保護データ提供システムの構成を示す。
データベース１には、個人情報が含まれる多数の生データが蓄積され、データベース１に蓄積された生データが、深層学習処理部２０に供給される。深層学習処理部２０は、予め用意された深層学習アルゴリズムを適用した演算を行うと同時に、深層学習の演算時に、差分プライバシに基づく匿名化処理を施して、匿名化済みの深層学習モデルである、匿名化モデル４を得る。 [Entire system configuration]
FIG. 6 shows the configuration of a privacy protection data providing system according to the second embodiment.
A large amount of raw data including personal information is stored in the database 1, and the raw data stored in the database 1 is supplied to the deep learning processing unit 20. The deep learning processing unit 20 is a deep learning model that has been anonymized by performing an operation applying a deep learning algorithm prepared in advance and performing anonymization processing based on differential privacy at the time of deep learning operation. Anonymization model 4 is obtained.

深層学習処理部２０が、差分プライバシに基づいて匿名化モデル４を得る際には、深層学習アルゴリズムで使用する重みパラメータ及びバイアスパラメータに対して、それぞれのパラメータ値の変動量にラプラス分布に基づいて誤差を与えて、差分プライバシの処理を施す。但し、それぞれのパラメータ値の変動量にラプラス分布に基づいた誤差を与える際には、その誤差として、最大値及び最小値を示す閾値で制限するようにした。
ラプラス分布に基づいた誤差を与えるということは、誤差を与えたパラメータ値が、確率的要素を含む値になり、結果的に匿名化が行われた匿名化モデル４が得られることになる。
深層学習処理部２０が深層学習時に差分プライバシに基づいて匿名化モデル４を得るための誤差の生成は、図２に示した匿名化処理部１０での処理と同様の構成で実現される。 When the deep learning processing unit 20 obtains the anonymization model 4 based on differential privacy, the variation amount of each parameter value is based on the Laplace distribution for the weight parameter and bias parameter used in the deep learning algorithm. An error is given and differential privacy processing is performed. However, when an error based on the Laplace distribution is given to the fluctuation amount of each parameter value, the error is limited by a threshold value indicating the maximum value and the minimum value.
Giving an error based on the Laplace distribution means that the parameter value giving the error becomes a value including a stochastic element, and as a result, the anonymization model 4 in which anonymization is performed is obtained.
The generation of an error for the deep learning processing unit 20 to obtain the anonymization model 4 based on differential privacy during deep learning is realized with the same configuration as the processing in the anonymization processing unit 10 shown in FIG.

［全体の処理の流れ］
図７は、第２の実施の形態例のプライバシ保護データ提供システムでの処理の流れを示すフローチャートである。
まず、深層学習処理部２０は、データベース１から生データを取得する（ステップＳ３１）。そして、深層学習処理部２０は、取得した生データのパラメータの変動量に対して、グローバルセンシティビティによる制限を設定したラプラス分布に基づく誤差の付与を行いながら、予め用意された深層学習アルゴリズムを適用して深層学習を行う（ステップＳ３２）。このときには、深層学習を行いながら逐次的に、パラメータの変動量のグローバルセンシティビティを計算する。パラメータの変動量のグローバルセンシティビティを計算することで、グローバルセンシティビティとプライバシ指標「ε」からラプラス分布が決まり、ラプラス分布で誤差を与えることで、匿名化が行われる。そして、深層学習処理の結果として、匿名化モデルを取得し（ステップＳ３３）、得られた匿名化モデルをデータ出力部１９から出力する。 [Overall process flow]
FIG. 7 is a flowchart showing a flow of processing in the privacy protection data providing system of the second exemplary embodiment.
First, the deep learning processing unit 20 acquires raw data from the database 1 (step S31). Then, the deep learning processing unit 20 applies a deep learning algorithm prepared in advance while assigning an error based on a Laplace distribution in which a restriction by global sensitivity is set to the fluctuation amount of the parameter of the acquired raw data. Then, deep learning is performed (step S32). At this time, the global sensitivity of the parameter variation is calculated sequentially while performing deep learning. By calculating the global sensitivity of the parameter variation, the Laplace distribution is determined from the global sensitivity and the privacy index “ε”, and anonymization is performed by giving an error in the Laplace distribution. Then, an anonymization model is acquired as a result of the deep learning process (step S33), and the obtained anonymization model is output from the data output unit 19.

ステップＳ３２において、匿名化処理の制限に使用される閾値は、深層学習処理部２０における、重みパラメータの変動量の最大値及び最小値を示す閾値と、バイアスパラメータの変動量の最大値及び最小値を示す閾値である。 In step S32, the threshold values used for the restriction of the anonymization process are the threshold value indicating the maximum value and the minimum value of the variation amount of the weight parameter in the deep learning processing unit 20, and the maximum value and minimum value of the variation amount of the bias parameter It is a threshold which shows.

［深層学習の詳細］
次に、ここまで説明したステップＳ３１〜Ｓ３３の各処理の詳細について説明する。
本実施の形態例では、活性化関数と誤差関数を事前に決めて、匿名化された深層学習を行う。
例えば、ｆ（ｘ）＝ｍａｘ（０；ｘ）で定義されるＲｅＬＵが、深層学習の最終層を除く活性化関数として広く利用されている。
深層学習の利用目的として、カテゴリ分類の場合、最終層の活性化関数（Ｆ（Ｌ））
としてソフトマックス関数が、また、誤差関数としてクロスエントロピー誤差関数が広く利用されている。
ソフトマックス関数は、次の［数１５］式のように定義される。 [Details of deep learning]
Next, details of each processing of steps S31 to S33 described so far will be described.
In the present embodiment, an activation function and an error function are determined in advance, and anonymized deep learning is performed.
For example, ReLU defined by f (x) = max (0; x) is widely used as an activation function excluding the final layer of deep learning.
For the purpose of using deep learning, in the case of category classification, the activation function (F (L)) of the final layer
A softmax function is widely used, and a cross-entropy error function is widely used as an error function.
The softmax function is defined as in the following [Equation 15].

また、クロスエントロピー誤差関数は、次の［数１６］式のように定義される。 The cross entropy error function is defined as the following [Equation 16].

ここでは、匿名化された深層学習を行う場合、深層学習を行う最終層を除く各層は、活性化関数ReLUを、最終層の活性化関数としてソフトマックス関数を、誤差関数としてクロスエントロピー誤差関数を利用する。
最終層の活性化関数がソフトマックス関数であり、かつ、誤差関数がクロスエントロピー誤差関数の場合、誤差信号δ_ｊ（Ｌ） for ｊ＝１，・・・，ｎ^(L)の値は、次の［数１７］式に示すように計算される。 Here, when anonymized deep learning is performed, each layer except the final layer that performs deep learning has an activation function ReLU, a softmax function as an activation function of the final layer, and a cross-entropy error function as an error function. Use.
When the activation function of the final layer is a softmax function and the error function is a cross-entropy error function, the value of the error signal δ _j (L) for j = 1,..., N ^(L) is This is calculated as shown in [Equation 17].

［数１７］式において、ｙ_ｊ ^（Ｌ）はノードＮ_ｊ ^（Ｌ）の出力値を表し、ｔ_ｊ ^（Ｌ）はノードＮ_j ^（Ｌ）の目標出力値を表す。
最終層以外の層において活性化関数ReLUを使った場合、最終層以外の各ノードの誤差信号δ_ｊ ^（ｌ）＝１，・・・，Ｌ−１は次の［数１８］式で計算される。 In Equation 17, y _j ^(L) represents the output value of the node N _j ^(L) , and t _j ^(L) represents the target output value of the node N _j ^(L) .
When the activation function ReLU is used in a layer other than the final layer, error signals δ _j ^(l) = 1,..., L−1 of each node other than the final layer are calculated by the following [Equation 18]. The

ｘ_ｊ ^（１）の値として取り得る範囲は、［ｂ_ｊ ^（１）＋Σ_ｉｍｉｎ（ｗ_ｉ，ｊ ^（１），０），ｂ_ｊ ^（１）＋Σ_ｉｍａｘ（ｗ_ｉ，ｊ ^（１），０）］である。また、ｘ_ｊ ^（２）の値として取り得る範囲は、［ｂ_ｊ ^（２）＋Σ_ｉ（ｂ_ｉ ^（１）＋Σ_ｋｍａｘ（ｗ_ｋ，ｉ ^（１），０））ｍｉｎ（ｗ_ｉ，ｊ ^（２），０），ｂ_ｊ ^（２）＋Σ_ｉ（ｂ_ｉ ^（１）＋Σ_ｋｍａｘ（ｗ_ｋ，ｉ ^（１），０））ｍａｘ（ｗ_ｉ，ｊ ^（２），０）］となる。深層学習では、ｘ_ｊ ^（ｌ） for ｌ＝１，・・・，Ｌは、次の［数１９］式で計算される。 The possible range for the value of x _j ⁽¹⁾ is [b _j ⁽¹⁾ + Σ _i min (wi _{, j} ⁽¹⁾ , 0), b _j ⁽¹⁾ + Σ _i max (wi _{, j} ⁽¹⁾ , 0)]. Further, the range that can be taken as the value of x _j ⁽²⁾ is [b _j ⁽²⁾ + Σ _i (b _i ⁽¹⁾ + Σ _k max (w _{k, i} ⁽¹⁾ , 0)) min (w _{i, j} ⁽²⁾ , 0), b _j ⁽²⁾ + Σ _i (b _i ⁽¹⁾ + Σ _k max (w _{k, i} ⁽¹⁾ , 0)) max (w _{i, j} ⁽²⁾ , 0)] . In deep learning, x _j ^(l) for l = 1,..., L is calculated by the following [Equation 19].

ここで、ｍｉｎ（ｙ_ｉ ^（０））＝０であり、ｍａｘ（ｙ_ｉ ^（０））＝１である。これは、深層学習の第１層目への入力値を０以上１以下の範囲に正規化しているためである。また、最終層以外の層では、活性化関数ReLUを使っているので、ｌ＝１，・・・，Ｌ−１において、ｙ_ｊ ^（ｌ）は、次の［数２０］式によって計算される。 Here, min (y _i ⁽⁰⁾ ) = 0 and max (y _i ⁽⁰⁾ ) = 1. This is because the input value to the first layer of deep learning is normalized to a range of 0 or more and 1 or less. In addition, since the activation function ReLU is used in the layers other than the final layer, y _j ^(l) is calculated by the following [Equation 20] at l = 1,..., L−1. .

これによって、ｍａｘ（ｙ_ｊ ^（ｌ））の値は、常に０以上であることがわかる。
次に、誤差信号δ_ｊ ^（ｌ）の取り得る値の範囲を計算する。深層学習モデルの出力値の範囲は、−１から１までであるので、次の［数２１］式のように定義される。 This shows that the value of max (y _j ^(l) ) is always 0 or more.
Next, a range of possible values of the error signal δ _j ^(l) is calculated. Since the range of the output value of the deep learning model is from −1 to 1, it is defined as the following [Expression 21].

また、ｌ＝１，・・・，Ｌ−１について、次の［数２２］式で示される。ここで、全てのｊとｌについて、ｍｉｎ（δ_ｊ ^（ｌ））であり、ｍａｘ（δ_ｊ ^（ｌ））≧０である。 Further, l = 1,..., L−1 is expressed by the following [Equation 22]. Here, for all j and l, min (δ _j ^(l) ) and max (δ _j ^(l) ) ≧ 0.

最終的には、次の［数２３］式が得られる。 Ultimately, the following [Equation 23] is obtained.

ｂ_ｊ ^（ｌ）については、次の［数２４］式で示される。 b _j ^(l) is expressed by the following [Equation 24].

また、ｌ＝１，・・・，Ｌ−１について、次の［数２５］式で示される。 Further, l = 1,..., L−1 is expressed by the following [Equation 25].

既に述べたように、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）と、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）に基づいて、重みパラメータとバイアスパラメータを、［数５］式と［数６］式により更新する。つまり、データ入力ごとに毎回、重みパラメータとバイアスパラメータを更新する。
ここで本実施の形態例では、このときの変動量にラプラス分布に基づく誤差を与える。重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）と、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）についても、値の閾値を設定する。 As already described, the weighting parameter and the biasing parameter are expressed by the following [Equation 5] and [Equation 6] based on the weighting parameter variation Δw _ij ^(l) and the bias parameter variation Δb _j ^(l) . Update with That is, the weight parameter and the bias parameter are updated every time data is input.
Here, in the present embodiment, an error based on the Laplace distribution is given to the fluctuation amount at this time. A threshold value is also set for the fluctuation amount Δw _ij ^(l) of the weight parameter and the fluctuation amount Δb _j ^(l) of the bias parameter.

ここでは、Δｗ_ｍａｘとΔｗ_ｍｉｎを、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）の最大値と最小値とする。また、Δｂ_ｍａｘとΔｂ_ｍｉｎを、バイアスパラメータΔｂ_ｊ ^（ｌ）の最大値と最小値とする。 Here, Δw _max and Δw _min are the maximum value and the minimum value of the variation amount Δw _ij ^(l) of the weight parameter. Also, Δb _max and Δb _min are the maximum and minimum values of the bias parameter Δb _j ^(l) .

また、深層学習のエポック数をＥとおく。各バッチに対して学習を行う際に、それぞれのｗ_ｉｊ ^（ｌ）とｂ_ｊ ^（ｌ）に対して、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）を、ｍｉｎ（Δｗ_ｍａｘ，ｍｉｎ（Δｗ_ｍａｘ，ｗ_ｉｊ ^（ｌ）＋Lap（（Δｗ_ｍａｘ−Δｗ_ｍｉｎ）・Ｅ／ε）））に設定する。また、バイアスパラメータの変動量Δｂ_ｊ ^（ｌ）をｍｉｎ（Δｂ_ｍａｘ，ｍａｘ（Δｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（Δｂ_ｍａｘ−Δｂ_ｍｉｎ）・Ｅ／ε）））に設定する。 Also, let E be the epoch number for deep learning. When learning is performed for each batch, the weight parameter fluctuation amount Δw _ij ^(l) is set to min (Δw _max , min (Δw _max , ⁾ for each w _ij ^(l) and b _j ^(l) . w _ij ^(l) + Lap ((Δw _max −Δw _min ) · E / ε))). The bias parameter variation Δb _j ^{(l) is set} to min (Δb _max , max (Δb _min , b _j ^(l) + Lap ((Δb _max −Δb _min ) · E / ε))).

［閾値を設定したときにε−差分プライバシを満たすことの説明］
次に、深層学習を行う際に、パラメータを閾値（最大値,最小値）で誤差を制限した匿名モデルが、ε−差分プライバシを満たしたものであることを説明する。
各重みパラメータとバイアスパラメータは、［数５］式と［数６］式に基づいて更新される。［数５］式と［数６］式において、重みパラメータの変動量Δｗ_ｉｊ ^（ｌ）とバイアスパラメータの変動量Δｂ_ｊ ^（ｌ）は学習の入力値に依存して変わるが、それ以外の値は入力値に依存しない。したがって、第１の実施の形態で、閾値を設定したときにε−差分プライバシを満たすことを証明した場合と同様に、Δｗ_ｉｊ ^（ｌ）をｍｉｎ（Δｗ_ｍａｘ，ｍａｘ（Δｗ_ｍｉｎ，ｗ_ｉｊ ^（ｌ）＋Lap（（Δｗ_ｍａｘ−Δｗ_ｍｉｎ）・Ｅ／ε）））に設定し、また、Δｂ_ｊ ^（ｌ）をｍｉｎ（Δｂ_ｍａｘ，ｍａｘ（Δｂ_ｍｉｎ，ｂ_ｊ ^（ｌ）＋Lap（（Δｂ_ｍａｘ−Δｂ_ｍｉｎ）・Ｅ／ε)))に設定することで、各エポックのイテレーションは、パラメータベース（ε／Ｅ）−差分プライバシを満たす。
全体でＥエポックあるので、次に説明する証明より、最終的にε−差分プライバシを満たす。 [Explanation of satisfying ε-differential privacy when a threshold is set]
Next, it will be described that the anonymous model in which the error is limited by the threshold value (maximum value, minimum value) when the deep learning is performed satisfies the ε-difference privacy.
Each weight parameter and bias parameter are updated based on [Formula 5] and [Formula 6]. In [Expression 5] and [Expression 6], the weight parameter variation Δw _ij ^(l) and the bias parameter variation Δb _j ^(l) vary depending on the learning input value, but other values are used. Does not depend on the input value. Accordingly, in the first embodiment, Δw _ij ^{(l) is changed} to min (Δw _max , max (Δw _min , w _ij ⁽ ⁾ , as in the case where it is proved that ε-differential privacy is satisfied when the threshold is set. ^l) + Lap ((Δw _max −Δw _min ) · E / ε))), and Δb _j ^{(l) is set} to min (Δb _max , max (Δb _min , b _j ^(l) + Lap ((Δb _max) −Δb _min ) · E / ε))), each epoch iteration satisfies the parameter base (ε / E) −differential privacy.
Since there are E epochs as a whole, the ε-differential privacy is finally satisfied from the proof described below.

ランダムメカニズムＡが、ｄ個のランダムメカニズムＡ_１，・・・，Ａ_ｄから成り立っており、これを１回ずつ続けて実施するものとする。ここでは、ｉ≧２において、Ａ_ｉは入力としてＡ_ｉ−１の出力値を取る。Ａ_ｄの出力値が、Ａの出力値となる。
各Ａ_ｉは、パラメータベースε_ｉ−差分プライバシを満たすものとする。このとき、Ａはパラメータベース（Σ_ｉ＝１ ^ｄε_ｉ）の差分プライバシを実現する。 The random mechanism A is composed of d random mechanisms A ₁ ,..., A _d , and this is performed continuously once. Here, when i ≧ 2, A _i takes an output value of A _i−1 as an input. The output value of _Ad becomes the output value of A.
Each A _i shall satisfy the parameter base ε _i -differential privacy. At this time, A realizes a parameter-based (Σ _{i = 1} ^d ε _i ) differential privacy.

ランダムメカニズムＡは、ｄ個のランダムメカニズムＡ_１，・・・，Ａ_ｄから成り立っており、これを１回ずつ続けて実施するものとする。ｉ≧２において、Ａ_ｉは入力としてＡ_ｉ−１の出力値を取る。Ａ_ｄの出力値が、Ａの出力値となる。ここで、各Ａ_ｉは、ε_ｉ−差分プライバシを満たすものとする。このとき、ランダムメカニズムＡは（Σ_ｉ＝１ ^ｄε_ｉ）−差分プライバシを実現する。
この処理は各パラメータに対して実行されるので、ここでのランダムメカニズムＡは、パラメータベース（Σ_ｉ＝１ ^ｄε_ｉ）−差分プライバシを実現する。 The random mechanism A is composed of d random mechanisms A ₁ ,..., A _d , and this is performed continuously once. For i ≧ 2, A _i takes the output value of A _i−1 as an input. The output value of _Ad becomes the output value of A. Here, it is assumed that each A _i satisfies ε _i -difference privacy. At this time, the random mechanism A realizes (Σ _{i = 1} ^d ε _i ) −differential privacy.
Since this process is executed for each parameter, the random mechanism A here realizes a parameter base (Σ _{i = 1} ^d ε _i ) -differential privacy.

図１０は、第２の実施の形態例での、誤差を最大値と最小値の閾値に制限する処理の概略を示すものである。図１０に示すように、例えばあるパラメータの変動量として取り得る最大の範囲が“０”以上“１”以下であり、ある時点での変動量が０．６であるとする。そして、学習しながら逐次的に算出された閾値の範囲が、“０．３”以上“０．７”以下であるとする（この場合のグローバルセンシティビティは、０．７―０．３＝０．４）。この閾値の範囲（グローバルセンシティビティ）とプライバシ指標「ε」からラプラス分布が決まる。ラプラス分布で誤差を与える処理が行われる。なお、グローバルセンシティビティ（Δｆ）は、既に説明した［数８］式で計算されるものである。
ここで、図１０に示すように、パラメータの変動量“０．５”に誤差を付与して、誤差付与済のパラメータの変動量が“０．１”になったとき、その時点での閾値の範囲の下限値である“０．３”に制限する処理が行われる。ラプラス分布はグローバルセンシティビティとプライバシ指標「ε」から計算されるため、グローバルセンシティビティの値を小さく（つまり閾値の幅を小さく）することで、ラプラス分布の誤差を小さくすることができ、深層学習の精度の向上につながる。
この図１０に示す例についても、図９の例と同様に、パラメータの変動量を閾値で制限する概略を非常に簡略化して示すものであり、実際の閾値に制限する処理は、ここまで数式を参照して説明した様々な条件を考慮して行われるものである。
また、第２の実施の形態例の場合でも、グローバルセンシティビティ（Δｆ）が、パラメータの変動量として取り得る最大の範囲と一致する場合には、図９に示す状態で閾値の制限が行われることになる。 FIG. 10 shows an outline of the process for limiting the error to the threshold values of the maximum value and the minimum value in the second embodiment. As shown in FIG. 10, for example, it is assumed that the maximum range that can be taken as a variation amount of a certain parameter is “0” or more and “1” or less, and the variation amount at a certain time is 0.6. Then, it is assumed that the range of threshold values sequentially calculated while learning is “0.3” or more and “0.7” or less (the global sensitivity in this case is 0.7−0.3 = 0). .4). The Laplace distribution is determined from this threshold range (global sensitivity) and the privacy index “ε”. A process of giving an error with a Laplace distribution is performed. Note that the global sensitivity (Δf) is calculated by the equation [8] already described.
Here, as shown in FIG. 10, when an error is given to the parameter variation “0.5” and the parameter variation with the error added becomes “0.1”, the threshold value at that time The process of limiting to “0.3” which is the lower limit value of the range is performed. Since the Laplace distribution is calculated from the global sensitivity and the privacy index “ε”, the error of the Laplace distribution can be reduced by reducing the global sensitivity value (that is, the threshold width), and deep learning Leads to improved accuracy.
As in the example of FIG. 9, the example shown in FIG. 10 also shows a very simplified outline of limiting the amount of parameter fluctuation with a threshold value. This is performed in consideration of various conditions described with reference to FIG.
Even in the case of the second embodiment, when the global sensitivity (Δf) matches the maximum range that can be taken as the amount of parameter fluctuation, the threshold is limited in the state shown in FIG. It will be.

［実データで評価した例］
図８は、本実施の形態の処理を、評価用のデータセットに対して実行した場合の例を示す。この図８の例は、第１の実施の形態で説明した図５での評価と同じ条件で行ったものである。
図８の横軸はデータセットの数（バッチサイズ）を示し、図８Ａ、図８Ｂ、図８Ｃは、それぞれε＝１、ε＝１０、ε＝１００の場合を示す。
図８Ａ、図８Ｂ、図８Ｃに示すように、いずれの場合でも良好な精度が確保されていることが分かる。ここで、図５（第１の実施の形態例）と、図８（第２の実施の形態例）とを比較すると分かるように、εの値が小さいときは、第１の実施の形態例の方が、高い精度が得られる。一方、εの値が大きいときは、第２の実施の形態例の方が、高い精度が得られる。但し、この結果は使用するデータセットによって変わるものであり、いずれの実施の形態を適用するのが好ましいかは、使用するデータセットによって異なる。 [Examples evaluated with actual data]
FIG. 8 shows an example when the processing of the present embodiment is performed on a data set for evaluation. The example of FIG. 8 is performed under the same conditions as the evaluation in FIG. 5 described in the first embodiment.
The horizontal axis of FIG. 8 shows the number of data sets (batch size), and FIGS. 8A, 8B, and 8C show cases where ε = 1, ε = 10, and ε = 100, respectively.
As shown in FIGS. 8A, 8B, and 8C, it can be seen that good accuracy is ensured in any case. Here, as can be seen by comparing FIG. 5 (first embodiment) and FIG. 8 (second embodiment), when the value of ε is small, the first embodiment. The higher accuracy is obtained. On the other hand, when the value of ε is large, the second embodiment can obtain higher accuracy. However, this result varies depending on the data set to be used, and which embodiment is preferably applied differs depending on the data set to be used.

なお、図５及び図８に示す評価例では、予測した年収が５万ドル以下で、実際の年収が５万ドル以下である場合の回数をＴＮ、予測した年収が５万ドル以下で、実際の年収が５万ドルを超えている場合の回数をＦＮとした。また、予測した年収が５万ドルを超えていて、実際に５万ドルを超えている場合の回数をＴＰ、予測した年収が５万ドルを超えていて、実際の年収が５万ドル以下である場合の回数をＦＰとした。
このとき、手法［ａｃｃｕｒａｃｙ］では、［数２６］式での評価を行う。また、手法［ｆ−ｍｅａｓｕｒｅ］では、［数２７］式での評価を行う。 In the evaluation examples shown in FIGS. 5 and 8, the number of times when the predicted annual income is less than $ 50,000 and the actual annual income is less than $ 50,000 is TN, and the predicted annual income is less than $ 50,000 and is actually FN is the number of times that the annual income exceeds $ 50,000. Also, if the predicted annual income is over 50,000 dollars and the actual annual income is over 50,000 dollars, TP, the predicted annual income is over 50,000 dollars, and the actual annual income is under 50,000 dollars The number of times in some cases was defined as FP.
At this time, in the method [accuracy], evaluation is performed using the formula [26]. In the method [f-measure], the evaluation is performed using the equation [27].

以上説明したように、本発明の各実施の形態によると、ラプラス分布に基づいた誤差を与えて匿名化を行う際に、その誤差の最大値と最小値を閾値で制限するようにしたことで、匿名化を行う際に与える誤差を一定の範囲に制限することができ、誤差が少ない適切な匿名化を行うことができる。その結果、深層学習モデルの精度低下を軽減できるようになる。 As described above, according to each embodiment of the present invention, when anonymization is performed by giving an error based on the Laplace distribution, the maximum value and the minimum value of the error are limited by a threshold value. The error given when anonymizing can be limited to a certain range, and appropriate anonymization with little error can be performed. As a result, it is possible to reduce a decrease in accuracy of the deep learning model.

なお、ここまで説明した数式は、本発明の各実施の形態を適用する場合の好適な一例を示したものであり、本発明は、これらの数式で説明した処理に限定されるものではない。 The mathematical formulas described so far show a suitable example in the case of applying each embodiment of the present invention, and the present invention is not limited to the processing described with these mathematical formulas.

１…データベース（生データ）、２…深層学習処理部、３…深層学習モデル、４…匿名化モデル（匿名化済の深層学習モデル）、１０…匿名化処理部（閾値制限付き差分プライバシ適用）、１１…データ入力部、１２…ε入力部、１３…パラメータ構造決定部、１４…パラメータ初期値決定部、１５…閾値決定部、１６…閾値超え判定部、１７…閾値計算部、１８…匿名化演算部、１９…データ出力部、２０…機械学習処理部（差分プライバシ適用） DESCRIPTION OF SYMBOLS 1 ... Database (raw data), 2 ... Deep learning processing part, 3 ... Deep learning model, 4 ... Anonymization model (anonymized deep learning model), 10 ... Anonymization processing part (difference privacy application with threshold limitation) , 11 ... Data input section, 12 ... ε input section, 13 ... Parameter structure determination section, 14 ... Parameter initial value determination section, 15 ... Threshold determination section, 16 ... Threshold exceedance determination section, 17 ... Threshold calculation section, 18 ... Anonymous Calculation unit, 19 ... data output unit, 20 ... machine learning processing unit (difference privacy applied)

Claims

A deep learning processing unit that applies a deep learning algorithm to the raw data in the database to obtain a deep learning model;
An anonymization processing unit that obtains an anonymous model by performing anonymization processing based on differential privacy for the deep learning model obtained by the deep learning processing unit, and a privacy protection data providing system comprising:
The anonymization processing unit gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter included in the deep learning model, and each parameter giving an error based on the Laplace distribution A privacy protection data providing system characterized in that when the threshold value range indicated by the maximum value and the minimum value is exceeded, the threshold value range is limited.

It is a privacy protection data providing system including a deep learning processing unit that obtains an anonymous model that has been deeply learned by applying a deep learning algorithm while performing anonymization processing based on differential privacy for raw data in a database,
The deep learning processing unit gives each parameter value an error based on the Laplace distribution and gives an error based on the Laplace distribution to the weight parameter and the bias parameter used at the time of calculating the deep learning model. A privacy protection data providing system characterized in that when a parameter exceeds a threshold range indicated by a maximum value and a minimum value, the parameter is limited to the threshold range.

When the deep learning processing unit obtains a deep learning model, it sequentially calculates global sensitivity, performs processing to acquire the Laplace distribution based on the calculated global sensitivity,
The privacy protection data providing system according to claim 2, wherein an error based on the sequentially acquired Laplace distribution is given.

Deep learning processing procedure to obtain deep learning model by applying deep learning algorithm to raw data in database,
Anonymization processing procedure for performing anonymization processing based on differential privacy for the deep learning model obtained in the deep learning processing procedure,
The anonymization processing procedure gives an error based on the Laplace distribution to each parameter value for the weight parameter and the bias parameter included in the deep learning model, and each parameter that gives an error based on the Laplace distribution A privacy protection data providing method, characterized in that, when the range of the threshold value indicated by the maximum value and the minimum value is exceeded, the range is limited to the threshold value range.

Including deep learning processing procedure to obtain deep learning learned anonymous model by applying deep learning algorithm while performing anonymization processing based on differential privacy for raw data in database,
The deep learning processing procedure gives each parameter value an error based on the Laplace distribution and an error based on the Laplace distribution for each of the weight parameter and the bias parameter used in the calculation for obtaining the deep learning model. A privacy protection data providing method, characterized in that, when a parameter exceeds a threshold range indicated by a maximum value and a minimum value, the parameter is limited to the threshold range.

When obtaining a deep learning model in the deep learning processing procedure, sequentially calculate global sensitivity, obtain the Laplace distribution based on the calculated global sensitivity,
The privacy protection data providing method according to claim 5, wherein an error based on the sequentially acquired Laplace distribution is given.