JP5914291B2

JP5914291B2 - Transition probability calculation device, total value calculation device, transition probability calculation method, total value calculation method

Info

Publication number: JP5914291B2
Application number: JP2012230230A
Authority: JP
Inventors: 大五十嵐; 亮菊池; 千田　浩司; 浩司千田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2012-10-17
Filing date: 2012-10-17
Publication date: 2016-05-11
Anticipated expiration: 2032-10-17
Also published as: JP2014081844A

Description

本発明は、データベースのデータを秘匿するセキュリティ技術に関する。 The present invention relates to a security technique for concealing database data.

データベースにおいて、確率的な手法により秘匿された個別データ群から統計値を算出するためには、秘匿の際のルールである遷移確率を用いている。そして、遷移確率を算出する方法としては、非特許文献１の５．１節に示された方法が知られている。また、連続値である数値属性を秘匿化する方法としては、非特許文献２に示された方法が知られている。 In the database, in order to calculate a statistical value from an individual data group concealed by a probabilistic method, a transition probability that is a rule for concealment is used. As a method for calculating the transition probability, the method shown in Section 5.1 of Non-Patent Document 1 is known. As a method for concealing a numerical attribute that is a continuous value, a method disclosed in Non-Patent Document 2 is known.

五十嵐大，千田浩司，高橋克巳，“多値属性に適用可能な効率的プライバシー保護クロス集計”，CSS2008，2008年.Igarashi Univ., Koji Senda, Katsumi Takahashi, “Efficient Privacy Protection Cross Tabulation Applicable to Multi-valued Attributes”, CSS2008, 2008. 五十嵐大，千田浩司，高橋克巳，“数値属性における、ｋ−匿名性を満たすランダム化手法”，CSS2011，2011年.University of Igarashi, Koji Senda, Katsumi Takahashi, “Randomization method satisfying k-anonymity in numerical attributes”, CSS2011, 2011.

しかしながら、非特許文献１に示された技術は属性値が離散的な場合には使用できるが、属性値が連続的な場合には使用できないという課題がある。本発明は、連続的な属性値を含むデータベースに対して、秘匿された個別データ群から統計値を算出するための遷移確率を求める技術を提供することを目的とする。 However, the technique disclosed in Non-Patent Document 1 can be used when the attribute value is discrete, but cannot be used when the attribute value is continuous. An object of this invention is to provide the technique which calculates | requires the transition probability for calculating a statistics value from the individual data group concealed with respect to the database containing a continuous attribute value.

本発明の遷移確率算出装置は、細分化部、集計値情報部、細分確率計算部、統合部を備え、連続値を取る属性値を含むレコードを複数有するデータベースのデータを秘匿する際の属性値が遷移する確率を求める。まず、Ａ_ｖｕは属性値ｖが属性値ｕに遷移する確率密度関数、Ｊは属性値が取り得る値域の一部である値Ｇ_ｍｉｎから値Ｇ_ｍａｘの区間、Ｋは属性値が取り得る値域の一部の区間、Ａ_ＪＫは区間Ｊに含まれる属性値が区間Ｋに含まれる属性値に遷移する遷移確率、Ｍは２以上の整数、ｍは０以上Ｍ−１以下の整数、ｊ_０，…，ｊ_Ｍ−１は区間ＪをＭ個に細分化した区間、Ｎ_Ｊは区間Ｊに属性値が含まれるレコードの数、Ｎ_ｊｍは区間ｊ_ｍに属性値が含まれるレコードの数、ｇ_０は値Ｇ_ｍｉｎ、ｇ_Ｍは値Ｇ_ｍａｘ、ｍが１以上Ｍ−１以下のときはｇ_ｍは区間ｊ_ｍ-1と区間ｊ_ｍとの区切りとなる属性値、ｇ_０＜ｇ_１＜・・・＜ｇ_Ｍとする。細分化部は、区間ＪをＭ個に細分化し、細分化した区間ｊ_０，…，ｊ_Ｍ−１を求める。集計値情報部は、数Ｎ_Ｊと数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を求める。細分確率計算部は、細分確率ｐ_ｍを The transition probability calculation device of the present invention includes a subdivision unit, a total value information unit, a subdivision probability calculation unit, and an integration unit, and attribute values for concealing data in a database having a plurality of records including attribute values that take continuous values Find the probability of the transition. First, A _vu is a probability density function in which the attribute value v transitions to the attribute value u, J is a range from a value G _min to a value G _max that is a part of a value range that the attribute value can take, and K is a value range that the attribute value can take A _JK is a transition probability that an attribute value included in the section J transitions to an attribute value included in the section K, M is an integer of 2 or more, m is an integer of 0 to M-1, and j ₀ ,..., J _M-1 is a section obtained by subdividing the section J into M pieces, N _J is the number of records whose attribute value is included in the section J, N _jm is the number of records whose attribute value is included in the section j _m , g ₀ is a value G _min , g _M is a value G _max , and when m is 1 or more and M−1 or less, g _m is an attribute value that delimits the section j _m−1 and the section j _m, and g ₀ <g ₁ <... <g _M. The subdivision section subdivides the section J into M pieces, and obtains subdivided sections j ₀ ,..., J _M−1 . The total value information part obtains the number N _J and the numbers N _j0 ,..., N _jM−1 . The subdivision probability calculation unit calculates the subprobability p _m

のように求める。統合部は、遷移確率Ａ_ＪＫを Seek like. The integration unit uses the transition probability A _JK

のように求める。 Seek like.

本発明の集計値算出装置は、本発明の遷移確率算出装置を備え、さらに区間設定部と集計値更新部も備える。そして、区間設定部が、属性値のすべての値域が設定されるように順次区間Ｊを設定し、細分化部が設定された区間Ｊを細分化して細分化した区間を求め、集計値情報部がそれぞれの設定での細分化した区間に属性値が含まれるレコードの数を求めて細分化した区間の集計値とする。次に、区間設定部が、属性値のすべての値域の組み合わせが設定されるように順次区間Ｊと区間Ｋを設定し、細分化部が設定された区間Ｊを細分化して細分化した区間を求め、細分確率計算部が、設定された区間Ｊと区間Ｋについて、細分確率ｐ_０，…，ｐ_Ｍ−１を求め、統合部が、細分化した区間の集計値と細分確率ｐ_０，…，ｐ_Ｍ−１を用いて遷移確率Ａ_ＪＫを求める。そして、集計値更新部が、属性値のすべての値域の組み合わせが設定されるように順次設定された区間Ｊと区間Ｋのそれぞれの遷移確率Ａ_ＪＫを遷移確率行列の各要素とし、逆行列手法または反復ベイズ手法によって細分化した区間に属性値が含まれるレコードの数を更新し、新しい細分化した区間の集計値とする。区間Ｊのレコードの数は、区間Ｊに含まれる細分化した区間のレコード数を合計して求める。 The total value calculation device of the present invention includes the transition probability calculation device of the present invention, and further includes a section setting unit and a total value update unit. Then, the section setting unit sequentially sets the section J so that all value ranges of the attribute values are set, subdivides the section J in which the subdivision unit is set, obtains a subdivided section, and the total value information section The number of records in which attribute values are included in the subdivided sections in each setting is obtained and used as the aggregated value of the subdivided sections. Next, the section setting unit sequentially sets the section J and the section K so that all combinations of attribute values are set, and subdivides the section J set by the subdivision section into subdivided sections. Then, the subdivision probability calculation unit obtains subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and the integration unit calculates the aggregated values and subdivision probabilities p ₀ ,. , P _M−1 to obtain the transition probability A _JK . Then, the aggregate value update unit uses the transition probabilities A _JK of the sections J and K sequentially set so that combinations of all the range of attribute values are set as each element of the transition probability matrix, and an inverse matrix method Alternatively, the number of records whose attribute values are included in the segment subdivided by the iterative Bayes technique is updated to be the aggregate value of the new subdivided segment. The number of records in the section J is obtained by totaling the number of records in the subdivided sections included in the section J.

本発明の遷移確率算出装置では、連続的な属性値の値域を有限個の区間に区切り、各区間に含まれる属性値の数を用いるので、連続的な遷移確率密度関数ではなく、遷移確率を求めることができる。また、本発明の集計値算出装置は、求めた遷移確率を遷移確率行列の要素とするので、非特許文献１の技術（逆行列手法または反復ベイズ手法）を利用して秘匿された個別データ群から統計値を算出できる。 In the transition probability calculation apparatus of the present invention, the range of continuous attribute values is divided into a finite number of sections, and the number of attribute values included in each section is used. Therefore, the transition probability is not a continuous transition probability density function. Can be sought. Moreover, since the total value calculation apparatus of this invention uses the calculated | required transition probability as an element of a transition probability matrix, the individual data group concealed using the technique (inverse matrix method or iterative Bayes method) of nonpatent literature 1 Statistical values can be calculated from

実施例１の遷移確率算出装置の機能構成例を示す図。FIG. 3 is a diagram illustrating a functional configuration example of a transition probability calculation apparatus according to the first embodiment. 実施例１の遷移確率算出装置の処理フローを示す図。The figure which shows the processing flow of the transition probability calculation apparatus of Example 1. FIG. 本発明の集計値算出装置の機能構成例、実施例２の遷移確率算出装置の機能構成例を示す図。The function structural example of the total value calculation apparatus of this invention and the figure which shows the functional structural example of the transition probability calculation apparatus of Example 2 are shown. 本発明の集計値算出装置の処理フローを示す図。The figure which shows the processing flow of the total value calculation apparatus of this invention. 実施例２の遷移確率算出装置の処理フローを示す図。The figure which shows the processing flow of the transition probability calculation apparatus of Example 2. FIG.

以下、本発明の実施の形態について、詳細に説明する。なお、同じ機能を有する構成部には同じ番号を付し、重複説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail. In addition, the same number is attached | subjected to the structure part which has the same function, and duplication description is abbreviate | omitted.

図１に実施例１の遷移確率算出装置の機能構成例を、図２に実施例１の遷移確率算出装置の処理フローを示す。実施例１の遷移確率算出装置は、連続値を取る属性値を含むレコードを複数有するデータベース９００のデータを秘匿する際の属性値が遷移する確率を求める。レコードとは、いくつかのあらかじめ定められた項目に対する値からなる。属性とは各項目のことであり、属性値とは各項目の値である。属性値が離散的な属性とは、例えば“性別”や“年齢”などであり、属性値が連続値を取る属性とは、例えば“身長”や“体重”などである。 FIG. 1 shows a functional configuration example of the transition probability calculation apparatus according to the first embodiment, and FIG. 2 shows a processing flow of the transition probability calculation apparatus according to the first embodiment. The transition probability calculation apparatus according to the first embodiment obtains a probability that an attribute value changes when concealing data in the database 900 having a plurality of records including attribute values that take continuous values. A record consists of values for several predetermined items. The attribute is each item, and the attribute value is the value of each item. An attribute having discrete attribute values is, for example, “sex” or “age”, and an attribute having a continuous attribute value is, for example, “height” or “weight”.

実施例１の遷移確率算出装置１００は、データベース９００とネットワークで接続されており、細分化部１１０、集計値情報部１２０、細分確率計算部１３０、統合部１４０、記録部１９０を備える。まず、Ａ_ｖｕは属性値ｖが属性値ｕに遷移する確率密度関数、Ｊは属性値が取り得る値域の一部である値Ｇ_ｍｉｎから値Ｇ_ｍａｘの区間、Ｋは属性値が取り得る値域の一部の区間、Ａ_ＪＫは区間Ｊに含まれる属性値が区間Ｋに含まれる属性値に遷移する遷移確率、Ｍは２以上の整数、ｍは０以上Ｍ−１以下の整数、ｊ_０，…，ｊ_Ｍ−１は区間ＪをＭ個に細分化した区間、Ｎ_Ｊは区間Ｊに属性値が含まれるレコードの数、Ｎ_ｊｍは区間ｊ_ｍに属性値が含まれるレコードの数、ｇ_０は値Ｇ_ｍｉｎ、ｇ_Ｍは値Ｇ_ｍａｘ、ｍが１以上Ｍ−１以下のときはｇ_ｍは区間ｊ_ｍ-1と区間ｊ_ｍとの区切りとなる属性値、ｇ_０＜ｇ_１＜・・・＜ｇ_Ｍとする。なお、属性値ｇ_ｍは区間ｊ_ｍ-1と区間ｊ_ｍのどちらかの区間に属していれば、どちらの区間に属することにしてもよい。 The transition probability calculation apparatus 100 according to the first embodiment is connected to the database 900 via a network, and includes a subdivision unit 110, a total value information unit 120, a subdivision probability calculation unit 130, an integration unit 140, and a recording unit 190. First, A _vu is a probability density function in which the attribute value v transitions to the attribute value u, J is a range from a value G _min to a value G _max that is a part of a value range that the attribute value can take, and K is a value range that the attribute value can take A _JK is a transition probability that an attribute value included in the section J transitions to an attribute value included in the section K, M is an integer of 2 or more, m is an integer of 0 to M-1, and j ₀ ,..., J _M-1 is a section obtained by subdividing the section J into M pieces, N _J is the number of records whose attribute value is included in the section J, N _jm is the number of records whose attribute value is included in the section j _m , g ₀ is a value G _min , g _M is a value G _max , and when m is 1 or more and M−1 or less, g _m is an attribute value that delimits the section j _m−1 and the section j _m, and g ₀ <g ₁ <... <g _M. The attribute value g _m may belong to either section as long as it belongs to either section j _m-1 or section j _m .

細分化部１１０は、区間ＪをＭ個に細分化し、細分化した区間ｊ_０，…，ｊ_Ｍ−１を求める（Ｓ１１０）。なお、区間ｊ_ｍは属性値がｇ_ｍ以上ｇ_ｍ＋１より小さい区間でもよいし、属性値がｇ_ｍより大きくｇ_ｍ＋１以下の区間でもよい。 The subdivision section 110 subdivides the section J into M pieces, and obtains subdivided sections j ₀ ,..., J _M−1 (S110). Incidentally, the interval _{j m} is to attribute values may be _{g m} or _{g m + 1} smaller intervals, the attribute value may be a larger _{g m + 1} following section than _{g m.}

集計値情報部１２０は、数Ｎ_Ｊと数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を求め、記録部１９０に記録する（Ｓ１２０）。数Ｎ_Ｊと数Ｎ_ｊ０，…，Ｎ_ｊＭ−１の求め方には、いくつかの方法があり得る。例えば、データベース９００から属性値の正しい集計値（Ｎ_ｊ０，…，Ｎ_ｊＭ−１）を取得できるのであれば、取得すべきである。この場合は、Ｎ_Ｊ＝Ｎ_ｊ０＋…＋Ｎ_ｊＭ−１のように数Ｎ_Ｊを求めればよい。集計値が分からない状態のときは、例えば、属性値に対して一様に分布していることを前提として集計値（Ｎ_ｊ０，…，Ｎ_ｊＭ−１）を求める方法がある。この場合は、属性値の値域を最小値Ｖ_ｍｉｎから最大値Ｖ_ｍａｘ、全レコード数をＮ_ＡＬＬとし、集計値情報部１２０は、数Ｎ_Ｊと数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を、 The total value information unit 120 calculates the numbers N _J and the numbers N _j0 ,..., N _jM−1 and records them in the recording unit 190 (S120). There are several methods for obtaining the numbers N _J and the numbers N _j0 ,..., N _jM−1 . For example, if a correct aggregate value (N _j0 ,..., N _jM−1 ) of attribute values can be acquired from the database 900, it should be acquired. In this case, the number N _J may be obtained as N _J = N _j0 +... + N _jM−1 . When the total value is unknown, for example, there is a method for obtaining the total value (N _j0 ,..., N _jM−1 ) on the assumption that the attribute value is uniformly distributed. In this case, the maximum value _{V max} of the range of attribute values from the minimum value _{V min,} the total number of records and _{N ALL,} aggregate value information 120, the number _{N J} and the number _N j0, _..., a _{N jM-1,}

のように求めればよい。また、集計値が分からない状態のときの別の例としては、以下のように秘匿された属性値の分布を用いる方法もある。具体的には、属性値の値域を最小値Ｖ_ｍｉｎから最大値Ｖ_ｍａｘ、全レコード数をＮ_ＡＬＬとし、集計値情報部１２０は、秘匿された属性値の分布を用いて数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を求め、数Ｎ_ＪをＮ_Ｊ＝Ｎ_ｊ０＋…＋Ｎ_ｊＭ−１のように求めればよい。 You can ask as follows. As another example when the total value is unknown, there is a method of using a secret attribute value distribution as follows. Specifically, the range of attribute values is set from the minimum value V _min to the maximum value V _max , the total number of records is set to N _ALL , and the total value information unit 120 uses a secret attribute value distribution to calculate the number N _j0,. , N _jM−1 and the number N _J may be _calculated as N _J = N _j0 +... + N _jM−1 .

細分確率計算部１３０は、細分確率ｐ_ｍ（ただし、ｍ＝０，…，Ｍ−１）を、 The subdivision probability calculation unit 130 calculates subdivision probability p _m (where m = 0,..., M−1).

のように求め、記録部１９０に記録する（Ｓ１３０）。 And is recorded in the recording unit 190 (S130).

例えば、確率密度関数Ａ_ｖｕがラプラス分布を基礎ノイズとする有限ノイズ関数の場合を考える。ここで、区間Ｋの属性値の値域を最小値Ｈ_ｍｉｎから最大値Ｈ_ｍａｘ、属性値の値域を最小値Ｖ_ｍｉｎから最大値Ｖ_ｍａｘ、２σ^２はラプラス分布の分散とする。このとき、確率密度関数Ａ_ｖｕは、入力ｖに以下を満たす確率密度関数ｆ_ｖ（ｘ）に従うノイズＸを加算した確率変数Ｙの確率密度である。基礎ノイズを確率密度関数ｆ（ｘ）とするとき、あるｖに依存した数α_ｖがあって、
Ｖ_ｍｉｎ−ｖ≦ｘ≦Ｖ_ｍａｘ−ｖ
である場合に、
ｆ_ｖ（ｘ）＝ｆ（ｘ）／α_ｖ
を満たし、そうでない場合には
ｆ_ｖ（ｘ）＝０
を満たす。 For example, consider the case where the probability density function A _vu is a finite noise function with a Laplace distribution as the basic noise. Here, the value range of the attribute value in the section K is the minimum value H _min to the maximum value H _max , and the attribute value range is the minimum value V _min to the maximum value V _max , and 2σ ² is the dispersion of the Laplace distribution. At this time, the probability density function A _vu is the probability density of the random variable Y obtained by adding the noise X according to the probability density function f _v (x) satisfying the following to the input v. When the basic noise is a probability density function f (x), there is a number α _v depending on a certain v,
V _min −v ≦ x ≦ V _max −v
If
f _v (x) = f (x) / α _v
If not, f _v (x) = 0
Meet.

Ａ_ｖｕがこのような確率密度関数の場合には、以下のように計算すればよい。 When A _vu is such a probability density function, it may be calculated as follows.

ただし、 However,

統合部１４０は、遷移確率Ａ_ＪＫを The integration unit 140 determines the transition probability A _JK

のように求め、記録部１９０に記録する（Ｓ１４０）。 And is recorded in the recording unit 190 (S140).

実施例１の遷移確率算出装置によれば、連続的な属性値の値域を有限個の区間に区切り、各区間に含まれる属性値の数を用いるので、連続的な遷移確率密度関数ではなく、遷移確率を求めることができる。したがって、求めた遷移確率を遷移確率行列の要素とすることができるので、非特許文献１の技術を利用して秘匿された個別データ群から統計値を算出できるようになる。 According to the transition probability calculation device of the first embodiment, the range of continuous attribute values is divided into a finite number of sections, and the number of attribute values included in each section is used. Therefore, instead of a continuous transition probability density function, Transition probability can be obtained. Therefore, since the obtained transition probability can be used as an element of the transition probability matrix, the statistical value can be calculated from the individual data group concealed using the technique of Non-Patent Document 1.

本発明の集計値算出装置の機能構成例を図３に、本発明の集計値算出装置の処理フローを図４に示す。本発明の集計値算出装置２００は、データベース９００とネットワークで接続されており、遷移確率算出装置１００、区間設定部２５０、集計値更新部２６０を備える。なお、集計値算出装置２００は、正しい集計値（Ｎ_ｊ０，…，Ｎ_ｊＭ−１）が分からない場合に用いる装置である。 FIG. 3 shows a functional configuration example of the total value calculation apparatus of the present invention, and FIG. 4 shows a processing flow of the total value calculation apparatus of the present invention. The total value calculation device 200 of the present invention is connected to the database 900 via a network, and includes a transition probability calculation device 100, a section setting unit 250, and a total value update unit 260. The aggregate value calculation apparatus 200 is an apparatus used when a correct aggregate value (N _j0 ,..., N _jM−1 ) is not known.

以下では、図４の処理フローに従いながら処理を説明する。区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）の中から区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）を選んで設定する（Ｓ２５１）。細分化部１１０は、設定された区間Ｊを細分化して細分化した区間（ｊ_０，…，ｊ_Ｍ−１）を求める（Ｓ１１１）。集計値情報部１２０は、細分化した区間（ｊ_０，…，ｊ_Ｍ−１）に属性値が含まれるレコードの数をあらかじめ定めた方法で求めて細分化した区間の集計値とし、記録部１９０に記録する（Ｓ１２１）。具体的には、属性値の値域を最小値Ｖ_ｍｉｎから最大値Ｖ_ｍａｘ、全レコード数をＮ_ＡＬＬとし、集計値情報部１２０は、数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を、 Hereinafter, the processing will be described while following the processing flow of FIG. The section setting unit 250 selects and sets the section J (G _{min to} G _max ) from the attribute value range (V _{min to} V _max ) (S251). The subdividing unit 110 subdivides the set section J to obtain a subdivided section (j ₀ ,..., J _M−1 ) (S111). The total value information unit 120 obtains the number of records whose attribute values are included in the subdivided section (j ₀ ,..., J _M−1 ) by a predetermined method and sets the total value of the subdivided section. 190 (S121). Specifically, the maximum value _{V max} of the range of attribute values from the minimum value _{V min,} the total number of records and _{N ALL,} aggregate value information 120, the number _N j0, _..., a _{N jM-1,}

のように求めればよい。もしくは、集計値情報部１２０は、秘匿された属性値の分布を用いて数Ｎ_ｊ０，…，Ｎ_ｊＭ−１を求めればよい。そして、区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）のすべてに対して区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）を選んで設定したかを確認し（Ｓ２５２）、Ｎｏの場合にはステップＳ２５１に戻る。 You can ask as follows. Or the total value information part 120 should _{just obtain | require} number _Nj0 , ..., _NjM-1 using the distribution of the secret attribute value. Then, the section setting unit 250 confirms whether or not the section J (G _{min to} G _max ) is selected and set for all the attribute value ranges (V _{min to} V _max ) (S252). Returns to step S251.

ステップＳ２５２がＹｅｓの場合には、区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）の中から区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）と区間Ｋ（Ｈ_ｍｉｎ〜Ｈ_ｍａｘ）の組み合わせを選んで設定する（Ｓ２５３）。細分化部１１０は、設定された区間Ｊを細分化して細分化した区間（ｊ_０，…，ｊ_Ｍ−１）を求める（Ｓ１１２）。細分確率計算部１３０は、設定された区間Ｊと区間Ｋについて、細分確率ｐ_０，…，ｐ_Ｍ−１を求め、記録部１９０に記録する（Ｓ１３０）。統合部１４０が、細分化した区間の集計値と細分確率ｐ_０，…，ｐ_Ｍ−１を用いて遷移確率Ａ_ＪＫを求め、記録部１９０に記録する（Ｓ１４０）。ステップＳ１３０とステップＳ１４０は、実施例１と同じ方法とすればよい。そして、区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）のすべてに対して区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）と区間Ｋ（Ｈ_ｍｉｎ〜Ｈ_ｍａｘ）の組み合わせを選んで設定したかを確認し（Ｓ２５４）、Ｎｏの場合にはステップＳ２５３に戻る。 If step S252 is Yes, the interval setting unit 250, a combination of attribute values of the value range _{_(V} min _{~V max)} interval from the _{J (G} min _{~G max)} and section _{K _(H} min _~H _max) Is selected and set (S253). The subdividing unit 110 subdivides the set section J to obtain a subdivided section (j ₀ ,..., J _M−1 ) (S112). The subdivision probability calculation unit 130 obtains subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and records them in the recording unit 190 (S130). The integration unit 140 obtains the transition probability A _JK using the aggregated values of the subdivided sections and the subdivision probabilities p ₀ ,..., P _M−1 and records them in the recording unit 190 (S140). Steps S130 and S140 may be the same method as in the first embodiment. The section setting unit 250 selects and sets a combination of the section J (G _{min to} G _max ) and the section K (H _{min to} H _max ) for all the attribute value ranges (V _{min to} V _max ). (S254). If No, the process returns to step S253.

ステップＳ５４がＹｅｓの場合には、集計値更新部２６０が、属性値のすべての値域の組み合わせが設定されるように順次設定された区間Ｊと区間Ｋのそれぞれの遷移確率Ａ_ＪＫを要素とした遷移確率行列Ａを作成する。そして、例えば、非特許文献１の５．２節に示された逆行列手法や反復ベイズ手法によって、遷移確率行列Ａを用いて細分化した区間に属性値が含まれるレコードの数を更新し、新しい細分化した区間の集計値とし、記録部１９０に記録する（Ｓ２６０）。なお、反復ベイズ手法など、反復処理の終了条件として、更新前の集計値と更新後の集計値の差があらかじめ定めた範囲かを確認する手法もある。このような手法の場合、集計値更新部２６０は、更新された集計値が反復処理の終了条件を満たすかを確認する（Ｓ２６１）。そして、Ｎｏの場合はステップＳ２５３に戻り、新しい細分化した区間の集計値を用いて処理を進める。ステップＳ２６１がＹｅｓの場合とステップＳ２６１がない場合は、集計値更新部２６０は、属性値が区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）に含まれるレコード数Ｎ_Ｊを、Ｎ_Ｊ＝Ｎ_ｊ０＋…＋Ｎ_ｊＭ−１のように求め、記録部１９０に記録する（Ｓ２６２）。なお、本実施例に示したステップＳ２５１〜Ｓ２６１の処理をまとめて、細分化集計値算出ステップ（Ｓ２００）と呼ぶことにする。 When step S54 is Yes, the total value update unit 260 uses the transition probabilities A _JK of the sections J and K that are sequentially set so that combinations of all the range of attribute values are set as elements. A transition probability matrix A is created. And, for example, by updating the number of records whose attribute values are included in the segment subdivided using the transition probability matrix A by the inverse matrix method or the iterative Bayesian method shown in Section 5.2 of Non-Patent Document 1, The total value of the new segmented section is recorded in the recording unit 190 (S260). There is also a method for confirming whether the difference between the total value before the update and the total value after the update is within a predetermined range as an end condition of the iterative processing, such as an iterative Bayes method. In the case of such a method, the total value update unit 260 confirms whether the updated total value satisfies the end condition of the iterative process (S261). In the case of No, the process returns to step S253, and the process proceeds using the total value of the new subdivided section. When Step S261 is Yes and Step S261 is not present, the aggregate value update unit 260 calculates the number of records N _J whose attribute values are included in the section J (G _{min to} G _max ) as N _J = N _j0 +. _jM-1 is obtained and recorded in the recording unit 190 (S262). Note that the processing of steps S251 to S261 shown in this embodiment will be collectively referred to as a subdivided total value calculation step (S200).

本発明の集計値算出装置によれば、連続的な属性値の値域を有限個の区間に区切り、区間から区間への遷移確率を求めるので、求めた遷移確率を要素とする遷移確率行列を求めることができる。しがたって、非特許文献１の技術を利用して秘匿された個別データ群から統計値を算出できるようになる。 According to the total value calculation device of the present invention, the range of continuous attribute values is divided into a finite number of sections, and the transition probability from section to section is obtained, so a transition probability matrix having the obtained transition probability as an element is obtained. be able to. Therefore, the statistical value can be calculated from the individual data group concealed using the technique of Non-Patent Document 1.

本実施例の遷移確率算出装置３００を図３に、処理フローを図５に示す。本実施例は、実施例２の方法で求められた精度の高い細分化した区間の集計値を用いて遷移確率を求める。構成としては図３に示すように、集計値算出装置２００と同じである。ただし、図５に示すように、実施例２で示した細分化集計値算出ステップ（Ｓ２００）を実施した後、統合部１４０が、集計値更新部２６０が求めた細分化した区間の集計値を用いて遷移確率Ａ_ＪＫを求めることが異なる。なお、細分化集計値算出ステップ（Ｓ２００）では、属性値のすべての値域の組み合わせが設定されるように順次区間Ｊと区間Ｋを設定して細分化した区間の集計値を求めるので、遷移確率行列のすべての要素に対して遷移確率を求めることができる。 FIG. 3 shows the transition probability calculation apparatus 300 of this embodiment, and FIG. 5 shows the processing flow. In the present embodiment, the transition probability is obtained using the aggregate value of the segment with high accuracy obtained by the method of the second embodiment. As shown in FIG. 3, the configuration is the same as that of the total value calculation device 200. However, as shown in FIG. 5, after performing the subdivided total value calculation step (S200) shown in the second embodiment, the integration unit 140 calculates the total value of the subdivided sections obtained by the total value update unit 260. The difference is that the transition probabilities A _JK are obtained. In the subdivided aggregate value calculation step (S200), since the sections J and K are sequentially set so that combinations of all the range of attribute values are set, the aggregate values of the subdivided sections are obtained, so the transition probability Transition probabilities can be obtained for all elements of the matrix.

具体的には、細分化集計値算出ステップ（Ｓ２００）を実行した後、区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）の中から区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）と区間Ｋ（Ｈ_ｍｉｎ〜Ｈ_ｍａｘ）の組み合わせを選んで設定する（Ｓ２５３）。細分化部１１０は、設定された区間Ｊを細分化して細分化した区間（ｊ_０，…，ｊ_Ｍ−１）を求める（Ｓ１１２）。細分確率計算部１３０は、設定された区間Ｊと区間Ｋについて、細分確率ｐ_０，…，ｐ_Ｍ−１を求め、記録部１９０に記録する（Ｓ１３０）。統合部１４０が、細分化した区間の集計値と細分確率ｐ_０，…，ｐ_Ｍ−１を用いて遷移確率Ａ_ＪＫを求め、記録部１９０に記録する（Ｓ１４０）。ステップＳ１３０とステップＳ１４０は、実施例１と同じ方法とすればよい。そして、区間設定部２５０は、属性値の値域（Ｖ_ｍｉｎ〜Ｖ_ｍａｘ）のすべてに対して区間Ｊ（Ｇ_ｍｉｎ〜Ｇ_ｍａｘ）と区間Ｋ（Ｈ_ｍｉｎ〜Ｈ_ｍａｘ）の組み合わせを選んで設定したかを確認し（Ｓ２５４）、Ｎｏの場合にはステップＳ２５３に戻る。ステップＳ５４がＹｅｓの場合には、処理を終了する。 Specifically, after executing the subdivided total value calculation step (S200), the section setting unit 250 selects the section J (G _{min to} G _max ) and the section from the attribute value range (V _{min to} V _max ). A combination of K (H _{min to} H _max ) is selected and set (S253). The subdividing unit 110 subdivides the set section J to obtain a subdivided section (j ₀ ,..., J _M−1 ) (S112). The subdivision probability calculation unit 130 obtains subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and records them in the recording unit 190 (S130). The integration unit 140 obtains the transition probability A _JK using the aggregated values of the subdivided sections and the subdivision probabilities p ₀ ,..., P _M−1 and records them in the recording unit 190 (S140). Steps S130 and S140 may be the same method as in the first embodiment. The section setting unit 250 selects and sets a combination of the section J (G _{min to} G _max ) and the section K (H _{min to} H _max ) for all the attribute value ranges (V _{min to} V _max ). (S254). If No, the process returns to step S253. If step S54 is Yes, the process ends.

実施例３の遷移確率算出装置３００によれば、精度の高い細分化した区間の集計値を用いるので、精度の高い遷移確率を求めることができる。さらに、属性値のすべての値域の組み合わせが設定されるように順次区間Ｊと区間Ｋを設定できるので、再構築に用いる遷移確率行列のすべての要素（遷移確率）を求めることができる。 According to the transition probability calculation device 300 of the third embodiment, since the aggregate value of the segment with high accuracy is used, the transition probability with high accuracy can be obtained. Furthermore, since the section J and the section K can be set sequentially so that combinations of all the range of attribute values are set, all elements (transition probabilities) of the transition probability matrix used for reconstruction can be obtained.

なお、実施例１から３に共通する本発明のポイントは、精度を高めるために区間Ｊをさらに細かい区間ｊ_ｍに分割する点である。秘匿されたデータから“再構築”により集計値を得る際、遷移確率から遷移確率行列を作る。すなわち、属性値ｖと属性値ｕの組について、ｖとｕに確率的に変化する確率を算出する。複数の属性に関するクロス集計を再構築する際、再構築の計算量は属性に関する属性値の値域の大きさの積となり、非常に大きくなる。区間Ｊを細かい区間とすると、区間の数、値域の大きさが大きくなってしまうため、あまり区間Ｊは細かい区間にできない。しかし、各属性の遷移確率行列を計算する場合には当該属性だけに注目すればよいため、計算量は当該属性の単一の値域の大きさだけに比例し、より細かい分割としても計算量の観点から、計算は可能である。本発明は、この点に着目したものである。 Incidentally, the point of the present invention which is common in Examples 1 to 3 is that it divides the interval J into smaller sections j _m in order to increase the accuracy. A transition probability matrix is created from the transition probabilities when the aggregated value is obtained by “reconstruction” from the secret data. That is, the probability of probabilistically changing to v and u is calculated for the set of attribute value v and attribute value u. When restructuring a cross tabulation for a plurality of attributes, the amount of calculation for the reconstruction is a product of the size of the attribute value range for the attribute, which is very large. If the section J is a fine section, the number of sections and the size of the range will be large, so the section J cannot be made very fine. However, when calculating the transition probability matrix of each attribute, it is only necessary to pay attention to that attribute. From the point of view, calculation is possible. The present invention focuses on this point.

［プログラム、記録媒体］
上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能であることはいうまでもない。 [Program, recording medium]
The various processes described above are not only executed in time series according to the description, but may also be executed in parallel or individually as required by the processing capability of the apparatus that executes the processes. Needless to say, other modifications are possible without departing from the spirit of the present invention.

また、上述の構成をコンピュータによって実現する場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記処理機能がコンピュータ上で実現される。 Further, when the above-described configuration is realized by a computer, processing contents of functions that each device should have are described by a program. The processing functions are realized on the computer by executing the program on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. As the computer-readable recording medium, for example, any recording medium such as a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory may be used.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させる構成としてもよい。 The program is distributed by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM in which the program is recorded. Furthermore, the program may be distributed by storing the program in a storage device of the server computer and transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶装置に格納する。そして、処理の実行時、このコンピュータは、自己の記録媒体に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実行形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよく、さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、本形態におけるプログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 A computer that executes such a program first stores, for example, a program recorded on a portable recording medium or a program transferred from a server computer in its own storage device. When executing the process, the computer reads a program stored in its own recording medium and executes a process according to the read program. As another execution form of the program, the computer may directly read the program from a portable recording medium and execute processing according to the program, and the program is transferred from the server computer to the computer. Each time, the processing according to the received program may be executed sequentially. Also, the program is not transferred from the server computer to the computer, and the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes the processing function only by the execution instruction and result acquisition. It is good. Note that the program in this embodiment includes information that is used for processing by an electronic computer and that conforms to the program (data that is not a direct command to the computer but has a property that defines the processing of the computer).

また、この形態では、コンピュータ上で所定のプログラムを実行させることにより、本装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 In this embodiment, the present apparatus is configured by executing a predetermined program on a computer. However, at least a part of these processing contents may be realized by hardware.

１００、３００遷移確率算出装置１１０細分化部
１２０集計値情報部１３０細分確率計算部
１４０統合部１９０記録部
２００集計値算出装置２５０区間設定部
２６０集計値更新部９００データベース
100, 300 Transition probability calculation device 110 Subdivision unit 120 Total value information unit 130 Subdivision probability calculation unit 140 Integration unit 190 Recording unit 200 Total value calculation device 250 Section setting unit 260 Total value update unit 900 Database

Claims

A transition probability calculation device for obtaining a probability of transition of the attribute value when concealing data in a database having a plurality of records including attribute values taking continuous values,
A _vu is a probability density function in which the attribute value v transitions to the attribute value u, J is a range from a value G _min to a value G _max that is a part of a value range that the attribute value can take, and K is a value range that the attribute value can take A _JK is a transition probability that an attribute value included in the section J transitions to an attribute value included in the section K, M is an integer of 2 or more, m is an integer of 0 to M−1, j ₀ ,. , J _M−1 is a section obtained by subdividing section J into M pieces, N _J is the number of records whose attribute value is included in section J, N _jm is the number of records whose attribute value is included in section j _m , and g ₀ Is a value G _min , g _M is a value G _max , and when m is 1 or more and M−1 or less, g _m is an attribute value that delimits the section j _m−1 and the section j _m, and g ₀ <g ₁ <.・・ <G _M
Subdividing the section J into M pieces, and subdividing sections for obtaining the subdivided sections j ₀ ,..., J _M−1 ;
A total value information part for _{obtaining the} number N _J and the numbers N _j0 ,..., N _jM−1 ;
The subdivision probability _{p m}

Subdivided probability calculation unit to be obtained as follows,
Transition probability A _JK

And the integration department
A transition probability calculation device comprising:

The transition probability calculation device according to claim 1,
The attribute value range of the section K is from the minimum value H _min to the maximum value H _max , the attribute value range is from the minimum value V _min to the maximum value V _max , and the probability density function A _vu is a finite noise function based on Laplace distribution. 2σ ² is the dispersion of the Laplace distribution,
The subdivision probability calculation unit, said subdivision probability p _m,

However,

The transition probability calculation device characterized by obtaining like this.

The transition probability calculation device according to claim 1 or 2,
The total value information part is
The number _N j0 from the database, ... _to obtain the _{N jM-1,} the number _{_{_{N J N J = N j0 +}}} ... + N jM-1
The transition probability calculation device characterized by obtaining like this.

The transition probability calculation device according to claim 1 or 2,
The range of the attribute value is a minimum value V _min to a maximum value V _max , and the total number of records is N _ALL ,
The total value information part is
The number N _J and the number N _j0 ,..., N _jM−1 are

The transition probability calculation device according to claim 1 or 2,
The range of the attribute value is a minimum value V _min to a maximum value V _max , and the total number of records is N _ALL ,
The total value information part is
Said number _N j0 using the distribution of confidential attribute value, _..., the determined _{N jM-1,} the number _{_{_{N J N J = N j0 +}}} ... + N jM-1
The transition probability calculation device characterized by obtaining like this.

A total value calculation device comprising the transition probability calculation device according to claim 4 or 5,
further,
A section setting unit for setting the section J and the section K;
From the already calculated total value and transition probability, there is also a total value update unit that calculates a new total value,
The section setting unit sequentially sets the section J so that all the ranges of the attribute values are set, and subdivides the section J in which the subdivision section is set to obtain a subdivided section, and the aggregate value The information part calculates the number of records whose attribute values are included in the subdivided section in each setting and sets it as the aggregate value of the subdivided section,
The section setting unit sequentially sets the section J and the section K so that all combinations of the attribute values are set, and subdivides the section J set by the subdivision section into subdivided sections. The subdivision probability calculation unit calculates subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and the integration unit calculates the aggregated values of the subdivided sections and the subdivision probabilities p. ₀ ,..., P _M−1 are used to determine the transition probability A _JK ,
The aggregate value updating unit uses the transition probabilities A _JK of the sections J and K that are sequentially set so that combinations of all the range of the attribute values are set as each element of the transition probability matrix, and an inverse matrix method Alternatively, an aggregate value calculation apparatus characterized by updating the number of records whose attribute values are included in a segment subdivided by an iterative Bayes technique, and obtaining a new aggregate segment value.

The transition probability calculation device according to claim 4 or 5,
further,
A section setting unit for setting the section J and the section K;
From the already calculated total value and transition probability, there is also a total value update unit that calculates a new total value,
The section setting unit sequentially sets the section J so that all the ranges of the attribute values are set, and subdivides the section J in which the subdivision section is set to obtain a subdivided section, and the aggregate value The information part calculates the number of records whose attribute values are included in the subdivided section in each setting and sets it as the aggregate value of the subdivided section,
The section setting unit sequentially sets the section J and the section K so that all combinations of the attribute values are set, and subdivides the section J set by the subdivision section into subdivided sections. The subdivision probability calculation unit calculates subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and the integration unit calculates the aggregated values of the subdivided sections and the subdivision probabilities p. ₀ ,..., P _M−1 are used to determine the transition probability A _JK ,
The aggregate value updating unit uses the transition probabilities A _JK of the sections J and K that are sequentially set so that combinations of all the range of the attribute values are set as each element of the transition probability matrix, and an inverse matrix method Or, update the number of records whose attribute values are included in the segment refined by the iterative Bayesian method, and set it as the aggregate value of the new segment,
The said integration part calculates _| requires transition probability _AJK using the total value of the segment which the said total value update part calculated _| required. The transition probability calculation apparatus characterized by the above-mentioned.

A transition probability calculation method using a transition probability calculation device including a subdivision unit, a summary value information unit, a subdivision probability calculation unit, and an integration unit ,
A _vu is a probability density function in which the attribute value v transitions to the attribute value u, J is a range from a value G _min to a value G _max that is a part of a value range that the attribute value can take, and K is a value range that the attribute value can take A _JK is a transition probability that an attribute value included in the section J transitions to an attribute value included in the section K, M is an integer of 2 or more, m is an integer of 0 to M−1, j ₀ ,. , J _M−1 is a section obtained by subdividing section J into M pieces, N _J is the number of records whose attribute value is included in section J, N _jm is the number of records whose attribute value is included in section jm, and g ₀ is The values G _min and g _M are values G _max , and when m is 1 or more and M−1 or less, g _m is an attribute value that separates the section j _m−1 and the section j _m, and g ₀ <g ₁ <.・ <G _M
The subdivision section subdivides the section J into M pieces and obtains subdivided sections j ₀ ,..., J _M−1 ;
The aggregate value information section, said number _{N J} and the number _N j0, _..., and the aggregate value information determining the _{N jM-1,}
The sub-probability calculation unit calculates sub-probabilities p _m

Subdivided probability calculation step obtained as follows:
The integration unit _calculates the transition probability A _JK

The integration steps you want
A transition probability calculation method comprising:

6. A transition probability calculating apparatus according to claim 4; a section setting unit for setting the section J and the section K; and a total value updating unit for determining a new total value from the already calculated total value and the transition probability. A total value calculation method using the total value calculation device,
The section setting unit sequentially sets the section J so that all the ranges of the attribute values are set, and subdivides the section J in which the subdivision section is set to obtain a subdivided section, and the aggregate value The information part calculates the number of records whose attribute values are included in the subdivided section in each setting and sets it as the aggregate value of the subdivided section,
The section setting unit sequentially sets the section J and the section K so that all combinations of the attribute values are set, and subdivides the section J set by the subdivision section into subdivided sections. The subdivision probability calculation unit calculates subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and the integration unit calculates the aggregated values of the subdivided sections and the subdivision probabilities p. ₀ ,..., P _M−1 are used to determine the transition probability A _JK ,
The aggregate value updating unit uses the transition probabilities A _JK of the sections J and K that are sequentially set so that combinations of all the range of the attribute values are set as each element of the transition probability matrix, and an inverse matrix method Alternatively, the number of records whose attribute values are included in the segment subdivided by the iterative Bayesian method is updated to the aggregate value of the new subdivided segment, and the number of records in the subdivided segment included in the segment J is added to the segment J An aggregate value calculation method characterized by obtaining the number of records.

6. A transition probability calculating apparatus according to claim 4; a section setting unit for setting the section J and the section K; and a total value updating unit for determining a new total value from the already calculated total value and the transition probability. A transition probability calculation method using the transition probability calculation device,
The section setting unit sequentially sets the section J so that all the ranges of the attribute values are set, and subdivides the section J in which the subdivision section is set to obtain a subdivided section, and the aggregate value The information part calculates the number of records whose attribute values are included in the subdivided section in each setting and sets it as the aggregate value of the subdivided section,
The section setting unit sequentially sets the section J and the section K so that all combinations of the attribute values are set, and subdivides the section J set by the subdivision section into subdivided sections. The subdivision probability calculation unit calculates subdivision probabilities p ₀ ,..., P _M−1 for the set sections J and K, and the integration unit calculates the aggregated values of the subdivided sections and the subdivision probabilities p. ₀ ,..., P _M−1 are used to determine the transition probability A _JK ,
The aggregate value updating unit uses the transition probabilities A _JK of the sections J and K that are sequentially set so that combinations of all the range of the attribute values are set as each element of the transition probability matrix, and an inverse matrix method Or, update the number of records whose attribute value is included in the segment refined by the iterative Bayesian method to the aggregate value of the new segment,
The said integration part calculates _| requires transition probability _AJK using the total value of the subdivided area which the said total value update part updated. The transition probability calculation method characterized by the above-mentioned.