JP6693503B2 - Secret search system, server device, secret search method, search method, and program

Info

Publication number: JP6693503B2
Application number: JP2017501912A
Authority: JP
Inventors: 一真大原; 俊則荒木; 古川　潤; 潤古川
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2015-02-23
Filing date: 2016-02-17
Publication date: 2020-05-13
Anticipated expiration: 2036-02-17
Also published as: WO2016136201A1; JPWO2016136201A1

Description

The present invention relates to a search system for searching data, and more particularly to a secret search system that can conceal both the search condition and the stored data when the data is distributed across and held by a plurality of devices.

Services are known in which a user deposits information on an external server device, such as a cloud server. Such services generally conceal the data, for example by encryption, to prevent the deposited data from leaking. General concealment methods have the following problem. When the server device searches the concealed data for desired data, it can do so by restoring the concealed data on the server side. However, any method that involves restoration on the server device creates a risk of data leakage from the server device.

For this reason, various secret search technologies have been proposed that can search concealed data while it remains concealed.

Multi-party computation (MPC) is known as one technique for realizing a secret search (see, for example, Patent Documents 1 and 2).

In MPC, two or more server devices, each holding secret information, cooperate to compute the value of an arbitrary function that takes the secret information as input, without leaking any of that secret information. A secret search using MPC is realized as follows. First, the data to be deposited is distributed among the two or more server devices by a secret sharing scheme (see, for example, Non-Patent Document 1).
Then, the function is defined as, for example, "a function that returns 1 when data containing a given piece of partial data is secret-shared and stored in the server devices, and returns 0 otherwise."

One way of implementing MPC is Shamir's threshold secret sharing scheme (TSSS), described in Non-Patent Document 1. Non-Patent Document 1 is described first.

The threshold secret sharing scheme (TSSS) converts secret information into a plurality of shares (pieces of distributed information) and restores the secret information by collecting at least a threshold number of those shares. In a threshold secret sharing scheme, fewer shares than the threshold reveal nothing about the original secret information.

The threshold secret sharing scheme (TSSS) of Non-Patent Document 1 secretly distributes a number a belonging to the finite field Z_p among N server devices, using a polynomial f_a(x) of degree k−1. In this scheme, for example, f_a(0) = a, and the point f_a(i) on the polynomial is distributed to the i-th server device (1 ≦ i ≦ N). The distributed value f_a(i) is called the share (distributed information) of a in the TSSS. When k server devices cooperate, the polynomial f_a(x) of degree k−1 can be uniquely reconstructed from the k points on it, so the secret information f_a(0) can be obtained.
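The scheme described above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions: the function names `share` and `reconstruct`, the choice of prime, and the use of the standard `secrets` module for randomness are not taken from the patent or from Non-Patent Document 1.

```python
import secrets

def share(secret, k, n, p):
    """Split `secret` into n shares over Z_p; any k of them reconstruct it."""
    # Random polynomial f of degree k-1 with constant term f(0) = secret.
    coeffs = [secret % p] + [secrets.randbelow(p) for _ in range(k - 1)]

    def f(x):
        return sum(c * pow(x, j, p) for j, c in enumerate(coeffs)) % p

    # Server i (1 <= i <= n) receives the point (i, f(i)).
    return [(i, f(i)) for i in range(1, n + 1)]

def reconstruct(points, p):
    """Recover f(0) from k points by Lagrange interpolation over Z_p."""
    total = 0
    for xi, yi in points:
        num, den = 1, 1
        for xj, _ in points:
            if xj != xi:
                num = num * (-xj) % p
                den = den * (xi - xj) % p
        # pow(den, -1, p) is the modular inverse (Python 3.8+).
        total = (total + yi * num * pow(den, -1, p)) % p
    return total

P = 2**61 - 1  # a Mersenne prime, chosen here only for illustration
shares = share(1234, k=3, n=5, p=P)
recovered = reconstruct(shares[:3], P)  # any 3 of the 5 shares suffice
```

Any other subset of three shares, for example `shares[2:]`, yields the same value.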

The shares of secret information a that are generated when a is distributed among N server devices using the polynomial f_a(x) modulo p are written [a]_p = (f_a(1), f_a(2), …, f_a(N)). Here, f_a(i) is assumed to be held by the i-th server device, which has the identifier i (1 ≦ i ≦ N).

Non-Patent Document 2 discloses an MPC method in which a plurality of server devices holding shares generated by the threshold secret sharing scheme (TSSS) of Non-Patent Document 1 perform arithmetic operations through cooperative computation, without restoring the secret information. The method of Non-Patent Document 2 is described next.

In the method of Non-Patent Document 1, the polynomial f_(a+b)(x) that distributes the secret information a+b can be written as the sum of the polynomial f_a(x) that distributes the secret information a and the polynomial f_b(x) that distributes the secret information b. That is, f_(a+b)(x) = f_a(x) + f_b(x) mod p. Because of this property, each server device with identifier i can compute its share of a+b from its share of a and its share of b on its own. That is, by individually computing f_(a+b)(i) = f_a(i) + f_b(i) mod p, the server devices can hold the sum of the secret information in secret-shared form without any communication among them.
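The additive property can be checked with a small Python sketch. The modulus, the secrets, and the fixed polynomial coefficients below are illustrative assumptions (in practice the non-constant coefficients would be chosen at random), and the helper names are hypothetical.

```python
P = 101  # small prime modulus, for illustration only

def evaluate(coeffs, x, p=P):
    """Evaluate the polynomial with the given coefficients at x over Z_p."""
    return sum(c * pow(x, j, p) for j, c in enumerate(coeffs)) % p

def interpolate_at_zero(points, p=P):
    """Lagrange interpolation at x = 0 over Z_p."""
    total = 0
    for xi, yi in points:
        term = yi
        for xj, _ in points:
            if xj != xi:
                term = term * (-xj) * pow((xi - xj) % p, -1, p) % p
        total = (total + term) % p
    return total

a, b = 42, 77
f_a = [a, 13]  # f_a(x) = a + 13x (degree k-1 = 1, so k = 2)
f_b = [b, 58]  # f_b(x) = b + 58x
servers = [1, 2, 3]

shares_a = {i: evaluate(f_a, i) for i in servers}
shares_b = {i: evaluate(f_b, i) for i in servers}

# Each server adds its two shares locally; no communication is needed.
shares_sum = {i: (shares_a[i] + shares_b[i]) % P for i in servers}

# Any k = 2 servers can reconstruct a + b mod p from the summed shares.
recovered = interpolate_at_zero([(1, shares_sum[1]), (3, shares_sum[3])])
```

Here `recovered` equals (a + b) mod P, even though no server ever saw a or b.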

Similarly, the product of secret information can also be held in secret-shared form. Computing a product, however, requires N(N−1) communications.

Non-Patent Document 2 combines these methods of secret-sharing sums and products of secret information to compute any function that can be expressed with addition and multiplication.

As described above, the method of Non-Patent Document 2 can compute an arbitrary function, and therefore can realize a character string search. A secret search using the method of Non-Patent Document 1 is realized as follows.

The client device requesting the search generates shares of the search request data s and sends them to the server devices. From the shares [t'_1]_p, …, [t'_l]_p of the partial information of the data t that they hold, and the shares [s]_p of the search request data s, the N server devices compute [s−t'_1]_p, [s−t'_2]_p, …, [s−t'_l]_p using the multi-party computation (MPC) of Non-Patent Document 2. Note that when some shared data matches the search request data, the corresponding difference is 0. The N server devices then share the shares of a random number, mask the difference information by means of the product computation on secret information described in Non-Patent Document 1, and output the masked difference information as the shares of the search result.
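The masked-difference idea can be illustrated with the following Python sketch. It is a simplification under explicit assumptions, not the protocol of the cited documents: instead of an interactive product protocol, it multiplies shares pointwise, which yields a polynomial of degree 2(k−1), and reconstructs from 2k−1 servers. All names, the prime, and the parameters (N = 3, k = 2) are hypothetical.

```python
import secrets

P = 2**61 - 1  # prime modulus, for illustration only

def share(value, k, n, p=P):
    """Return [f(1), ..., f(n)] for a random degree-(k-1) f with f(0) = value."""
    coeffs = [value % p] + [secrets.randbelow(p) for _ in range(k - 1)]
    return [sum(c * pow(x, j, p) for j, c in enumerate(coeffs)) % p
            for x in range(1, n + 1)]

def interpolate_at_zero(points, p=P):
    """Lagrange interpolation at x = 0 over Z_p."""
    total = 0
    for xi, yi in points:
        term = yi
        for xj, _ in points:
            if xj != xi:
                term = term * (-xj) * pow((xi - xj) % p, -1, p) % p
        total = (total + term) % p
    return total

def masked_difference(s, t, n=3, k=2):
    """Return r * (s - t) mod P for a random nonzero r, computed on shares."""
    sh_s, sh_t = share(s, k, n), share(t, k, n)
    r = 1 + secrets.randbelow(P - 1)  # random nonzero mask
    sh_r = share(r, k, n)
    # Local step: each server subtracts its shares.
    sh_d = [(ys - yt) % P for ys, yt in zip(sh_s, sh_t)]
    # Simplified product: pointwise multiplication gives a degree-2(k-1)
    # polynomial whose constant term is r * (s - t); needs n >= 2k - 1 points.
    sh_m = [(yd * yr) % P for yd, yr in zip(sh_d, sh_r)]
    return interpolate_at_zero(list(enumerate(sh_m, start=1)))

match = masked_difference(5, 5)     # 0: the query equals the stored value
mismatch = masked_difference(5, 6)  # a nonzero value that hides s - t
```

Because P is prime and the mask is nonzero, the output is 0 exactly when s equals t; otherwise it is a uniformly random nonzero element that reveals nothing further about the difference.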

Non-Patent Document 3, on the other hand, discloses a method of reducing the size of the shares of random numbers. Non-Patent Document 3 uses a replicated secret sharing scheme (RSSS).

First, the replicated secret sharing scheme (RSSS) described in Non-Patent Document 3 is explained. RSSS secretly distributes an integer b belonging to the finite field Z_q among N server devices, and distributes the secret information as follows.

Select N−1 random numbers b_1, b_2, …, b_(N−1), and set b_N = b − (b_1 + b_2 + … + b_(N−1)) mod q (q is a prime). Note that adding up all of b_1, b_2, …, b_(N−1), b_N gives back the original secret information b. By assigning b_1, b_2, …, b_(N−1), b_N to the server devices appropriately, the secret information can be restored only when a plurality of server devices cooperate so that all of b_1, b_2, …, b_(N−1), b_N are available. Which combinations of server devices can restore the secret information can be designed freely through the way b_1, b_2, …, b_(N−1), b_N are assigned.

For example, an RSSS with three server devices in which the secret information b can be restored when any two server devices cooperate can be realized by generating b_1, b_2, b_3 such that b = b_1 + b_2 + b_3, and then assigning two values to each server device as (b_1, b_2), (b_2, b_3), (b_3, b_1).

The shares of b that are generated when the secret information b is distributed among N server devices by RSSS, using b_1, b_2, …, b_N such that b = b_1 + b_2 + … + b_N mod q, are written <b>_q = (v_1, v_2, …, v_N). Here, v_i is assumed to be held by the server device with the identifier i (1 ≦ i ≦ N). In the above example of an RSSS that any two of the three server devices can decode, v_1 = (b_1, b_2), v_2 = (b_2, b_3), and v_3 = (b_3, b_1).
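The three-server example can be sketched in Python as follows. The function names, the small prime, and the dictionary representation of each v_i (mapping a piece index to its value) are illustrative assumptions.

```python
import secrets

Q = 257  # small prime modulus, for illustration only

def rsss_share(b, q=Q):
    """2-out-of-3 replicated sharing: b = b_1 + b_2 + b_3 mod q."""
    b1 = secrets.randbelow(q)
    b2 = secrets.randbelow(q)
    b3 = (b - b1 - b2) % q
    # v_1 = (b_1, b_2), v_2 = (b_2, b_3), v_3 = (b_3, b_1),
    # stored as {piece index: value} so that pieces can be merged.
    return [{1: b1, 2: b2}, {2: b2, 3: b3}, {3: b3, 1: b1}]

def rsss_reconstruct(v_a, v_b, q=Q):
    """Restore b from the shares of any two distinct servers."""
    pieces = {**v_a, **v_b}
    if len(pieces) != 3:
        raise ValueError("the two shares do not cover b_1, b_2 and b_3")
    return sum(pieces.values()) % q

v1, v2, v3 = rsss_share(200)
recovered = rsss_reconstruct(v1, v3)
```

Each server holds only two of the three additive pieces, so a single server learns nothing about b, while any pair of servers covers all three pieces.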

Non-Patent Document 3 describes a method in which the server devices share RSSS shares of a random number in advance and then use those shares to generate SSS shares of pseudorandom numbers through computation that involves no communication. With this method, SSS shares of random numbers can be generated as needed at computation time, so they do not have to be distributed in advance. The size of the random-number shares that must be held is therefore reduced.

Patent Document 1: JP 2007-114494 A
Patent Document 2: JP 2012-024182 A

Non-Patent Document 1: Adi Shamir, "How to Share a Secret," Commun. ACM 22 (11), pp. 612-613, 1979.
Non-Patent Document 2: Michael Ben-Or, Shafi Goldwasser and Avi Wigderson, "Completeness Theorems for Non-Cryptographic Fault-Tolerant Distributed Computation (Extended Abstract)," Proceedings of the 20th Annual ACM Symposium on Theory of Computing, 1988.
Non-Patent Document 3: Ronald Cramer, Ivan Damgard, Yuval Ishai, "Share Conversion, Pseudorandom Secret-Sharing and Applications to Secure Computation," Theory of Cryptography, Second Theory of Cryptography Conference (TCC), pp. 342-362, 2005.

The method using Non-Patent Document 1 has the following problem. Suppose that, for n-symbol data T = (t_1, t_2, …, t_n), searches for search request data s = (s_1, …, s_m) of arbitrary length (where n > m) are to be supported. To execute the search described above, each server device must hold SSS shares for every piece of partial data of T. Specifically, the method using Non-Patent Document 1 must hold shares of (t_i, …, t_j) for every 1 ≦ i < j ≦ n. Consequently, in the method using Non-Patent Document 1, the size of the shares that each server device must hold becomes very large relative to the original data (t_1, t_2, …, t_n).
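To give a feel for this growth, the following Python sketch counts the index pairs (i, j) with 1 ≦ i < j ≦ n, which is n(n−1)/2, and compares that with the n shares needed when one share is held per symbol. The function name and the sample length are illustrative assumptions; the count ignores that longer pieces of partial data also need larger shares.

```python
def substring_share_count(n):
    """Number of index pairs (i, j) with 1 <= i < j <= n."""
    return sum(1 for i in range(1, n + 1) for j in range(i + 1, n + 1))

n = 1000
per_substring = substring_share_count(n)  # one share per (t_i, ..., t_j)
per_symbol = n                            # one share per symbol t_i
ratio = per_substring / per_symbol
```

For n = 1000 the pair count is 499500, so the number of stored shares grows quadratically in n rather than linearly.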

The method of Non-Patent Document 3, on the other hand, reduces the size of the shares of random numbers, not of arbitrarily chosen data. For the purpose of secret search, the data to be distributed is the secret information of the users of the secret search system, not random numbers. The method of Non-Patent Document 3 therefore cannot be applied to secret search as it is.

An object of the present invention is to provide a server device, a search method, and the like that, in a secret search system using a secret sharing scheme, reduce the size of the data (shares) held by each server device without impairing the search function.

A server device of the present invention includes: a data storage unit that stores distributed registration data for each symbol of secret information; a data conversion unit that converts the distributed registration data stored in the data storage unit into search data for data in which a plurality of symbols are concatenated; and a data search unit that, using the search data and distributed search request data, searches the distributed registration data in the data storage unit while communicating with the data search units of other server devices, and outputs a distributed search result.

A search method of the present invention is a search method for executing a search on a server device, and includes: storing distributed registration data for each symbol of secret information; converting the stored distributed registration data into search data for data in which a plurality of symbols are concatenated; and, using the search data and distributed search request data, communicating with other server devices, searching the distributed registration data, and outputting a distributed search result.

A program of the present invention is a search program that causes a computer to: store distributed registration data for each symbol of secret information; convert the stored distributed registration data into search data for data in which a plurality of symbols are concatenated; and, using the search data and distributed search request data, communicate with other server devices, search the distributed registration data, and output a distributed search result.

According to the present invention, the size of the data (shares) stored in each server device can be reduced.

A block diagram showing the configuration of the secret search system according to the first embodiment.
A block diagram showing the configuration of the server device in the secret search system of the first embodiment.
A block diagram showing the configuration of the client device in the secret search system of the first embodiment.
A flowchart showing the processing at the time of data registration in the secret search system of the first embodiment.
A flowchart showing the processing at the time of data search in the secret search system of the first embodiment.
A block diagram showing the configuration of the secret search system according to the second embodiment.
A block diagram showing the configuration of the server device in the secret search system of the second embodiment.
A block diagram showing the configuration of the client device in the secret search system of the second embodiment.
A flowchart showing the processing at the time of data registration in the secret search system of the second embodiment.
A flowchart showing the processing at the time of data search in the secret search system of the first embodiment.

First, an outline of the embodiments of the present invention will be described.
[Outline of Embodiment]

In the method according to the related art, each server device had to hold shares of every searchable substring. In the present embodiment, by contrast, when data of length m symbols is input as a search, shares of the secret information in which m symbols are concatenated are generated from the secret information that was distributed symbol by symbol.

There are two ways of holding the secret information symbol by symbol so that these shares can be generated: a method using the replicated secret sharing scheme (first method) and a method using Shamir's threshold secret sharing scheme (second method).

In the first method, to convert the per-symbol shares of the replicated secret sharing scheme into shares of m concatenated symbols, a technique for converting shares of the replicated secret sharing scheme (RSSS) into shares of Shamir's threshold secret sharing scheme (TSSS) is used. Because the size of the shares grows in this conversion, the original secret information may not be restored correctly. To correct for this, multi-party computation (MPC) that compares magnitudes without decoding the shares is used.

In the method that distributes the secret information to be searched by the threshold secret sharing scheme (TSSS) (second method), shares are concatenated by treating shares generated by a TSSS over a small field as shares of a TSSS over an extension field.

The secret search system of the present embodiment consists of N server devices and one client device. The server device of the present embodiment is one computing device within the secret search system.

In other words, in the present embodiment, the data stored in each server device is kept as per-symbol shares. The search is then performed after a preprocessing step that generates the shares corresponding to the partial data needed for that search. This reduces the size of the data (shares) stored in each server device.

The secret search system of the present embodiment can search whether data containing partial data specified by the user exists among a plurality of pieces of data distributed across and held by N server devices, without decoding the data and without revealing the user-specified data to the servers.

In this case, the amount of shared data held by each server device is, for example, about 12 times the amount of the data before distribution under the first method. Compared with an ordinary secret search using Non-Patent Document 2, the embodiments described later have an equivalent computation cost except for the data conversion process, and each server device can carry out the data conversion process on its own. The secret search system of the present embodiment therefore needs no communication for the data conversion process and can execute it at high speed.

In the related art, when searching for data of arbitrary length, an attempt to reduce the number of communications between server devices by performing the secure computation on multiple symbols at once requires holding shares for partial data of every length, which increases the size of the data that must be held.

In the present embodiment, by contrast, each server device holds partial data for each symbol of the secret information. When partial information of m-symbol search request data is input, each server device generates the shares corresponding to m symbols of partial data of the secret information through processing that involves no communication. The present embodiment can therefore reduce the number of communications without increasing the amount of shared data held in advance.

With the present embodiment, data can be distributed across a plurality of server devices, and arbitrary data can be searched for while the data remains hidden from each server device. When a service entrusts secret data to external server devices, this prevents an administrator of a server device from stealing the data. That is, unless a plurality of administrators collude, no administrator can decode the secret information in the server devices, which contributes to protecting the secret information of service users.

The present embodiment has the effect of, for example, reducing the data size to approximately 1/80 to 1/1000 of that of the related art.

Embodiments of the present invention will be described in detail with reference to the drawings. In the drawings showing the configurations of the embodiments, the directions of the arrows are examples and do not limit the directions of signals between blocks.
[First Embodiment]
A secret search system according to a first embodiment of the present invention will be described with reference to FIGS. 1 to 3.

[Description of Configuration]
FIG. 1 is a block diagram showing the configuration of the secret search system according to the first embodiment of the present invention.

Referring to FIG. 1, the secret search system according to the first embodiment of the present invention consists of N server devices 100_1, 100_2, …, 100_N (N is an integer of 2 or more) and a client device 200. The server devices 100_1 to 100_N are also referred to here as the first to N-th server devices, respectively. The client device 200 communicates with the N server devices 100_1 to 100_N. The N server devices 100_1 to 100_N also communicate with one another.

FIG. 2 is a block diagram showing the configuration of the n-th server device 100_n (1 ≦ n ≦ N). The n-th server device 100_n includes an n-th data storage unit 101_n, an n-th data conversion unit 102_n, and an n-th data search unit 103_n.

The n-th data storage unit 101_n receives n-th distributed registration data 104_n from the client device 200, described later, and stores it. At search time, the n-th data storage unit 101_n outputs the stored n-th distributed registration data to the n-th data conversion unit 102_n. The n-th data conversion unit 102_n converts the n-th distributed registration data stored in the n-th data storage unit 101_n into n-th shares for search (n-th search data) 105_n.

Using the n-th search data 105_n received from the n-th data conversion unit 102_n and n-th distributed search request data 106_n received from the client device 200, the n-th data search unit 103_n searches the n-th distributed registration data in the n-th data storage unit 101_n while communicating with the other data search units 103_1 to 103_N (excluding 103_n), and outputs an n-th distributed search result 107_n.

FIG. 3 is a block diagram showing the configuration of the client device 200. The client device 200 includes a registration data share generation unit 201, a query data share generation unit 202, and a secret sharing decoding unit 203.

The registration data share generation unit 201 receives registration data 204 from an input device (not shown). The registration data share generation unit 201 applies a secret sharing scheme to the registration data 204 to generate first to N-th distributed registration data 104_1, …, 104_N. The registration data share generation unit 201 transmits the n-th distributed registration data 104_n (1 ≦ n ≦ N) to the n-th server device 100_n.

The query data share generation unit 202 receives search request data 205 from an input device (not shown). The query data share generation unit 202 applies a secret sharing scheme to the search request data 205 to generate first to N-th distributed search request data 106_1, …, 106_N. The query data share generation unit 202 transmits the n-th distributed search request data 106_n (1 ≦ n ≦ N) to the n-th server device 100_n.

The secret sharing decoding unit 203 receives the n-th distributed search result 107_n from the n-th server device 100_n (1 ≦ n ≦ N). The secret sharing decoding unit 203 applies the secret sharing scheme to the first to N-th distributed search results 107_1, …, 107_N to restore a search result 206.

[Description of Operation]
Next, the operation of the secret search system according to the first embodiment of the present invention will be described in detail with reference to FIGS. 1 to 5.

The secret search system according to the first embodiment of the present invention performs two types of processing: (1) data registration processing and (2) data search processing. FIGS. 4 and 5 are flowcharts showing the data registration processing and the data search processing, respectively.

(Data registration processing)
FIG. 4 is a flowchart showing the operation of the data registration processing of the secret search system according to the first embodiment of the present invention.

新規データを第１乃至第Ｎのサーバ装置１００_１、…、１００_Ｎに登録するときには、以下のようにする。 When registering new data in the first to Nth server devices 100_1, ..., 100_N, the following is performed.

秘匿検索システムは、新規に分散したい登録データ２０４（ｔ_１，…，ｔ_ｎ）を、クライアント装置２００の登録データシェア生成部２０１に入力する。ただし、ｔ_１，…，ｔ_ｎはそれぞれデータの１シンボルに対応するものであり、それぞれのサイズはｌｏｇｑとする（ｑは素数、底は２）（ステップＳ１０１）。なお、以降のｌｏｇも底は２であるが、省略してｌｏｇとして記載する。 The confidential search system inputs the registration data 204 (t_1, ..., t_n) to be newly distributed into the registration data share generation unit 201 of the client device 200 (step S101). Here, t_1, ..., t_n each correspond to one symbol of the data, and each has a size of log q (q is a prime number and the logarithm is base 2). All subsequent logarithms are also base 2, and the base is omitted.

登録データシェア生成部２０１は、登録データ２０４をＲＳＳによって第ｎの分散登録データ１０４_ｎ（１≦ｎ≦Ｎ）に変換する（ステップＳ１０２）。 The registration data share generation unit 201 converts the registration data 204 into the nth distributed registration data 104_n (1 ≦ n ≦ N) by RSS (step S102).

詳述すると、まず、登録データシェア生成部２０１は、１≦ｉ≦ｎについて、ｔ_ｉ＝ｔ_｛ｉ，１｝＋ｔ_｛ｉ，２｝＋…＋ｔ_｛ｉ，Ｎ｝ｍｏｄｑとなるようなｔ_｛ｉ，１｝，…，ｔ_｛ｉ，Ｎ｝を生成し、適当な組み合わせで各サーバ装置に割り当てることによって、各シンボルｉ（１≦ｉ≦ｎ）について、＜ｔ_ｉ＞ｑ＝（（ｔ_｛１，１｝，…，ｔ_｛Ｎ，１｝），（ｔ_｛１，２｝，…，ｔ_｛Ｎ，２｝），…，（ｔ_｛１，Ｎ｝，…，ｔ_｛Ｎ，Ｎ｝））を生成する。 More specifically, for each 1 ≦ i ≦ n, the registration data share generation unit 201 first generates t_{i,1}, ..., t_{i,N} such that t_i = t_{i,1} + t_{i,2} + ... + t_{i,N} mod q. By assigning these to the server devices in an appropriate combination, it generates, for each symbol i (1 ≦ i ≦ n), <t_i>q = ((t_{1,1}, ..., t_{N,1}), (t_{1,2}, ..., t_{N,2}), ..., (t_{1,N}, ..., t_{N,N})).

また、登録データシェア生成部２０１は、（ｎ−１）ｑ ≦ ｔ_ｉ＜ｑであればｂ_ｉ＝ｎとし、ｂ_ｉのＳＳＳによる分散情報［ｂ_ｉ］_ｐ＝（ｂ_｛ｉ，１｝，ｂ_｛ｉ，２｝，…，ｂ_｛ｉ，Ｎ｝）を生成する。このとき，ｔ_｛ｉ，１｝＋ｔ_｛ｉ，２｝＋…＋ｔ_｛ｉ，Ｎ｝＝ｔ_ｉ＋ｂ_ｉ・ｑとなることに注意する。 Further, the registration data share generation unit 201 sets b_i = n if (n-1)q ≦ t_i < q, and generates the SSS shared information [b_i]_p = (b_{i,1}, b_{i,2}, ..., b_{i,N}) of b_i. Note that, at this time, t_{i,1} + t_{i,2} + ... + t_{i,N} = t_i + b_i·q.

第ｎの分散登録データ１０４_ｎは，ｔ_ｉの分散情報（ｔ_｛１，ｎ｝，…，ｔ｛Ｎ，ｎ｝）とｂ_ｉの分散情報ｂ_｛ｉ，ｎ｝の組とする。 The n-th distributed registration data 104_n is a set of distributed information (t_ {1, n}, ..., t {N, n}) of t_i and distributed information b_ {i, n} of b_i.
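The relation t_{i,1} + ... + t_{i,N} = t_i + b_i·q noted above can be illustrated with a minimal sketch. It assumes plain additive sharing modulo q and omits the replicated assignment of share combinations to servers; the function names `additive_share` and `wrap_count` and the small modulus are hypothetical and chosen only for illustration.

```python
import secrets

def additive_share(t, q, n_servers):
    """Split t (0 <= t < q) into n_servers additive shares modulo q."""
    shares = [secrets.randbelow(q) for _ in range(n_servers - 1)]
    shares.append((t - sum(shares)) % q)
    return shares

def wrap_count(t, shares, q):
    """Return b such that sum(shares) == t + b*q over the integers."""
    return (sum(shares) - t) // q

q = 257                      # small prime, for illustration only
t = 99                       # one symbol of the registration data
shares = additive_share(t, q, 3)
b = wrap_count(t, shares, q)

assert sum(shares) % q == t          # the shares reconstruct t modulo q
assert sum(shares) == t + b * q      # the integer sum overshoots by b*q
assert 0 <= b <= 2                   # with 3 servers, b is 0, 1 or 2
```

With three servers the integer sum of the shares is t, t + q, or t + 2q, which is why the wrap count b has to be shared alongside the symbol itself.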

登録データシェア生成部２０１は、第ｎの分散登録データ１０４_ｎを第ｎのサーバ装置１００_ｎ（１≦ｎ≦Ｎ）に送信する（ステップＳ１０３）。 The registration data share generation unit 201 transmits the nth distributed registration data 104_n to the nth server device 100_n (1 ≦ n ≦ N) (step S103).

（ステップＳ１０４）第ｎのサーバ装置１００_ｎ（１≦ｎ≦Ｎ）は受信した第ｎの分散登録データ１０４_ｎを第ｎのデータ記憶部１０１_ｎに保存する。 (Step S104) The nth server device 100_n (1 ≦ n ≦ N) stores the received nth distributed registration data 104_n in the nth data storage unit 101_n.

（データ検索処理） (Data search process)
図５は、本発明の第１の実施形態における秘匿検索システムのデータ検索時の動作を示すフローチャートである。 FIG. 5 is a flowchart showing an operation at the time of data search of the confidential search system according to the first embodiment of the present invention.

第１乃至第Ｎのサーバ装置１００_１、…、１００_Ｎに分散されたデータＴ＝（ｔ_１，ｔ_２，…，ｔ_ｎ）の組に関して、ある部分データ（ｓ_１，…，ｓ_ｍ）を含むデータが存在するかどうかを検索したいときには次のようにする。 To search whether the set of data T = (t_1, t_2, ..., t_n) distributed across the first to Nth server devices 100_1, ..., 100_N contains data that includes certain partial data (s_1, ..., s_m), the following is performed.

秘匿検索システムは、検索を行いたいデータである検索要求データ２０５（ｓ_１，…，ｓ_ｍ）を、クライアント装置２００のクエリデータシェア生成部２０２に入力する（ステップＳ２０１）。ｓ_１，…，ｓ_ｍは検索要求データの各文字を表すものであり、それぞれのサイズはｌｏｇｑである。 The confidential search system inputs the search request data 205 (s_1, ..., s_m), which is the data to be searched, to the query data share generation unit 202 of the client device 200 (step S201). s_1, ..., s_m represent each character of the search request data, and the size of each is logq.

クエリデータシェア生成部２０２は、非特許文献１のＴＳＳＳの方法で、下記数１のような形の第１乃至第Ｎの分散検索要求データ（１０６_１、…、１０６_Ｎ）［ｓ_１ｓ_２ … ｓ_ｍ］_ｐを生成する（ステップＳ２０２）。 Using the TSSS method of Non-Patent Document 1, the query data share generation unit 202 generates the first to Nth distributed search request data (106_1, ..., 106_N) [s_1 s_2 ... s_m]_p in the form shown in Expression 1 below (step S202).

そして、クエリデータシェア生成部２０２は、第ｎの分散検索要求データ１０６_ｎを第ｎのサーバ装置１００_ｎの第ｎのデータ検索部１０３_ｎに送信する。ただし、ｌｏｇｐ＝ｌｏｇｑ×ｍとする（ｐ、ｑは素数、ｍは任意の正の整数でｍ＞ｎ）。 Then, the query data share generation unit 202 transmits the nth distributed search request data 106_n to the nth data search unit 103_n of the nth server device 100_n. However, logp = logq × m (p and q are prime numbers, m is an arbitrary positive integer, and m> n).

第ｎのサーバ装置１００_ｎ（１≦ｎ≦Ｎ）は、第ｎのデータ記憶部１０１_ｎから第ｎの分散登録データを読み出し、第ｎのデータ変換部１０２_ｎに送信する（ステップＳ２０３）。分散登録データはｎ文字のテキストそれぞれについて、＜ｔ_１＞ｑ，…，＜ｔ_ｎ＞ｑの形で分散されていることに注意する。 The nth server device 100_n (1 ≦ n ≦ N) reads the nth distributed registration data from the nth data storage unit 101_n and sends it to the nth data conversion unit 102_n (step S203). Note that the distributed registration data is distributed in the form of <t_1> q, ..., <t_n> q for each of the n character texts.

第ｎのデータ変換部１０２_ｎはまず、第ｎのデータ記憶部１０１_ｎから受けたＲＳＳの分散データ＜ｔ_１＞ｑ，…，＜ｔ_ｎ＞ｑを、非特許文献３に記載の方法でＳＳＳの分散情報［ｔ＊_１］_ｐ，…，［ｔ＊_ｎ］_ｐに変換する（ステップＳ２０４）。 First, the nth data conversion unit 102_n converts the RSS distributed data <t_1>q, ..., <t_n>q received from the nth data storage unit 101_n into the SSS shared information [t*_1]_p, ..., [t*_n]_p by the method described in Non-Patent Document 3 (step S204).

このとき，ｔ＊_ｉ＝ｔ_ｉ＋ｂ_ｉ・ｑとなっているので，以下の計算によってｂ_ｉに補正する。 At this time, t*_i = t_i + b_i·q, so the following calculation uses b_i to correct it.

第ｎのデータ変換部１０２_ｎは、各ｔ_ｉについて，［ｔ’_ｉ］_ｐ＝［ｔ＊_ｉ］_ｐ−ｑ×［ｂ_ｉ］_ｐを計算することによって，分散情報［ｔ’_１］_ｐ，［ｔ’_２］_ｐ，…［ｔ’_ｎ］_ｐを生成する。 The nth data conversion unit 102_n generates the shared information [t'_1]_p, [t'_2]_p, ..., [t'_n]_p by calculating [t'_i]_p = [t*_i]_p - q × [b_i]_p for each t_i.
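The correction [t'_i]_p = [t*_i]_p - q × [b_i]_p needs no communication because it is a linear operation on shares. The following is a minimal sketch, assuming a Shamir-style threshold scheme over a prime field; the helper names, the evaluation points 1..N, and the Mersenne prime 2^89 - 1 used as the modulus are illustrative assumptions, not values taken from this document.

```python
import secrets

P = 2**89 - 1  # a Mersenne prime standing in for the modulus p (assumption)

def shamir_share(secret, n_servers, threshold, p=P):
    """Return n_servers points (x, f(x)) of a random polynomial with f(0) = secret."""
    coeffs = [secret % p] + [secrets.randbelow(p) for _ in range(threshold - 1)]
    return [(x, sum(c * pow(x, k, p) for k, c in enumerate(coeffs)) % p)
            for x in range(1, n_servers + 1)]

def shamir_reconstruct(shares, p=P):
    """Lagrange interpolation at x = 0 (pow with exponent -1 needs Python 3.8+)."""
    total = 0
    for xi, yi in shares:
        num, den = 1, 1
        for xj, _ in shares:
            if xj != xi:
                num = num * (-xj) % p
                den = den * (xi - xj) % p
        total = (total + yi * num * pow(den, -1, p)) % p
    return total

def correct_locally(star_share, b_share, q, p=P):
    """One server's step: its share of t* minus q times its share of b."""
    x, y_star = star_share
    _, y_b = b_share
    return (x, (y_star - q * y_b) % p)

q, t, b = 256, 99, 2
star_shares = shamir_share(t + b * q, 3, 2)   # shares of t* = t + b*q
b_shares = shamir_share(b, 3, 2)              # shares of the wrap count b
fixed = [correct_locally(s, w, q) for s, w in zip(star_shares, b_shares)]
assert shamir_reconstruct(fixed) == t
```

Each server touches only its own pair of shares, so the corrected shares reconstruct to t_i without any server learning t_i or b_i.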

第ｎのデータ変換部１０２_ｎは、非特許文献２に記載の方法を用いて、和と定数倍の計算は通信を要さずに実行できることを利用して、下記数２のようにｍシンボルを結合したデータに関するＳＳＳの分散データに変換する。 Using the method described in Non-Patent Document 2, and exploiting the fact that sums and constant multiples can be computed without communication, the nth data conversion unit 102_n converts these into SSS distributed data for data in which m symbols are combined, as shown in Expression 2 below.

第ｎのデータ変換部１０２_ｎは、［ｓ’］_ｐを第ｎの検索用データ１０５_ｎとして第ｎのデータ検索部１０３_ｎに送出する。 The nth data conversion unit 102_n sends [s'] _ p as the nth search data 105_n to the nth data search unit 103_n.

第ｎのデータ検索部１０３_ｎは、第ｎのデータ変換部１０２_ｎから送られた第ｎの検索用データ１０５_ｎと、クエリデータシェア生成部２０２から送信された第ｎの分散検索要求データ１０６_ｎを用いて、下記数３の計算によって、分散データ［ｓ’_０］_ｐ，…，［ｓ’_｛ｎ−ｍ｝］_ｐを生成する（ステップＳ２０５）。 Using the nth search data 105_n sent from the nth data conversion unit 102_n and the nth distributed search request data 106_n transmitted from the query data share generation unit 202, the nth data search unit 103_n generates the distributed data [s'_0]_p, ..., [s'_{n-m}]_p by the calculation of Expression 3 below (step S205).

もし検索対象の部分データであるｔ’_１，…，ｔ’_（ｎ−ｍ）のいずれかが検索要求データと一致した場合、ｓ’_０，…，ｓ’_（ｎ−ｍ）のいずれかが０になることに注意する。 Note that if any of the partial data t'_1, ..., t'_(n-m) to be searched matches the search request data, one of s'_0, ..., s'_(n-m) becomes 0.

その後、第１乃至第Ｎのデータ検索部１０３_１〜１０３_Ｎは、互いに通信しながら、下記数４のような計算を行い、第ｎのデータ検索部１０３_ｎは、得られた結果を第ｎの分散検索結果１０７_ｎとしてクライアント装置２００に送信する。 After that, the first to Nth data search units 103_1 to 103_N communicate with each other and perform the calculation shown in Expression 4 below, and the nth data search unit 103_n transmits the obtained result to the client device 200 as the nth distributed search result 107_n.

クライアント装置２００は、第１乃至第Ｎのサーバ装置１００_１、…、１００_Ｎから受け取った第１乃至第Ｎの分散検索結果１０７_１、…、１０７_Ｎをすべて秘密分散復号部２０３に入力し、検索結果２０６を復号する（ステップＳ２０６）。復号された結果が０であったときは、登録データの中に検索要求データを含むデータが存在することを意味し、０でなかったときは、そのようなデータは存在しなかったことを示す。 The client device 200 inputs all of the first to Nth distributed search results 107_1, ..., 107_N received from the first to Nth server devices 100_1, ..., 100_N into the secret sharing decryption unit 203 and decrypts the search result 206 (step S206). A decrypted result of 0 means that the registered data contains data that includes the search request data; a non-zero result indicates that no such data exists.

（第１の実施形態の効果） (Effects of the first embodiment)
第１の実施形態によれば、各サーバ装置に保存するデータ（分散情報）のサイズを削減することができる。その理由は、各サーバ装置に保存するデータを１シンボルごとの分散情報として保管する。そして、検索処理時に必要な部分データに対応する分散情報を生成する前処理を行ってから検索するからである。 According to the first embodiment, the size of the data (distributed information) stored in each server device can be reduced. This is because the data stored in each server device is kept as distributed information for each symbol, and the search is performed only after preprocessing that generates the distributed information corresponding to the partial data required at search time.

第１の実施形態では、秘密情報を１シンボルごとにＲＳＳＳで保管し、データ変換部ではこれを非特許文献３の方法でＴＳＳＳの分散情報に変換する。このとき、非特許文献３の方法で変換した分散情報はｔ_ｉの分散情報ではなくｔ＊_ｉ＝ｔ_ｉ＋ｂ_ｉ・ｑの分散情報になっている可能性がある。そこで、ｔ_ｉとともにｂ_ｉを分散しておき、［ｔ_ｉ］_ｐ＝［ｔ＊_ｉ］_ｐ−ｑ×［ｂ_ｉ］_ｐの計算を行うことによって、分散情報の変換後も同じ秘密情報に復元されるように補正を行っている。上記数２の式によって計算される値はｔ’_ｉの分散情報であり、ｔ’_ｉはｑビットごとに（ｔ_１，…，ｔ_ｎ）の各シンボルであるので、ｍシンボルの部分データに対する分散情報になっている。ｔ’_ｉと検索要求データｓ’との差を取ったのちに乱数でマスクをかける操作は、ｍ個のシンボルそれぞれに関して差を取ってマスクをかける操作と同様の効果がある。 In the first embodiment, the secret information is stored symbol by symbol in RSSS, and the data conversion unit converts it into TSSS shared information by the method of Non-Patent Document 3. The shared information converted by this method may be shared information not of t_i but of t*_i = t_i + b_i·q. Therefore, b_i is shared together with t_i, and [t_i]_p = [t*_i]_p - q × [b_i]_p is calculated so that the shared information still reconstructs to the same secret information after conversion. The value calculated by Expression 2 above is the shared information of t'_i, and every q bits of t'_i is one symbol of (t_1, ..., t_n), so it is shared information for partial data of m symbols. Taking the difference between t'_i and the search request data s' and then masking it with a random number has the same effect as taking the difference and masking it for each of the m symbols.

［第２の実施形態］ [Second Embodiment]
次に、図６乃至図８を参照して、本発明の第２の実施形態に係る秘匿検索システムについて説明する。 Next, a confidential search system according to the second embodiment of the present invention will be described with reference to FIGS. 6 to 8.

［構成の説明］ [Description of configuration]
図６は、本発明の第２の実施形態に係る秘匿検索システムの構成を示すブロック図である。 FIG. 6 is a block diagram showing the configuration of the confidential search system according to the second embodiment of the present invention.

図６を参照すると、第２の実施形態に係る秘匿検索システムは、クライアント装置２００Ａと、第１乃至第Ｎのサーバ装置１００Ａ_１、１００Ａ_２、…、１００Ａ_Ｎとを備える。サーバ装置１００Ａ_１〜１００Ａ_Ｎは、それぞれ、第１乃至第Ｎのサーバ装置とも呼ばれる。クライアント装置２００Ａは、Ｎ台のサーバ装置１００Ａ_１〜１００Ａ_Ｎと通信する。また、Ｎ台のサーバ装置１００Ａ_１、…、１００Ａ_Ｎは互いに通信する。 Referring to FIG. 6, the confidential search system according to the second embodiment includes a client device 200A and first to Nth server devices 100A_1, 100A_2, ..., 100A_N. The server devices 100A_1 to 100A_N are also referred to as first to Nth server devices, respectively. The client device 200A communicates with N server devices 100A_1 to 100A_N. Further, the N server devices 100A_1, ..., 100A_N communicate with each other.

図７は、第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）の構成を示すブロック図である。 FIG. 7 is a block diagram showing the configuration of the nth server device 100A_n (1 ≦ n ≦ N).
第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）は、第ｎのデータ記憶部１０１Ａ_ｎと、第ｎのデータ変換部１０２Ａ_ｎと、第ｎのデータ検索部１０３Ａ_ｎとを備える。 The nth server device 100A_n (1 ≦ n ≦ N) includes an nth data storage unit 101A_n, an nth data conversion unit 102A_n, and an nth data search unit 103A_n.

第ｎのデータ記憶部１０１Ａ_ｎは、クライアント装置２００Ａから第ｎの分散登録データ１０４Ａ_ｎを受け、それを保存する。また、第ｎのデータ記憶部１０１Ａ_ｎは、検索時に保存した第ｎの分散登録データ１０４Ａ_ｎを第ｎのデータ変換部１０２Ａ_ｎに出力する。第ｎのデータ変換部１０２Ａ_ｎは、第ｎのデータ記憶部１０１Ａ_ｎに保存された第ｎの分散登録データを第ｎの検索用の分散情報（検索用データ）１０５Ａ_ｎに変換する。 The nth data storage unit 101A_n receives the nth distributed registration data 104A_n from the client device 200A and stores it. Further, the nth data storage unit 101A_n outputs the nth distributed registration data 104A_n stored at the time of search to the nth data conversion unit 102A_n. The n-th data conversion unit 102A_n converts the n-th distributed registration data stored in the n-th data storage unit 101A_n into distributed information for search (search data) 105A_n.

第ｎのデータ検索部１０３Ａ_ｎは、第ｎのデータ変換部１０２Ａ_ｎから受けた第ｎの検索用データ１０５Ａ_ｎと、クライアント装置２００Ａから受けた第ｎの分散検索要求データ１０６Ａ_ｎとを用いて、他のデータ検索部１０３Ａ_１〜１０３Ａ_Ｎ（１０３Ａ_ｎを除く）と互いに通信しながら、第ｎのデータ記憶部１０１Ａ_ｎの第ｎの分散登録データに対する検索を行って、第ｎの分散検索結果１０７Ａ_ｎを出力する。 The n-th data search unit 103A_n uses the n-th search data 105A_n received from the n-th data conversion unit 102A_n and the n-th distributed search request data 106A_n received from the client device 200A, and uses the other data. While communicating with the search units 103A_1 to 103A_N (excluding 103A_n), a search is performed on the nth distributed registration data in the nth data storage unit 101A_n, and the nth distributed search result 107A_n is output.

図８は、クライアント装置２００Ａの構成を示すブロック図である。クライアント装置２００Ａは、登録データシェア生成部２０１Ａと、クエリデータシェア生成部２０２Ａと、秘密分散復号部２０３Ａとを備える。 FIG. 8 is a block diagram showing the configuration of the client device 200A. The client device 200A includes a registration data share generation unit 201A, a query data share generation unit 202A, and a secret sharing decryption unit 203A.

登録データシェア生成部２０１Ａは、図示しない入力装置から登録データ２０４Ａを受ける。登録データシェア生成部２０１Ａは、秘密分散法の分散データ生成手順を実行して、第１乃至第Ｎの分散登録データ１０４Ａ_１、…、分散登録データ１０４Ａ_Ｎを生成する。登録データシェア生成部２０１Ａは、第ｎの分散登録データ１０４Ａ_ｎ（１≦ｎ≦Ｎ）を第ｎのサーバ装置１００Ａ_ｎに送信する。 The registration data share generation unit 201A receives the registration data 204A from an input device (not shown). The registration data share generation unit 201A executes the shared data generation procedure of the secret sharing method to generate the first to Nth distributed registration data 104A_1, ..., Distributed registration data 104A_N. The registration data share generation unit 201A transmits the nth distributed registration data 104A_n (1 ≦ n ≦ N) to the nth server device 100A_n.

クエリデータシェア生成部２０２Ａは、図示しない入力装置から検索要求データ２０５Ａを受ける。クエリデータシェア生成部２０２Ａは、秘密分散法の分散データ生成手順を実行して、第１乃至第Ｎの分散検索要求データ１０６Ａ_１、…、１０６Ａ_Ｎを生成する。クエリデータシェア生成部２０２Ａは、第ｎの分散検索要求データ１０６Ａ_ｎ（１≦ｎ≦Ｎ）を第ｎのサーバ装置１００Ａ_ｎに送信する。 The query data share generation unit 202A receives the search request data 205A from an input device (not shown). The query data share generation unit 202A executes the shared data generation procedure of the secret sharing method to generate the first to Nth distributed search request data 106A_1, ..., 106A_N. The query data share generation unit 202A transmits the nth distributed search request data 106A_n (1 ≦ n ≦ N) to the nth server device 100A_n.

秘密分散復号部２０３Ａは、第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）から第ｎの分散検索結果１０７Ａ_ｎを受ける。秘密分散復号部２０３Ａは、第１乃至第Ｎの分散検索結果１０７Ａ_１、…、１０７Ａ_Ｎに対して秘密分散法の復号手順を実行して、検索結果２０６Ａを復元する。 The secret sharing decryption unit 203A receives the nth shared search result 107A_n from the nth server apparatus 100A_n (1 ≦ n ≦ N). The secret sharing decryption unit 203A executes the secret sharing decryption procedure on the first to Nth shared search results 107A_1, ..., 107A_N to restore the search result 206A.

［動作の説明］ [Description of operation]
次に、図６から図１０を用いて、本発明の第２の実施形態に係る秘匿検索システムの動作について詳細に説明する。 Next, the operation of the confidential search system according to the second embodiment of the present invention will be described in detail with reference to FIGS. 6 to 10.

本発明の第２の実施形態に係る秘匿検索システムは、（１）データ登録処理と、（２）データ検索処理と、の２種類の処理を行う。図９、図１０は、それぞれ、データ登録処理およびデータ検索処理のフローを示すフローチャートである。 The confidential search system according to the second embodiment of the present invention performs two types of processing: (1) data registration processing and (2) data search processing. FIGS. 9 and 10 are flowcharts showing the flow of the data registration process and the data search process, respectively.

（データ登録処理） (Data registration process)
図９は、本発明の第２の実施形態における秘匿検索システムのデータ登録時の動作を示すフローチャートである。 FIG. 9 is a flowchart showing an operation at the time of data registration of the confidential search system according to the second embodiment of the present invention.

新規データを第１乃至第Ｎのサーバ装置１００Ａ_１、…、１００Ａ_Ｎに登録するときには以下のようにする。 The following is performed when registering new data in the first to Nth server devices 100A_1, ..., 100A_N.

秘匿検索システムは、新規に分散したい登録データ２０４Ａ（ｔ_１，…，ｔ_ｎ）をクライアント装置２００Ａの登録データシェア生成部２０１Ａに入力する（ステップＳ１０１Ａ）。ただし、ｔ_１，…，ｔ_ｎはそれぞれデータの１シンボルに対応するものであり、各ｔ_ｉは有限体ＧＦ（ｑ）の元とする。 The confidential search system inputs the registration data 204A (t_1, ..., t_n) to be newly distributed into the registration data share generation unit 201A of the client device 200A (step S101A). Here, t_1, ..., t_n each correspond to one symbol of the data, and each t_i is an element of the finite field GF(q).

登録データシェア生成部２０１Ａは、１≦ｉ≦ｎについて、ＳＳＳによるｔ_ｉのシェア［ｔ_ｉ］ｑを生成し，［ｔ_１］ｑ，…［ｔ_ｎ］ｑをそれぞれ第１乃至第Ｎの分散登録データ１０４Ａ_１，…，１０４Ａ_Ｎとする（ステップＳ１０２Ａ）。 The registration data share generation unit 201A generates a share [t_i] q of t_i by SSS for 1 ≦ i ≦ n, and assigns [t_1] q, ... [t_n] q to the first to Nth distributed registration data 104A_1. , ..., 104A_N (step S102A).

登録データシェア生成部２０１Ａは、第ｎの分散登録データ１０４Ａ_ｎを第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）に送信する（ステップＳ１０３Ａ）。 The registration data share generation unit 201A transmits the nth distributed registration data 104A_n to the nth server device 100A_n (1 ≦ n ≦ N) (step S103A).

第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）は、受信した第ｎの分散登録データ１０４Ａ_ｎを第ｎのデータ記憶部１０１Ａ_ｎに保存する（ステップＳ１０４Ａ）。 The nth server device 100A_n (1 ≦ n ≦ N) stores the received nth distributed registration data 104A_n in the nth data storage unit 101A_n (step S104A).

（データ検索処理） (Data search process)
図１０は、本発明の第２の実施形態における秘匿検索システムのデータ検索時の動作を示すフローチャートである。 FIG. 10 is a flowchart showing an operation at the time of data search of the confidential search system according to the second embodiment of the present invention.

第１乃至第Ｎのサーバ装置１００Ａ_１、…、１００Ａ_Ｎに分散されたデータＴ＝（ｔ_１，ｔ_２，…，ｔ_ｎ）の組に関して、ある部分データ（ｓ_１，…，ｓ_ｍ）を含むデータが存在するかどうかを検索したいときには次のようにする。 To search whether the set of data T = (t_1, t_2, ..., t_n) distributed across the first to Nth server devices 100A_1, ..., 100A_N contains data that includes certain partial data (s_1, ..., s_m), the following is performed.

秘匿検索システムは、検索を行いたいデータである検索要求データ２０５Ａ（ｓ_１，…，ｓ_ｍ）を、クライアント装置２００Ａのクエリデータシェア生成部２０２Ａに入力する（ステップＳ２０１Ａ）。ｓ_１，…，ｓ_ｍは検索要求データの各文字を表すものであり、それぞれのサイズはｌｏｇｑである。 The confidential search system inputs the search request data 205A (s_1, ..., s_m), which is the data desired to be searched, to the query data share generation unit 202A of the client device 200A (step S201A). s_1, ..., s_m represent each character of the search request data, and the size of each is logq.

クエリデータシェア生成部２０２Ａは、検索要求データ２０５Ａ（ｓ_１，…，ｓ_ｍ）から、ＳＳＳの分散情報［ｓ］_ｐ＝［ｓ_１ || ｓ_２ || … || ｓ_ｍ］_ｐを生成する（ステップＳ２０２Ａ）。ただし、ｐ＝ｑ＾ｍであり、ｓは有限体ＧＦ（ｑ＾ｍ）の元である。クエリデータシェア生成部２０２Ａは、第ｎの分散検索要求データ１０６Ａ_ｎ（１≦ｎ≦Ｎ）を第ｎのサーバ装置１００Ａ_ｎに送信する。 The query data share generation unit 202A generates the SSS shared information [s]_p = [s_1 || s_2 || ... || s_m]_p from the search request data 205A (s_1, ..., s_m) (step S202A). Here, p = q^m, and s is an element of the finite field GF(q^m). The query data share generation unit 202A transmits the nth distributed search request data 106A_n (1 ≦ n ≦ N) to the nth server device 100A_n.

第ｎのサーバ装置１００Ａ_ｎ（１≦ｎ≦Ｎ）は、第ｎのデータ記憶部１０１Ａ_ｎから第ｎの分散登録データを読み出し、第ｎのデータ変換部１０２Ａ_ｎに送出する（ステップＳ２０３Ａ）。 The nth server device 100A_n (1 ≦ n ≦ N) reads the nth distributed registration data from the nth data storage unit 101A_n and sends it to the nth data conversion unit 102A_n (step S203A).

第ｎのデータ変換部１０２Ａ_ｎはまず、第ｎのデータ記憶部１０１Ａ_ｎから受けたＳＳＳの分散情報［ｔ_１］ｑ，…，［ｔ_ｎ］ｑを、以下の方法で検索用データに変換する（ステップＳ２０４Ａ）。 The nth data conversion unit 102A_n first converts the SSS distribution information [t_1] q, ..., [t_n] q received from the nth data storage unit 101A_n into search data by the following method (step S204A). ).

第ｎのデータ変換部１０２Ａ_ｎは、各ｔ_ｉについて［ｔ_ｉ］ｑ＝（ｔ_｛ｉ，１｝，ｔ_｛ｉ，２｝，．．．，ｔ_｛ｉ，Ｎ｝）のうち、自らが保持する分散情報ｔ_｛１，ｊ｝，ｔ_｛２，ｊ｝，．．．，ｔ_｛ｎ，ｊ｝を用いて、ｔ’_｛ｉ，ｊ｝＝ｔ_｛ｉ，ｊ｝ || ｔ_｛ｉ＋１，ｊ｝ || … || ｔ_｛ｉ＋ｍ−１，ｊ｝を０≦ｉ≦ｎ−ｍについて計算し、ｔ’_｛０｝，．．．，ｔ’_｛ｎ−ｍ｝を得る。ｔ’_｛ｉ，１｝，．．．，ｔ’_｛ｉ，Ｎ｝は、ｔ’_｛ｉ｝＝ｔ_｛ｉ｝ || … || ｔ_｛ｉ＋ｍ−１｝の分散情報である。すなわち、［ｔ’_ｉ］_ｐ＝［ｔ_｛ｉ｝ || … || ｔ_｛ｉ＋ｍ−１｝］_ｐである。ただし、ｐ＝ｑ＾ｍであり、各ｔ’_ｉは有限体ＧＦ（２＾ｑ）の元である。 For each t_i, the nth data conversion unit 102A_n uses the shared information t_{1,j}, t_{2,j}, ..., t_{n,j} that it holds itself, out of [t_i]q = (t_{i,1}, t_{i,2}, ..., t_{i,N}), to calculate t'_{i,j} = t_{i,j} || t_{i+1,j} || ... || t_{i+m-1,j} for 0 ≦ i ≦ n-m, obtaining t'_{0}, ..., t'_{n-m}. Here, t'_{i,1}, ..., t'_{i,N} are the shared information of t'_{i} = t_{i} || ... || t_{i+m-1}. That is, [t'_i]_p = [t_{i} || ... || t_{i+m-1}]_p. Here, p = q^m, and each t'_i is an element of the finite field GF(2^q).
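The idea that each server concatenates only its own per-symbol shares, and that the results are shares of the concatenated symbols, can be sketched as follows. This sketch deliberately swaps in a simpler technique than the SSS of this embodiment: byte-wise XOR sharing, where addition in GF(2^8) is XOR and concatenation is therefore share-wise. The function names are hypothetical.

```python
import secrets

def xor_share(data: bytes, n_servers: int):
    """Split data into n_servers byte strings whose XOR equals data."""
    shares = [secrets.token_bytes(len(data)) for _ in range(n_servers - 1)]
    last = bytearray(data)
    for s in shares:
        for k, byte in enumerate(s):
            last[k] ^= byte
    return shares + [bytes(last)]

def local_windows(share: bytes, m: int):
    """One server's step: concatenate m consecutive per-symbol shares."""
    return [share[i:i + m] for i in range(len(share) - m + 1)]

def xor_reconstruct(pieces):
    """XOR the pieces held by all servers back together."""
    out = bytearray(len(pieces[0]))
    for piece in pieces:
        for k, byte in enumerate(piece):
            out[k] ^= byte
    return bytes(out)

text = b"a calculator app"
shares = xor_share(text, 3)                         # one share string per server
per_server = [local_windows(s, 10) for s in shares]  # computed without communication
# Window 2 of every server, combined, is the substring starting at index 2.
assert xor_reconstruct([w[2] for w in per_server]) == b"calculator"
```

No server sends anything while building its windows, which mirrors the conversion step described above; only the sharing scheme itself is simplified here.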

第ｎのデータ変換部１０２Ａ_ｎは、［ｔ’_０］_ｐ，．．．，［ｔ’_｛ｎ−ｍ｝］_ｐを第ｎの検索用データ１０５Ａ_ｎとして、第ｎのデータ検索部１０３Ａ_ｎに送出する。 The nth data conversion unit 102A_n sends [t'_0]_p, ..., [t'_{n-m}]_p to the nth data search unit 103A_n as the nth search data 105A_n.

第ｎのデータ検索部１０３Ａ_ｎは、第ｎのデータ変換部１０２Ａ_ｎから送られた第ｎの検索用データ１０５Ａ_ｎと、クエリデータシェア生成部２０２Ａから送信された第ｎの分散検索要求データ１０６Ａ_ｎを用いて、上記数３の計算によって、分散データ［ｓ’_０］_ｐ，…，［ｓ’_｛ｎ−ｍ｝］_ｐを生成する（ステップＳ２０５Ａ）。もし検索対象の部分データであるｔ’_０，…，ｔ’_（ｎ−ｍ）のいずれかが検索要求データと一致した場合、ｓ’_０，…，ｓ’_｛ｎ−ｍ｝のいずれかが０になることに注意する。 Using the nth search data 105A_n sent from the nth data conversion unit 102A_n and the nth distributed search request data 106A_n transmitted from the query data share generation unit 202A, the nth data search unit 103A_n generates the distributed data [s'_0]_p, ..., [s'_{n-m}]_p by the calculation of Expression 3 above (step S205A). Note that if any of the partial data t'_0, ..., t'_(n-m) to be searched matches the search request data, one of s'_0, ..., s'_{n-m} becomes 0.

その後、第ｎのデータ検索部１０３Ａ_ｎは、上記数４のような計算を行い、得られた結果を第ｎの分散検索結果１０７Ａ_ｎとして、クライアント装置２００Ａに送信する。 After that, the nth data search unit 103A_n performs the calculation as shown in the above mathematical expression 4, and transmits the obtained result to the client apparatus 200A as the nth distributed search result 107A_n.

クライアント装置２００Ａは、第１乃至第Ｎのサーバ装置１００Ａ_１、…、１００Ａ_Ｎから受け取った第１乃至第Ｎの分散検索結果１０７Ａ_１、…、１０７Ａ_Ｎをすべて秘密分散復号部２０３Ａに入力し、検索結果２０６Ａを復号する（ステップＳ２０６Ａ）。復号された結果が０であったときは、登録データの中に検索要求データを含むデータが存在することを意味し、０でなかったときは、そのようなデータは存在しなかったことを示す。 The client device 200A inputs all of the first to Nth distributed search results 107A_1, ..., 107A_N received from the first to Nth server devices 100A_1, ..., 100A_N into the secret sharing decryption unit 203A and decrypts the search result 206A (step S206A). A decrypted result of 0 means that the registered data contains data that includes the search request data; a non-zero result indicates that no such data exists.

（第２の実施形態の効果） (Effects of the second embodiment)
第２の実施形態によれば、各サーバ装置に保存するデータ（分散情報）のサイズを削減することができる。その理由は、各サーバ装置に保存するデータを１シンボルごとの分散情報として保管する。そして、検索処理時に必要な部分データに対応する分散情報を生成する前処理を行ってから検索するからである。 According to the second embodiment, the size of the data (distributed information) stored in each server device can be reduced. This is because the data stored in each server device is kept as distributed information for each symbol, and the search is performed only after preprocessing that generates the distributed information corresponding to the partial data required at search time.

第２の実施形態において、秘密情報は１シンボルごとにＴＳＳＳで分散されており、データ変換部ではこれをステップＳ２０４Ａのｔ’_｛ｉ，ｊ｝＝ｔ_｛ｉ，ｊ｝ || ｔ_｛ｉ＋１，ｊ｝ || … || ｔ_｛ｉ＋ｍ−１，ｊ｝のように連結する。各シンボルｔ_｛１，ｊ｝，ｔ_｛２，ｊ｝，．．．，ｔ_｛ｎ，ｊ｝が有限体ＧＦ（ｑ）の元であるとき、これをｍ個連結したものは有限体ＧＦ（ｑ＾ｍ）の元であり、もとの秘密情報ｔ＝（ｔ_１，…，ｔ_ｎ）のうちのｍシンボルの部分データに対する分散情報になっている。これにより、上記第１の実施形態と同様に、ｍシンボル分に対する操作をまとめて実行することができる。 In the second embodiment, the secret information is shared symbol by symbol in TSSS, and the data conversion unit concatenates the shares as t'_{i,j} = t_{i,j} || t_{i+1,j} || ... || t_{i+m-1,j} in step S204A. When each symbol share t_{1,j}, t_{2,j}, ..., t_{n,j} is an element of the finite field GF(q), the concatenation of m of them is an element of the finite field GF(q^m) and is shared information for partial data of m symbols of the original secret information t = (t_1, ..., t_n). As a result, as in the first embodiment, the operations for m symbols can be executed together.

［実施形態の具体例］ [Specific Example of Embodiment]
上記第１の実施形態の具体例として、３台のサーバ装置１００_１、１００_２、１００_３と、クライアント装置２００とで構成された文字列検索システムの構成を示す。 As a specific example of the first embodiment, a configuration of a character string search system including three server devices 100_1, 100_2, 100_3 and a client device 200 will be shown.

この第１の実施形態の具体例では、テキストの元データはそれぞれ最大２００文字とし、Ｔ＝（ｔ_１，…，ｔ_２００）の形で表現され、ｔ_ｉ（１≦ｉ≦ｎ）はテキストの各文字を表す。３台のサーバ装置１００_１、１００_２、１００_３には文字列の組が１文字ごとにＲＳＳＳを用いて＜ｔ_１＞ｑ，…，＜ｔ_２００＞ｑのように分散されて保管されている。各文字は８ビットで符号化されており、ｑ＝２＾８とする。第１の実施形態の具体例では、クライアント装置２００が検索を要求するキーワードの長さは１０文字とする。すなわち、検索要求データは最大８０ビットの文字列であり、ｐ＝２＾８０とする。 In the specific example of the first embodiment, the original data of the text has a maximum of 200 characters each and is expressed in the form of T = (t_1, ..., t_200), and t_i (1 ≦ i ≦ n) is each character of the text. Represents. In the three server devices 100_1, 100_2, 100_3, a set of character strings is distributed and stored for each character using RSSS as <t_1> q, ..., <t_200> q. Each character is encoded with 8 bits, and q = 2 ^ 8. In the specific example of the first embodiment, the length of the keyword requested to be searched by the client device 200 is 10 characters. That is, the search request data is a character string of maximum 80 bits, and p = 2 ^ 80.

クライアント装置２００は、検索要求データとして、１０文字のテキストｓ＝（ｓ_１，ｓ_２，…，ｓ_１０）＝“ｃａｌｃｕｌａｔｏｒ”をＳＳＳで［（“ｃａｌｃｕｌａｔｏｒ”）］_ｐ＝［２＾（７２）・”ｃ”＋２＾（６４）・”ａ”＋２＾（５６）・”ｌ”＋２＾（４８）・”ｃ”＋２＾（４０）・”ｕ”＋２＾（３２）・”ｌ”＋２＾（２４）・”ａ”＋２＾（１６）・”ｔ”＋２＾（８）・”ｏ”＋“ｒ”］_ｐのように、３台のサーバ装置１００_１、１００_２、１００_３に分散する。 The client device 200 uses the text of 10 characters s = (s_1, s_2, ..., s_10) = “calculator” as the search request data in SSS to [(“calculator”)] _ p = [2 ^ (72) · ”c "+ 2 ^ (64)," a "+ 2 ^ (56)," l "+ 2 ^ (48)," c "+ 2 ^ (40)," u "+ 2 ^ (32)," l "+ 2 ^ (24 ) · “A” + 2 ^ (16) · “t” + 2 ^ (8) · “o” + “r”] _ p, which are distributed to the three server devices 100_1, 100_2, 100_3.

第ｎのサーバ装置１００_ｎ（１≦ｎ≦３）は、１０文字の検索要求データに対するＲＳＳＳの第ｎの分散登録データ１０４_ｎを受けたのち、第ｎのデータ記憶部１０１_ｎに保存された、テキストの各文字に対するＲＳＳＳの分散情報＜ｔ_１＞ｑ，…，＜ｔ_２００＞ｑのうち、自らが保持する情報を、第ｎのデータ変換部１０２_ｎに入力し、上記数２のようにＴＳＳＳの分散情報［ｔ＊_１］_ｐ，…，［ｔ＊_ｎ］_ｐに変換する。このとき、ｔ＊_ｉ（＝ｔ_ｉｍｏｄｑ）は，ｔ_ｉ，ｔ_ｉ＋ｑ，ｔ_ｉ＋２ｑの３種類の値を取る可能性があることに注意する。 After receiving the RSSS nth distributed registration data 104_n for the 10-character search request data, the nth server device 100_n (1 ≦ n ≦ 3) inputs the information that it holds itself, out of the RSSS shared information <t_1>q, ..., <t_200>q for each character of the text stored in the nth data storage unit 101_n, into the nth data conversion unit 102_n, which converts it into the TSSS shared information [t*_1]_p, ..., [t*_n]_p as in Expression 2 above. Note that t*_i (= t_i mod q) can take one of three values: t_i, t_i + q, or t_i + 2q.

次に，第ｎのサーバ装置１００_ｎは，各ｔ_ｉについて、［ｔ_ｉ］_ｐ＝［ｔ＊_ｉ］_ｐ−ｑ×［ｂ_ｉ］_ｐを計算する。 Next, the nth server device 100_n calculates [t_i] _p = [t * _i] _p−q × [b_i] _p for each t_i.

その後，サーバ装置１００_ｎは，上記で計算した［ｔ_１］_ｐ，．．．，［ｔ_｛ｎ−ｍ｝］_ｐから，１０文字ごとの部分文字列に対するＳＳＳの分散情報［ｔ’_ｉ］_ｐ＝［２＾（７２）・ｔ_｛ｉ｝＋２＾（６４）・ｔ_｛ｉ＋１｝＋２＾（５６）・ｔ_｛ｉ＋２｝＋２＾（４８）・ｔ_｛ｉ＋３｝＋２＾（４０）・ｔ_｛ｉ＋４｝＋２＾（３２）・ｔ_｛ｉ＋５｝＋２＾（２４）・ｔ_｛ｉ＋６｝＋２＾（１６）・ｔ_｛ｉ＋７｝＋２＾（８）・ｔ_｛ｉ＋８｝＋ｔ_｛ｉ＋９｝］_ｐ（１≦ｉ＜１９１）１０５_ｎに変換する。 After that, the server device 100_n converts [t_1]_p, ..., [t_{n-m}]_p calculated above into the nth search data 105_n, that is, the SSS shared information for each 10-character substring, [t'_i]_p = [2^(72)·t_{i} + 2^(64)·t_{i+1} + 2^(56)·t_{i+2} + 2^(48)·t_{i+3} + 2^(40)·t_{i+4} + 2^(32)·t_{i+5} + 2^(24)·t_{i+6} + 2^(16)·t_{i+7} + 2^(8)·t_{i+8} + t_{i+9}]_p (1 ≦ i < 191).
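The weighted sum above packs ten 8-bit symbols into one 80-bit value, with the first symbol in the most significant position. The following minimal sketch shows that packing on plaintext values rather than on shares; because the expression uses only sums and constant multiples, the same formula can be applied to each server's shares. The function names `pack_window` and `all_windows` are hypothetical.

```python
def pack_window(symbols, bits_per_symbol=8):
    """Pack symbols into one integer, first symbol in the highest bits."""
    value = 0
    for s in symbols:
        value = (value << bits_per_symbol) | s
    return value

def all_windows(text, m, bits_per_symbol=8):
    """Packed values of every m-symbol substring of an ASCII text."""
    codes = text.encode("ascii")
    return [pack_window(codes[i:i + m], bits_per_symbol)
            for i in range(len(codes) - m + 1)]

keyword = pack_window(b"calculator")
# Same value as 2^72*"c" + 2^64*"a" + ... + 2^8*"o" + "r".
assert keyword == sum(c << (8 * (9 - k)) for k, c in enumerate(b"calculator"))
assert keyword < 2**80

windows = all_windows("a calculator app", 10)
assert windows.index(keyword) == 2          # the keyword starts at offset 2
assert len(all_windows("x" * 200, 10)) == 191
```

A 200-character text yields 191 windows of 10 characters, matching the range of the search data [t'_1]_p, ..., [t'_191]_p used in the matching step.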

第ｎのサーバ装置１００_ｎは、クライアント装置２００から受けた第ｎの分散検索要求データ１０６_ｎと第ｎのデータ記憶部１０１_ｎのデータから変換された第ｎの検索用データ１０５_ｎとを、第ｎのデータ検索部１０３_ｎに入力する。 The nth server device 100_n receives the nth distributed search request data 106_n received from the client device 200 and the nth search data 105_n converted from the data of the nth data storage unit 101_n as the nth data. Input to the search unit 103_n.

第ｎのデータ検索部１０３_ｎは、入力された分散検索要求データ［（ｃａｌｃｕｌａｔｏｒ）］_ｐと検索用データ［ｔ’_１］_ｐ，…，［ｔ’_１９１］_ｐから、
［ｓ’_１］_ｐ＝［ｔ’_１］_ｐ−［（ｃａｌｃｕｌａｔｏｒ）］_ｐ，
…
［ｓ’_１９１］_ｐ＝［ｔ’_１９１］_ｐ−［（ｃａｌｃｕｌａｔｏｒ）］_ｐ
の計算で検索要求データと部分文字列とのマッチングを行う。
From the input distributed search request data [(calculator)]_p and the search data [t'_1]_p, ..., [t'_191]_p, the nth data search unit 103_n matches the search request data against the substrings by calculating
[s'_1]_p = [t'_1]_p - [(calculator)]_p,
…
[s'_191]_p = [t'_191]_p - [(calculator)]_p.

ここで、ある部分文字列ｔ’_ｋについて、ｔ’_ｋ＝（“ｃａｌｃｕｌａｔｏｒ”）となるとする。このとき、［ｓ’_ｋ］＝［ｔ’_ｋ］_ｐ−［（”ｃａｌｃｕｌａｔｏｒ”）］_ｐ＝［０］_ｐとなる。 Here, it is assumed that t′_k = (“calculator”) for a certain partial character string t′_k. At this time, [s'_k] = [t'_k] _p-[("calculator")] _ p = [0] _p.

その後、第ｎのデータ検索部１０３_ｎは、乱数ｒの分散情報［ｒ］_ｐを分散し、上記数に示す式を用いて計算を行い、検索結果Ｓ＝ｓ’_１×…×ｓ’_（１９１）×ｒの分散情報を生成する。あるｓ’_ｋ＝０であるため、Ｓの値は０となる。 After that, the nth data search unit 103_n shares the shared information [r]_p of a random number r and performs the calculation using the expression shown above, generating the shared information of the search result S = s'_1 × ... × s'_(191) × r. Because s'_k = 0 for some k, the value of S is 0.

第ｎのデータ検索部１０３_ｎは、Ｓの分散情報を第ｎの分散検索結果１０７_ｎとして出力する。第ｎのサーバ装置１００_ｎは、第ｎの分散検索結果１０７_ｎをクライアント装置２００に送信する。 The nth data search unit 103_n outputs the S shared information as the nth distributed search result 107_n. The nth server device 100_n transmits the nth distributed search result 107_n to the client device 200.

クライアント装置２００は、３台のサーバ装置１００_１、１００_２、１００_３から、検索結果Ｓの部分情報１０７_１、１０７_２、１０７_３を受け、ＴＳＳＳの復元アルゴリズムによって検索結果Ｓを復元する。クライアント装置２００は、Ｓ＝０が復号されたことを確認し、「サーバに”ｃａｌｃｕｌａｔｏｒ”を含む文字列が存在する」という結果を出力する。 The client device 200 receives the partial information 107_1, 107_2, 107_3 of the search result S from the three server devices 100_1, 100_2, 100_3, and restores the search result S by the restore algorithm of TSSS. The client device 200 confirms that S = 0 has been decrypted, and outputs the result that “a character string including“ calculator ”exists in the server”.
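The decision rule S = s'_1 × ... × s'_191 × r can be checked on plaintext values with a minimal sketch. It assumes arithmetic in a prime field, uses the Mersenne prime 2^89 - 1 as a stand-in modulus, and draws a non-zero mask r; these choices and the function name `masked_match` are illustrative assumptions. In the system described here the product is computed on shares by the servers.

```python
import secrets

P = 2**89 - 1  # prime modulus larger than any 80-bit window (assumption)

def masked_match(windows, keyword, p=P):
    """Return r * prod(window - keyword) mod p for a random non-zero r."""
    result = 1 + secrets.randbelow(p - 1)   # non-zero random mask r
    for w in windows:
        result = result * ((w - keyword) % p) % p
    return result

windows = [5, 9, 12]                  # stand-ins for packed substrings
assert masked_match(windows, 9) == 0  # 9 occurs, so one factor is zero
assert masked_match(windows, 7) != 0  # 7 does not occur, result is masked
```

In a prime field a product is zero only if some factor is zero, so S = 0 exactly when some window equals the keyword; a non-zero S is randomized by r and therefore reveals no more than the absence of a match.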

第１の実施形態の具体例において、サーバ装置に保存されるテキストの分散情報のサイズは、１テキストあたりｎ・ｌｏｇｐ＋２ｎ・ｌｏｇｑ＝２００×８０＋２×２００×８＝１９２００ビットである。関連技術のように、部分文字列をすべて８０ビットのＴＳＳＳの分散情報で保持していた場合、分散情報のサイズは、（１／２）・ｎ・（ｎ＋１）・ｌｏｇｐ＝（１／２）×２００×２０１×８０＝１６０８０００ｂｉｔｓ（≒２００ＫＢ）となる。このように、第１の実施形態の具体例では、関連技術と比べて分散情報のサイズを１／８３にすることができる。 In the specific example of the first embodiment, the size of the distributed information of text stored in the server device is n · logp + 2n · logq = 200 × 80 + 2 × 200 × 8 = 19200 bits per text. As in the related art, when the partial character strings are all held by the TSSS distributed information of 80 bits, the size of the distributed information is (1/2) .n. (N + 1) .logp = (1/2) × 200 × 201 × 80 = 1608000 bits (≈200 KB). As described above, in the specific example of the first embodiment, the size of shared information can be reduced to 1/83 as compared with the related art.
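The storage figures in this paragraph can be reproduced with a few lines of arithmetic. The sketch below only evaluates the two formulas quoted above for n = 200, log p = 80, and log q = 8; the function names are hypothetical.

```python
def per_symbol_storage_bits(n, log_p, log_q):
    """n*log p + 2n*log q, the per-text size quoted for the first embodiment."""
    return n * log_p + 2 * n * log_q

def all_substrings_storage_bits(n, log_p):
    """(1/2)*n*(n+1)*log p, the size quoted for the related art."""
    return n * (n + 1) // 2 * log_p

n, log_p, log_q = 200, 80, 8
small = per_symbol_storage_bits(n, log_p, log_q)
large = all_substrings_storage_bits(n, log_p)

assert small == 19200        # bits per text
assert large == 1608000      # bits per text, about 200 KB
assert large / small == 83.75
```

The ratio of 83.75 corresponds to the reduction to roughly 1/83 stated above.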

上記第２の実施形態の具体例として、３台のサーバ装置１００Ａ_１、１００Ａ_２、１００Ａ_３とクライアント装置２００Ａとで構成された文字列検索システムの構成を示す。 As a specific example of the second embodiment, a configuration of a character string search system including three server devices 100A_1, 100A_2, 100A_3 and a client device 200A will be shown.

この第２の実施形態の具体例では、テキストの元データはそれぞれ最大２００文字とし、Ｔ＝（ｔ_１，…，ｔ_２００）の形で表現され、ｔ_ｉ（１≦ｉ＜ｎ）はテキストの各文字を表す。３台のサーバ装置１００Ａ_１、１００Ａ_２、１００Ａ_３には文字列の組が１文字ごとにＴＳＳＳで［ｔ_１］ｑ，…，［ｔ_２００］ｑのように分散されて保管されている。各文字は８ビットで符号化されており、ｑ＝２＾８とする。本具体例では、クライアント装置２００Ａが検索を要求するキーワードの長さは１０文字とする。すなわち、検索要求データは最大８０ビットの文字列であり、ｐ＝２＾８０とする。 In the specific example of the second embodiment, the original data of the text has a maximum of 200 characters each and is expressed in the form of T = (t_1, ..., t_200), and t_i (1 ≦ i <n) is each character of the text. Represents. In the three server devices 100A_1, 100A_2, 100A_3, a set of character strings is distributed and stored for each character in TSSS such as [t_1] q, ..., [t_200] q. Each character is encoded with 8 bits, and q = 2 ^ 8. In this specific example, the length of the keyword requested by the client device 200A to search is 10 characters. That is, the search request data is a character string of maximum 80 bits, and p = 2 ^ 80.

The client device 200A distributes, as search request data, the 10-character text s = (s_1, s_2, …, s_10) = "calculator" to the three server devices 100A_1, 100A_2, and 100A_3 using SSS, as [("calculator")]_p = [2^72·"c" + 2^64·"a" + 2^56·"l" + 2^48·"c" + 2^40·"u" + 2^32·"l" + 2^24·"a" + 2^16·"t" + 2^8·"o" + "r"]_p.
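The packing of the keyword into a single 80-bit value can be sketched as follows. This is a minimal illustration only: the function name `pack_keyword` is hypothetical, and an 8-bit ASCII character code is assumed (the text specifies 8 bits per character but not the character set). The secret sharing step itself is not shown.

```python
def pack_keyword(keyword: str, bits_per_char: int = 8) -> int:
    """Pack a keyword into one integer, first character in the highest bits.

    For a 10-character keyword this evaluates
    2^72*s_1 + 2^64*s_2 + ... + 2^8*s_9 + s_10.
    """
    value = 0
    for ch in keyword:
        code = ord(ch)  # assumption: 8-bit ASCII code of the character
        if code >= (1 << bits_per_char):
            raise ValueError("character does not fit in the assumed symbol size")
        value = (value << bits_per_char) | code
    return value


packed = pack_keyword("calculator")
# The packed value fits in 80 bits, so it can be treated as an element
# of the range used for p = 2^80 in the example.
print(hex(packed))
```

The same value is obtained by reading the 10 character codes as one big-endian integer.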

After receiving the TSSS distributed registration data 104A_n for the 10-character search request data, the nth server device 100A_n (1 ≤ n ≤ 3) inputs to the nth data conversion unit 102A_n the information that it holds itself, t_{1,j}, t_{2,j}, …, t_{n,j}, out of the SSS distributed information [t_1]_q, …, [t_200]_q of the characters of the text stored in the nth data storage unit 101A_n.
The nth data conversion unit 102A_n computes
t'_{i,j} = t_{i,j} || t_{i+1,j} || … || t_{i+m−1,j}
for 0 ≤ i ≤ n−m, where m is the keyword length (m = 10 in this example), and sends [t'_{0}]_p, …, [t'_{n−m}]_p to the nth data search unit 103A_n as the nth search data 105A_n.
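The conversion step above amounts to a sliding-window concatenation of the per-character shares held by one server. The following is a rough sketch under stated assumptions: `concat_windows` and `toy_shares` are hypothetical names, the shares are placeholder 8-bit integers rather than real TSSS shares, and Python's 0-based list indexing is used for the window index i.

```python
from typing import List, Sequence


def concat_windows(shares: Sequence[int], m: int, bits: int = 8) -> List[int]:
    """Concatenate every run of m consecutive per-character shares.

    shares[i] stands for the share t_{i,j} that server j holds for the
    i-th character; the result holds t'_{i,j} for 0 <= i <= n - m.
    """
    n = len(shares)
    windows = []
    for i in range(n - m + 1):
        value = 0
        for s in shares[i:i + m]:
            value = (value << bits) | s  # append the next 8-bit share
        windows.append(value)
    return windows


# Placeholder data: 200 arbitrary 8-bit values standing in for one
# server's shares of a 200-character text.
toy_shares = [(7 * k + 3) % 256 for k in range(200)]
search_data = concat_windows(toy_shares, m=10)
print(len(search_data))  # number of 80-bit windows
```

Because the windows are derived on demand from the stored per-character shares, only the 200 small shares need to be kept in storage.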

Thereafter, as in the specific example of the first embodiment, the nth data search unit 103A_n matches the search request data against the substrings by computing, from the input nth distributed search request data [(calculator)]_p and the nth search data [t'_1]_p, …, [t'_191]_p,
[s'_1]_p = [(calculator)]_p − [t'_1]_p,
…
[s'_191]_p = [(calculator)]_p − [t'_191]_p,
and outputs the search result by the same method as in the specific example of the first embodiment.
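The idea that each server can subtract shares locally, and that a difference of zero indicates a match, can be illustrated with a simplified stand-in. The sketch below is not the scheme of the embodiment: it swaps in XOR-based sharing among three parties (addition and subtraction in a field of characteristic 2 are both bitwise XOR) in place of Shamir's threshold scheme, and it reconstructs the differences in the clear, whereas the text defers to the first embodiment's method for producing the search result. All names are hypothetical.

```python
import secrets

M = 10        # keyword length in characters
SERVERS = 3   # number of server devices in the example


def xor_share(data: bytes, parties: int = SERVERS) -> list:
    """Split each byte into XOR shares; all shares are needed to recover it."""
    shares = [secrets.token_bytes(len(data)) for _ in range(parties - 1)]
    last = bytearray(data)
    for s in shares:
        for idx, b in enumerate(s):
            last[idx] ^= b
    shares.append(bytes(last))
    return shares


def server_differences(char_shares: bytes, query_share: bytes) -> list:
    """Run locally by one server: its share of s'_i = query - t'_i per window."""
    q = int.from_bytes(query_share, "big")
    return [q ^ int.from_bytes(char_shares[i:i + M], "big")
            for i in range(len(char_shares) - M + 1)]


text = b"a pocket calculator app"   # illustrative registered text
query = b"calculator"               # illustrative 10-character keyword

text_shares = xor_share(text)       # text_shares[k]: per-character shares of server k
query_shares = xor_share(query)     # query_shares[k]: query share of server k

per_server = [server_differences(text_shares[k], query_shares[k])
              for k in range(SERVERS)]

# Illustration only: combine the servers' shares of each difference.
matches = []
for i in range(len(text) - M + 1):
    diff = 0
    for k in range(SERVERS):
        diff ^= per_server[k][i]
    if diff == 0:                   # zero difference: window i equals the keyword
        matches.append(i)

print(matches)
```

No server sees the text or the keyword in this sketch, but opening the raw differences would reveal more than a match flag, which is why a real protocol would test for zero without disclosing the difference itself.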

In the specific example of the second embodiment, the size of the distributed information of the text stored in each server device is n·log q = 200 × 8 = 1,600 bits per text. If, as in the related art, every substring were held as 80-bit TSSS distributed information, the size of the distributed information would be (1/2)·n·(n+1)·log p = (1/2) × 200 × 201 × 80 = 1,608,000 bits (≈ 200 KB). Thus, in the specific example of the second embodiment, the size of the distributed information is reduced to about 1/1000 of that in the related art.
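The size figures quoted for the two specific examples can be checked with a few lines of arithmetic. The variable names below are illustrative; the formulas and parameter values are the ones given in the text (n = 200, log p = 80, log q = 8).

```python
n = 200      # characters per text
log_p = 80   # bits per substring share (10 characters x 8 bits)
log_q = 8    # bits per character share

# Per-text share size in each scheme, in bits.
first_embodiment_bits = n * log_p + 2 * n * log_q
second_embodiment_bits = n * log_q
related_art_bits = n * (n + 1) // 2 * log_p  # one 80-bit share per substring

ratio_first = related_art_bits / first_embodiment_bits
ratio_second = related_art_bits / second_embodiment_bits

print(first_embodiment_bits, second_embodiment_bits, related_art_bits)
print(ratio_first, ratio_second)
```

The exact ratios are 83.75 and 1005, which the text rounds to 1/83 and 1/1000.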

The present invention is not limited to the above embodiments, and the constituent elements can be modified without departing from the gist of the invention. Various inventions can also be formed by appropriately combining a plurality of constituent elements. For example, the specific examples of the above embodiments were described for the case of three server devices, but the invention is equally applicable when there are N server devices (N is an integer of 2 or more).

According to the first and second embodiments, in a secret search system using a secret sharing scheme, the size of the data (distributed information) held by each server device can be reduced without impairing the search function.

The first and second embodiments can also contribute to more advanced secret data analysis that incorporates secret search. For example, by combining the embodiments with another technique that computes a function while keeping the data secret, a secret data analysis system that performs some processing on data containing certain partial data can be realized.

In a cloud service in which data is entrusted to server devices, the price of the service is usually determined by the data size and the communication charge. A secret search system to which the present embodiments are applied can reduce the size of the data held by the server devices, which makes it possible to lower the price of services that use the secret search system and to promote their use.

The methods described in the first and second embodiments can be executed by a computer. A program for executing these methods can also be stored in and distributed on a recording medium such as a magnetic disk (for example, a floppy (registered trademark) disk or a hard disk), an optical disc (for example, a CD-ROM (Compact Disc-Read Only Memory) or a DVD (Digital Versatile Disc)), a magneto-optical disk (MO), or a semiconductor memory.

The recording medium may use any storage format as long as it can store the program and is readable by a computer.

An operating system running on the computer, or middleware such as database management software or network software, may execute part of each process based on instructions of the program installed on the computer from the recording medium.

Furthermore, the recording medium is not limited to a medium independent of the computer, and also includes a recording medium that stores or temporarily stores a downloaded program transmitted over a LAN (Local Area Network), the Internet, or the like.

The number of recording media is not limited to one. The case where the processing of the above embodiments is executed from a plurality of recording media is also included, and the media may have any configuration.

The computer executes each process based on the program stored in the recording medium, and may have any configuration, such as a single device such as a personal computer or a system in which a plurality of devices are connected via a network.

The term computer is not limited to a personal computer. It also covers an arithmetic processing unit included in an information processing device, and refers to any device or apparatus capable of realizing the functions of the present embodiments by means of a program.

The whole or part of the above embodiments can also be described as in the following supplementary notes, but is not limited to them.

(Supplementary Note 1) A server device comprising:
a data storage unit that stores distributed registration data for each symbol of secret information;
a data conversion unit that converts the distributed registration data stored in the data storage unit into search data for data in which a plurality of symbols are concatenated; and
a data search unit that, using the search data and distributed search request data, searches the distributed registration data in the data storage unit while communicating with a data search unit of another server device, and outputs a distributed search result.

(Supplementary Note 2) The server device according to Supplementary Note 1, wherein
the data storage unit holds the distributed registration data as distributed information of a replicated secret sharing scheme for each symbol, and
the data conversion unit obtains the search data by executing a process of generating, from the per-symbol distributed information of the replicated secret sharing scheme stored in the data storage unit, one piece of distributed information of Shamir's threshold secret sharing scheme for a plurality of symbols.

(Supplementary Note 3) The server device according to Supplementary Note 1, wherein
the data storage unit holds, as the distributed registration data, distributed data of the secret information generated by Shamir's threshold secret sharing scheme,
the data conversion unit obtains the search data by executing a process of concatenating the distributed data of the secret information stored in the data storage unit, and
the data search unit obtains the distributed search result by executing a process of searching with operations over an extension field, using the data concatenated by the data conversion unit.

(Supplementary Note 4) A secret search system comprising first to Nth (N is an integer of 2 or more) server devices, each being the server device according to any one of Supplementary Notes 1 to 3, and a client device connected to the first to Nth server devices via a network, wherein the client device comprises:
a registration data share generation unit that generates, from registration data, first to Nth distributed registration data to be registered in the first to Nth server devices, respectively;
a query data share generation unit that generates, from search request data, first to Nth distributed search request data to be transmitted to the first to Nth server devices, respectively; and
a secret sharing decryption unit that decodes the distributed information of a search result from first to Nth distributed search results received from the first to Nth server devices, respectively.

(Supplementary Note 5) A secret search method in a search system comprising a client device and first to Nth (N is an integer of 2 or more) server devices connected to the client device via a network, the method secret-sharing deposited data by a secret sharing scheme, storing it in the first to Nth server devices, and allowing the data stored in the first to Nth server devices to be searched while kept secret, the method comprising:
a registration data generation step in which the client device executes a distributed data generation procedure of the secret sharing scheme on registration data to generate first to Nth distributed registration data, and transmits the first to Nth distributed registration data to the first to Nth server devices, respectively;
a registration data storage step in which the nth (1 ≤ n ≤ N) server device stores the nth distributed registration data in an nth data storage unit;
a search request data generation step in which the client device executes the distributed data generation procedure of the secret sharing scheme on search request data to generate first to Nth distributed search data, and transmits the first to Nth distributed search data to the first to Nth server devices, respectively;
a search data conversion step in which the nth server device converts the nth distributed registration data stored in the nth data storage unit into nth search data for data in which a plurality of symbols are concatenated;
a search step in which the nth server device, using the nth search data and the nth distributed search data, searches the nth distributed registration data in the nth data storage unit while communicating with the other server devices, and transmits an nth distributed search result to the client device; and
a restoration step in which the client device executes a decoding procedure of the secret sharing scheme on the first to Nth distributed search results received from the first to Nth server devices to restore a search result.

(Supplementary Note 6) The secret search method according to Supplementary Note 5, wherein
in the registration data generation step, the client device converts the registration data into the first to Nth distributed registration data by a replicated secret sharing scheme,
in the search request data generation step, the client device converts the search request data into the first to Nth distributed search data by Shamir's threshold secret sharing scheme, and
in the search data conversion step, the nth server device converts the nth distributed registration data into the nth search data, which is distributed data of Shamir's threshold secret sharing scheme.

(Supplementary Note 7) The secret search method according to Supplementary Note 5, wherein
in the registration data generation step, the client device converts the registration data into the first to Nth distributed registration data by Shamir's threshold secret sharing scheme,
in the search request data generation step, the client device converts the search request data into the first to Nth distributed search data by Shamir's threshold secret sharing scheme, and
in the search data conversion step, the nth server device concatenates the nth distributed registration data into a single piece of data to obtain the nth search data.

(Supplementary Note 8) A search method for executing a search on a server device, the method comprising:
a storage step of storing distributed registration data for each symbol of secret information in a data storage unit;
a conversion step in which a data conversion unit converts the distributed registration data stored in the data storage unit into search data for data in which a plurality of symbols are concatenated; and
a search step in which a data search unit, using the search data and distributed search request data, searches the distributed registration data in the data storage unit while communicating with a data search unit of another server device, and outputs a distributed search result.

(Supplementary Note 9) The search method according to Supplementary Note 8, wherein
in the storage step, the distributed registration data is held in the data storage unit as distributed information of a replicated secret sharing scheme for each symbol, and
in the conversion step, the data conversion unit obtains the search data by executing a process of generating, from the per-symbol distributed information of the replicated secret sharing scheme stored in the data storage unit, one piece of distributed information of Shamir's threshold secret sharing scheme for a plurality of symbols.

(Supplementary Note 10) The search method according to Supplementary Note 8, wherein
in the storage step, distributed data of the secret information generated by Shamir's threshold secret sharing scheme is held in the data storage unit as the distributed registration data,
in the conversion step, the data conversion unit obtains the search data by executing a process of concatenating the distributed data of the secret information stored in the data storage unit, and
in the search step, the data search unit obtains the distributed search result by executing a process of searching with operations over an extension field, using the data concatenated in the conversion step.

(Supplementary Note 11) A search program for causing a server device, which is a computer, to execute a search, the program causing the computer to execute:
a storage procedure of storing distributed registration data for each symbol of secret information in a data storage unit;
a conversion procedure of converting the distributed registration data stored in the data storage unit into search data for data in which a plurality of symbols are concatenated; and
a search procedure of searching the distributed registration data in the data storage unit, using the search data and distributed search request data while communicating with another server device, and outputting a distributed search result.

(Supplementary Note 12) The search program according to Supplementary Note 11, wherein
the storage procedure causes the computer to hold the distributed registration data in the data storage unit as distributed information of a replicated secret sharing scheme for each symbol, and
the conversion procedure causes the computer to obtain the search data by executing a process of generating, from the per-symbol distributed information of the replicated secret sharing scheme stored in the data storage unit, one piece of distributed information of Shamir's threshold secret sharing scheme for a plurality of symbols.

(Supplementary Note 13) The search program according to Supplementary Note 11, wherein
the storage procedure causes the computer to hold, in the data storage unit as the distributed registration data, distributed data of the secret information generated by Shamir's threshold secret sharing scheme,
the conversion procedure causes the computer to obtain the search data by executing a process of concatenating the distributed data of the secret information stored in the data storage unit, and
the search procedure causes the computer to obtain the distributed search result by executing a process of searching with operations over an extension field, using the data concatenated by the conversion procedure.

The present invention also contributes to more advanced secret data analysis that incorporates secret search. For example, a secret data analysis system that performs some processing on data containing certain partial data can be realized by combining the present invention with another technique that computes a function while keeping the data secret.

In a cloud service or the like in which data is entrusted to server devices, the price of the service is usually determined by the data size and the communication charge. Reducing the size of the data held by the server devices in a secret search system is therefore expected to lower the price of services that use the secret search system and to promote their use.

This application claims priority based on Japanese Patent Application No. 2015-032573, filed on February 23, 2015, the entire disclosure of which is incorporated herein.

100_1 to 100_N, 100_n: server device
100A_1 to 100A_N, 100A_n: server device
101_1 to 101_N, 101_n: data storage unit
101A_1 to 101A_N, 101A_n: data storage unit
102_1 to 102_N, 102_n: data conversion unit
102A_1 to 102A_N, 102A_n: data conversion unit
103_1 to 103_N, 103_n: data search unit
103A_1 to 103A_N, 103A_n: data search unit
104_n, 104A_n: distributed registration data
105_n, 105A_n: search data
106_n, 106A_n: distributed search request data
107_n, 107A_n: distributed search result
200, 200A: client device
201, 201A: registration data share generation unit
202, 202A: query data share generation unit
203, 203A: secret sharing decryption unit
204, 204A: registration data
205, 205A: search request data
206, 206A: search result

Claims

A server device comprising:
data storage means for storing distributed registration data for each symbol of secret information;
data conversion means for converting, at the time of search processing, the distributed registration data stored in the data storage means into search data in which a plurality of symbols are concatenated; and
data search means for searching the distributed registration data in the data storage means, using the search data and distributed search request data while communicating with data search means of another server device, and outputting a distributed search result.

The server device according to claim 1, wherein
the distributed registration data is held as distributed information of a replicated secret sharing scheme for each symbol, and
the data conversion means obtains the search data by converting the per-symbol distributed information of the replicated secret sharing scheme stored in the data storage means into one piece of distributed information of a threshold secret sharing scheme for a plurality of symbols.

The server device according to claim 1, wherein
the data storage means holds, as the distributed registration data, distributed information of the secret information converted by a threshold secret sharing scheme,
the data conversion means obtains the search data by concatenating the distributed information of the secret information stored in the data storage means, and
the data search means obtains the distributed search result by performing a search over an extension field using the distributed information concatenated by the data conversion means.

A secret search system comprising first to Nth (N is an integer of 2 or more) server devices, each being the server device according to any one of claims 1 to 3, and a client device connected to the first to Nth server devices via a network,
wherein the client device comprises:
registration data share generation means for converting registration data into first to Nth distributed registration data to be registered in the first to Nth server devices, respectively;
query data share generation means for converting search request data into first to Nth distributed search request data to be transmitted to the first to Nth server devices, respectively; and
secret sharing decryption means for decoding the distributed information of a search result from first to Nth distributed search results received from the first to Nth server devices, respectively.

A secret search method in a search system comprising a client device and first to Nth (N is an integer of 2 or more) server devices connected to the client device via a network, wherein
data storage means of the nth (1 ≤ n ≤ N) server device stores, as nth distributed registration data for each symbol of registration data, the corresponding one of first to Nth distributed registration data obtained by the client device converting the registration data using a secret sharing scheme,
the client device converts search request data into first to Nth distributed search data using the secret sharing scheme, and transmits the first to Nth distributed search data to the first to Nth server devices, respectively,
the nth server device converts the nth distributed registration data stored in the nth data storage means into nth search data in which a plurality of symbols are concatenated,
the nth server device, using the nth search data and the nth distributed search data, searches the nth distributed registration data in the nth data storage means while communicating with the other server devices, and transmits an nth distributed search result to the client device, and
the client device executes a decoding procedure of the secret sharing scheme on the first to Nth distributed search results received from the first to Nth server devices to restore a search result.

The secret search method according to claim 5, wherein
the client device converts the registration data into the first to Nth distributed registration data by a replicated secret sharing scheme,
the client device converts the search request data into the first to Nth distributed search data by Shamir's threshold secret sharing scheme, and
the nth server device converts the nth distributed registration data into the nth search data, which is distributed data of Shamir's threshold secret sharing scheme.

The secret search method according to claim 5, wherein
the client device converts the registration data into the first to Nth distributed registration data by Shamir's threshold secret sharing scheme,
the client device converts the search request data into the first to Nth distributed search data by Shamir's threshold secret sharing scheme, and
the nth server device concatenates the nth distributed registration data into a single piece of data to obtain the nth search data.

A search method for executing a search on a server device, the method comprising:
converting, at the time of search processing, distributed registration data for each symbol of secret information into search data in which a plurality of symbols are concatenated; and
searching the distributed registration data, using the search data and distributed search request data while communicating with another server device, and outputting a distributed search result.

A search program causing a computer to execute:
converting, at the time of search processing, distributed registration data for each symbol of secret information into search data in which a plurality of symbols are concatenated; and
searching the distributed registration data, using the search data and distributed search request data while communicating with another server device, and outputting a distributed search result.