JP2009271584A

JP2009271584A - Similar information retrieval system and similar information retrieval program

Info

Publication number: JP2009271584A
Application number: JP2008118871A
Authority: JP
Inventors: Shogo Shimizu; 將吾清水
Original assignee: Tokyo Metropolitan Public University Corp
Current assignee: Tokyo Metropolitan Public University Corp
Priority date: 2008-04-30
Filing date: 2008-04-30
Publication date: 2009-11-19

Abstract

<P>PROBLEM TO BE SOLVED: To efficiently retrieve similar information of retrieval information while being concealed from a manager and a third person. <P>SOLUTION: This similar information retrieval system is provided with: a secret registration information storage means CD2 for storing a polynomial point group (R); a retrieval numerical value discriminating means CD4 for discriminating whether or not retrieval numerics (b<SB>1</SB>to b<SB>m-L</SB>) included in secret retrieval information (B*) are the same values as numerical values (a<SB>1</SB>to a<SB>r</SB>) included in respective secret registration information (R<SB>1</SB>to R<SB>N</SB>); a polynomial point subset calculation means CD5 for calculating polynomial point subsets (Q<SB>1</SB>* to Q<SB>N</SB>*); a polynomial point subset element number discriminating means CD6 for discriminating whether or not the points of the polynomial point subsets (Q<SB>1</SB>* to Q<SB>N</SB>*) are equal to or more than (c-L) pieces; and a secret similar information calculation means CD7 for calculating secret similar information (R<SB>A</SB>) by calculating respective secret registration information (R<SB>1</SB>to R<SB>N</SB>) corresponding to the polynomial point subsets (Q<SB>1</SB>* to Q<SB>N</SB>*) having at least (c-L) pieces of points. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、データベースの外部委託業者としての管理者および第三者から秘匿された状態で前記データベースから検索情報の類似情報を検索する類似情報検索システムおよび類似情報検索プログラムに関する。 The present invention relates to a similar information search system and a similar information search program for searching for similar information of search information from the database in a state that it is concealed from an administrator as a database outsourcer and a third party.

従来より、データベースの管理業務について、前記データベースの構築等の技術的な問題や、前記データベースのメンテナンス費や管理費の削減等の経済的な問題から、外部委託をすることが行われている。前記データベースの外部委託のモデルについて、例えば、Database-as-a-Service（ＤＡＳ、ＤａａＳ）モデルが知られている。なお、本願明細書では、以降、「ＤＡＳモデル」と記載する。
前記ＤＡＳモデルは、前記データベースに情報（データ）を登録する（格納する）データ所有者としての「登録者」と、前記データベースから前記情報を検索する「検索者」と、前記データベースを管理する「管理者」とによって構成される。なお、前記登録者と前記検索者とは必ずしも同一人物である必要はなく、前記管理者は、いわゆる、外部委託業者であり、前記データベースの管理業務のみ委託されるものとする。 Conventionally, database management work has been outsourced due to technical problems such as construction of the database and economic problems such as reduction of maintenance and management costs of the database. As a database outsourcing model, for example, a Database-as-a-Service (DAS, DaaS) model is known. In the specification of the present application, it is hereinafter referred to as “DAS model”.
The DAS model registers (stores) information (data) in the database, a “registrant” as a data owner, a “searcher” who searches for the information from the database, and manages the database. "Administrator". Note that the registrant and the searcher are not necessarily the same person, and the manager is a so-called outsourcing contractor, and only the management work of the database is entrusted.

前記ＤＡＳモデルにおいて、例えば、前記情報が個人情報や機密情報等であった場合には、前記登録者および前記検索者以外の第三者、すなわち、前記登録者および前記検索者と前記データベースとの間の通信路上の第三者から、前記情報を秘匿できることが望ましい。また、この場合、外部委託業者である前記管理者からも、前記第三者と同様に、前記情報を秘匿できることが望ましい。
前記データベースに格納された前記情報を、前記第三者等から秘匿するための技術として、下記の非特許文献１〜４に記載の技術が知られている。 In the DAS model, for example, when the information is personal information or confidential information, a third party other than the registrant and the searcher, that is, the registrant and the searcher and the database. It is desirable that the information can be concealed from a third party on the communication path. In this case, it is desirable that the information can be concealed from the manager who is an outsourcer as well as the third party.
As techniques for concealing the information stored in the database from the third party or the like, techniques described in Non-Patent Documents 1 to 4 below are known.

非特許文献１〜３には、ＤＡＳモデルにおいて、暗号化されたデータである暗号化データが格納されたデータベース、いわゆる、暗号化データベースについての技術が記載されている。非特許文献１〜３には、従来公知のＳＱＬ（Structured Query Language）を用いて問い合わせ等の操作が行われる、いわゆる、関係データベース（ＲＤＢ：Relational Database、リレーショナルデータベース）について、前記ＳＱＬにより、前記暗号化データに関する問い合わせ処理を行う技術について記載されている。
なお、非特許文献１〜３において、前記暗号化データは、属性等の情報が付与された、いわゆる、関係データである。このため、前記ＳＱＬによる問い合わせ処理は、前記属性に基づいて、効率的に行うことができる。例えば、所定の範囲内の属性の値を有する前記暗号化データのみを抽出するフィルタリング処理等を行うことができる。 Non-Patent Documents 1 to 3 describe a technique about a database in which encrypted data that is encrypted data in the DAS model is stored, that is, a so-called encrypted database. In Non-Patent Documents 1 to 3, a so-called relational database (RDB: Relational Database), in which an operation such as a query is performed using a conventionally known SQL (Structured Query Language), the encryption is performed by the SQL. Describes a technique for performing inquiries regarding digitized data.
In Non-Patent Documents 1 to 3, the encrypted data is so-called relational data to which information such as attributes is added. For this reason, the inquiry process by the SQL can be efficiently performed based on the attribute. For example, a filtering process or the like that extracts only the encrypted data having an attribute value within a predetermined range can be performed.

また、非特許文献４には、ＤＡＳモデルとは直接関係ないが、複数のデータ提供者（provider、Bob）およびデータ検索者（querier、Alice）が存在する場合に、前記データ検索者の問い合わせた質問のデータ、いわゆる、クエリ（query）が、前記データ提供者が有するドキュメント（document）のデータベースに含まれるか否かを安全に問い合わせるために、ブルームフィルタ（Bloom filter）を用いた技術が記載されている。ここで、前記ブルームフィルタとは、前記データ提供者の鍵（Ｋ_Ｂ）で暗号化された前記ドキュメントおよび前記データ検索者の鍵（Ｋ_Ａ）で暗号化された前記クエリのことである。 In Non-Patent Document 4, although there is no direct relationship with the DAS model, when there are a plurality of data providers (provider, Bob) and data searchers (querier, Alice), the data searcher makes an inquiry. A technique using a Bloom filter is described for safely inquiring whether query data, so-called query, is included in the document database of the data provider. ing. Here, the Bloom filter is the document encrypted with the data provider key (K _B ) and the query encrypted with the data searcher key (K _A ).

非特許文献４では、まず、前記データ提供者が、前記データ提供者のブルームフィルタを公開する。次に、前記データ検索者が、前記データ検索者のブルームフィルタを、信頼できる第三者機関（trusted third party，Ted）に送信する。次に、前記第三者機関が、グループ暗号の原理に基づいて、前記データ検索者のブルームフィルタ、すなわち、前記データ検索者の鍵で暗号化された前記クエリを変換して、前記データ提供者の鍵で暗号化された前記クエリを出力する。そして、前記第三者機関が、前記データ提供者の鍵で暗号化された前記クエリと、公開された前記データ提供者のブルームフィルタとを照合することにより、前記クエリが、前記ドキュメントとして前記データベースに格納されているか否かを判別する。すなわち、非特許文献４には、前記クエリと、前記ドキュメントとを暗号化した状態で照合する技術が記載されている。 In Non-Patent Document 4, the data provider first discloses the data provider's Bloom filter. Next, the data searcher sends the data searcher's Bloom filter to a trusted third party (Ted). Next, the third party converts the query encrypted with the data searcher's Bloom filter, that is, the key of the data searcher, based on the principle of group encryption, and the data provider The query encrypted with the key is output. Then, the third party collates the query encrypted with the data provider's key with the published data provider's Bloom filter, so that the query becomes the database as the document. It is determined whether it is stored in the. That is, Non-Patent Document 4 describes a technique for collating the query and the document in an encrypted state.

ハカン・ハジグィムィシ（Hakan Hacigumus）、他２名、“暗号化されたデータの検索について（Search on Encrypted Data）”、“アドバンシーズ・イン・インフォメーション・セキュリティ（第３３巻）セキュア・データ・マネジメント・イン・ディセントラライズド・システムズ（Advances in Information Security / Secure Data Management in Decentralized Systems (Edited by T. Yu and S. Jajodia)）”、（米国）、シュプリンガー（Springer）、２００７年５月１１日、ｐ．３８３−４２６Hakan Hacigumus and two others, “Search on Encrypted Data”, “Advances in Information Security (Vol. 33) Secure Data Management in・ Decentralized Systems (Advances in Information Security / Secure Data Management in Decentralized Systems (Edited by T. Yu and S. Jajodia)) ”, (USA), Springer, May 11, 2007, p. . 383-426 ハカン・ハジグィムィシ（Hakan Hacigumus）、他３名、“データベースサービスプロバイダモデルの暗号化されたデータに対するＳＱＬの実施について（Executing SQL over Encrypted Data in the Database-Service-Provider Model）”、“エスアイジーエムオーディ・カンファレンス（SIGMOD Conference（Edited by M. J. Franklin, B. Moon and A. Ailamaki））”、（米国）、エーシーエム（ACM）、２００２年、ｐ．２１６−２２７Hakan Hacigumus and three others, “Executing SQL over Encrypted Data in the Database-Service-Provider Model”, “SMD MOD SIGMOD Conference (Edited by MJ Franklin, B. Moon and A. Ailamaki) ”, (USA), ACM, 2002, p. 216-227 三浦志保、渡辺知恵美、“管理者に対しても機密を保持できる暗号化データベースの索引構成法”、「online」、２００７年、電子情報通信学会第１８回データ工学ワークショップ／第５回日本データベース学会年次大会（DEWS2007）、「２００８年３月２６日検索」、インターネット＜ＵＲＬ：http://www.ieice.org/iss/de/DEWS/DEWS2007/pdf/e7-8.pdf＞Shiho Miura, Chiemi Watanabe, “Index Construction Method for Encrypted Databases That Can Keep Confidentiality for Administrators”, “online”, 2007, IEICE 18th Data Engineering Workshop / 5th Japan Database Annual Conference of the Society (DEWS2007), "Search on March 26, 2008", Internet <URL: http://www.ieice.org/iss/de/DEWS/DEWS2007/pdf/e7-8.pdf> スティーブン・エム・ベロビン（Steven M. Bellovin）、他１名、“暗号化されたブルームフィルタを用いて秘匿性が高められた検索について（Privacy-Enhanced Searches Using Encrypted Bloom Filters）”、（米国）、エーティアンドティ（AT&T）、２００４年３月２９日Steven M. Bellovin and one other, “Privacy-Enhanced Searches Using Encrypted Bloom Filters” (US), AT & T, March 29, 2004 エスコ・ウッコネン（Esko Ukkonen）、“ｑ−ｇｒａｍおよび最大の共通文字列集合に関する類似文字列照合について（Approximate string matching with q-grams and maximal matches）”、セオレシカル・コンピュータ・サイエンス・９２（Theoretical Computer Science 92）、（米国）、エルシビアー（Elsevier）、１９９２年、ｐ．１９１−２１１Esko Ukkonen, “Approximate string matching with q-grams and maximal matches”, Theoretical Computer Science 92 92), (USA), Elsevier, 1992, p. 191-211 今井秀樹著、「符号理論」、社団法人電子情報通信学会、１９９０年、ｐ．４７−５４，１５５−１６０，１６９−１７４，１７９−１８０Hideki Imai, “Code Theory”, The Institute of Electronics, Information and Communication Engineers, 1990, p. 47-54, 155-160, 169-174, 179-180 アリ・ジュエルズ（Ari Juels）、他１名、“ファジーボールトについて（A Fuzzy Vault Scheme）”、「online」、２００６年、デザインズ・コーズ・アンド・クリプトグラフィ（Designs, Codes and Cryptography）、「２００８年４月１日検索」、インターネット＜ＵＲＬ：http://www.rsa.com/rsalabs/staff/bios/ajuels/publications/fuzzy-vault/fuzzy_vault.pdf＞Ari Juels, 1 other, “A Fuzzy Vault Scheme”, “online”, 2006, Designs, Codes and Cryptography, “2008 Search April 1, 2009, Internet <URL: http://www.rsa.com/rsalabs/staff/bios/ajuels/publications/fuzzy-vault/fuzzy_vault.pdf> 大貫泰紀、高橋佑介、「指紋がキーとなる金庫 "Indexed Fuzzy Vault" の開発」、「online」、２００８年２月１４日、東海大学、「２００８年３月７日検索」、インターネット＜ＵＲＬ：http://www.cs.dm.u-tokai.ac.jp/DM2007/A1-Kikuchi2.doc＞Onuki Yasunori and Takahashi Keisuke, “Development of Indexed Fuzzy Vault”, “online”, February 14, 2008, Tokai University, “March 7, 2008 search”, Internet <URL: http://www.cs.dm.u-tokai.ac.jp/DM2007/A1-Kikuchi2.doc> 權娟大、他２名、“ＧＧＤＢ：糖鎖遺伝子データベース検索システム（GGDB: A database system for glycogenes）”、“第２回糖鎖科学コンソーシアムシンポジウム（The Second Symposium of Japanese Consortium for Glycobiology and Glycotechnology）”、日本糖鎖科学コンソーシアム、２００４年、ｐ．４２−４３Tsujidai and two others, “GGDB: A database system for glycogenes”, “The Second Symposium of Japanese Consortium for Glycobiology and Glycotechnology” , Japan Glycoscience Consortium, 2004, p. 42-43 ルイ・ヤン（Rui Yang）、他２名、“木構造データの類似評価について（Similarity Evaluation on Tree-structured Data）”、“エスアイジーエムオーディー・カンファレンス（SIGMOD Conference）”、（米国）、エーシーエム（ACM）、２００５年、ｐ．７５４−７６５Rui Yang and two others, “Similarity Evaluation on Tree-structured Data”, “SIGMOD Conference”, (USA), ACM ( ACM), 2005, p. 754-765 カリン・カイリング（Karin Kailing）、他３名、“大規模データベースにおける階層データの効果的な類似検索について（Efficient Similarity Search for Hierarchical Data in Large Databases）”、（ギリシャ共和国）、エクステンディング・データベース・テクノロジー（Extending Database Technology）、２００４年、ｐ．６７６−６９３Karin Kailing and three others, “Efficient Similarity Search for Hierarchical Data in Large Databases”, (Greece), Extending Database Technology (Extending Database Technology), 2004, p. 676-693 アポストロス・エヌ・パラドポウロス（Apostolos N. Papadopoulos）、他１名、“グラフのヒストグラムによる構造化された類似検索について（Structure-Based Similarity Search with Graph Histograms）”、（ギリシャ共和国）、“ディーイーエックスエー・ワークショップ（DEXA Workshop）”、１９９９年、ｐ．１７４−１７８Apostolos N. Papadopoulos, 1 other, “Structure-Based Similarity Search with Graph Histograms”, (Greece), “DLX "DEXA Workshop", 1999, p. 174-178

（従来技術の問題点）
前記ＤＡＳモデルにおいて、例えば、前記情報がＤＮＡ（Deoxyribonucleic acid、デオキシリボ核酸）の塩基配列情報（核酸配列情報）や、蛋白質のアミノ酸配列情報等の、いわゆる、遺伝子情報であった場合、前記データベースは、前記遺伝子情報が格納された遺伝子情報データベースとなる。なお、前記遺伝子情報データベースは、いわゆる、遺伝子解析の分野における研究機関等において利用されている。
ここで、前記遺伝子情報が、疾患等に関わる可能性がある遺伝子、いわゆる、候補遺伝子の情報を含む場合には、経済的・商業的・実用的な用途や価値を有する可能性があり、機密情報として取り扱われることがある。この場合、前記第三者および前記管理者から、前記遺伝子情報を秘匿できることが望ましい。 (Problems of conventional technology)
In the DAS model, for example, when the information is so-called gene information such as DNA (Deoxyribonucleic acid, deoxyribonucleic acid) base sequence information (nucleic acid sequence information) and protein amino acid sequence information, the database is: The gene information database stores the gene information. The gene information database is used in so-called research institutions in the field of gene analysis.
Here, if the gene information includes information on genes that may be related to diseases, so-called candidate genes, there is a possibility of having economic, commercial, practical use and value, and confidentiality. May be treated as information. In this case, it is desirable that the genetic information can be concealed from the third party and the administrator.

図９はＤＡＳモデルにおける遺伝子情報データベースに従来公知の暗号化データベースの技術を適用した場合の説明図である。
よって、前記遺伝子情報を秘匿するために、図９に示すように、前記非特許文献１〜３に記載された暗号化データベースの技術を適用することが考えられる。
しかしながら、前記非特許文献１〜３の技術では、前記暗号化データを暗号化する際に従来公知の暗号アルゴリズムを使用する場合には、前記登録者および前記検索者が、暗号化された前記関係データや前記問い合わせを生成するための鍵が必要になる。 FIG. 9 is an explanatory diagram when a conventionally known encryption database technique is applied to the gene information database in the DAS model.
Therefore, in order to conceal the genetic information, it is conceivable to apply the encryption database technology described in Non-Patent Documents 1 to 3, as shown in FIG.
However, in the techniques of Non-Patent Documents 1 to 3, when a conventionally known encryption algorithm is used when encrypting the encrypted data, the registrant and the searcher are connected with the encrypted relationship. A key is required to generate data and the query.

このため、前記登録者および前記検索者の人数に応じて前記鍵の管理等の問題があった。すなわち、前記登録者および前記検索者の人数に比例して前記鍵の数が多くなるため、鍵管理の安全性（厳格性）やそれに伴う管理コスト等の問題があった。
また、前記非特許文献４の技術については、前記複数のデータ提供者およびデータ検索者と、信頼できる第三者機関とが存在する場合に適用される。すなわち、前記非特許文献４の技術は、前記第三者機関を介した前記各データ提供者と前記各データ検索者との直接通信、いわゆる、Ｐ２Ｐ（Peer to Peer，peer-to-peer）型の通信に用いられることが想定されている。このため、前記各データ提供者どうしで共有する前記遺伝子情報データベースを構築すること自体が想定されておらず、前記ＤＡＳモデルを適用できないという問題がある。また、仮に、前記各データ提供者のデータベースを前記遺伝子情報データベースと想定した場合でも、前記非特許文献１〜３と同様に、前記各データ提供者、前記各データ検索者、前記第三者機関が有する鍵について、前記鍵管理の問題があった。 For this reason, there existed problems, such as management of the said key, according to the number of the said registrants and the said searchers. That is, since the number of keys increases in proportion to the number of registrants and searchers, there are problems such as the security (strictness) of key management and the associated management costs.
The technique of Non-Patent Document 4 is applied when there are a plurality of data providers and data searchers and a reliable third party organization. That is, the technique of Non-Patent Document 4 is a direct communication between each data provider and each data searcher via the third party organization, so-called P2P (Peer to Peer, peer-to-peer) type. It is assumed that it will be used for communication. For this reason, it is not envisaged to construct the gene information database shared by the data providers, and the DAS model cannot be applied. Further, even if the database of each data provider is assumed to be the gene information database, each of the data providers, each data searcher, and the third party organization, as in the non-patent documents 1 to 3. There is a problem of the key management with respect to the keys possessed by.

また、前記遺伝子解析では、前記検索者が知得した遺伝子情報が業界内で既知であるか否かを判別するために、前記遺伝子情報に基づく検索情報（問い合わせ情報）から、前記遺伝子情報データベースに格納された遺伝子情報のうち、前記検索情報に類似する前記遺伝子情報である類似情報を検索する類似情報検索処理が頻繁に行われる。例えば、図９に示すように、前記遺伝子情報を塩基配列の文字列情報とした場合に、前記検索情報（「ＧＧＣＣＡＧＧＧＣＡＣＣ」）に対して、前記遺伝子情報データベースに格納された前記類似情報（「ＧＡＣＣＧＧＧＧＴＧＣＡ」）が出力される前記類似情報検索処理が頻繁に行われる。 Further, in the gene analysis, in order to determine whether the gene information obtained by the searcher is known in the industry, from the search information (inquiry information) based on the gene information to the gene information database. Of the stored gene information, a similar information search process for searching for similar information that is the gene information similar to the search information is frequently performed. For example, as shown in FIG. 9, when the gene information is character string information of a base sequence, the similarity information (“GACCGGGGTGCA”) stored in the gene information database is compared with the search information (“GGCCAGGGCACC”). The similar information search process in which “)” is output is frequently performed.

しかしながら、前記非特許文献１〜３では、前記関係データベースに格納された暗号化データは、前記ＳＱＬによる問い合わせ処理に応じた前記属性等の情報が付加された関係データである。このため、前記類似情報検索処理のような類似文字列検索等が想定されておらず、暗号化された前記関係データのままでは効率良く前記類似情報を出力できないという問題があった。例えば、前記暗号化データの完全一致や属性一致による検索等しか行うことができないという問題があった。
また、前記非特許文献４の技術についても、前記類似情報検索処理のような類似文字列検索等が想定されておらず、前記類似情報であるか否かを判別するために、暗号化された前記ブルームフィルタを復号化してから照合する必要があるため、効率的に前記類似情報を出力できないという問題があった。 However, in Non-Patent Documents 1 to 3, the encrypted data stored in the relational database is relational data to which information such as the attribute corresponding to the inquiry processing by the SQL is added. For this reason, a similar character string search or the like as in the similar information search process is not assumed, and there is a problem in that the similar information cannot be output efficiently with the encrypted relational data. For example, there has been a problem that only search based on complete match or attribute match of the encrypted data can be performed.
Further, the technique of Non-Patent Document 4 is also assumed to be similar character search such as the similar information search process, and is encrypted to determine whether the information is the similar information. Since it is necessary to collate after decoding the Bloom filter, there is a problem that the similar information cannot be output efficiently.

本発明は、前述の事情に鑑み、管理者および第三者から秘匿された状態で検索情報の類似情報を効率良く検索することを技術的課題とする。 In view of the above-described circumstances, an object of the present invention is to efficiently search for similar information of search information in a state of being hidden from an administrator and a third party.

前記技術的課題を解決するために、請求項１記載の発明の類似情報検索システムは、
登録対象の情報である登録情報を記憶する記憶装置と、
前記記憶装置と情報の送受信が可能に接続され、前記記憶装置に対して、前記登録情報を登録させる登録装置と、
前記記憶装置と情報の送受信が可能に接続され、前記記憶装置に対して、記憶された前記登録情報のうち、検索対象の情報である検索情報と同一または類似する前記登録情報である類似情報を検索させる検索装置と、
を有する類似情報検索システムであって、
自然数をそれぞれｃ，ｄ，ｋ，Ｌ，ｎ，ｍ，ｑ，ｒとし、前記検索情報のうちの操作対象となる単位の情報である操作単位情報について、ｄ回の挿入・削除・置換の操作を行うことにより、前記検索情報が前記類似情報に変換され、且つ、前記登録情報について、ｑ個の前記操作単位情報を有する部分情報をｎ個以上演算可能であり、且つ、前記検索情報について、ｑ個の前記操作単位情報を有する部分情報をｍ個以上演算可能であり、且つ、ｃがｄ，ｑ，ｎ，ｍに基づいて演算され、且つ、ｃ≧Ｌ，ｍ≧Ｌ，ｒ≧ｎがそれぞれ成立するものとした場合に、
前記登録装置は、
前記登録情報に基づいて、（ｋ−１）次元で１変数の多項式であって、前記登録情報を復元可能な前記多項式を演算する多項式演算手段と、
前記登録情報に基づいて、前記登録情報を復元可能なｎ個の前記部分情報である登録部分情報を抽出する登録部分情報抽出手段と、
抽出されたｎ個の前記登録部分情報に基づいて、ｎ種類の数値である登録数値を要素とする集合である登録集合を演算する登録集合演算手段と、
前記登録数値が代入された前記多項式の数値である登録代入値を演算する登録代入値演算手段と、
前記登録数値以外の数値である（ｒ−ｎ）種類の擬似数値を演算する擬似数値演算手段と、
前記擬似数値が代入された前記多項式の数値以外の数値である擬似代入値を演算する擬似代入値演算手段と、
前記登録数値および前記登録数値に対応する前記登録代入値を一組とする前記多項式上の点を登録多項式点とし、前記擬似数値および前記擬似数値に対応する前記擬似代入値を一組とする前記多項式以外の点を擬似多項式点とした場合に、ｎ個の前記登録多項式点と、（ｒ−ｎ）個の前記擬似多項式点とを有するｒ個の点の集合である多項式点集合を演算することにより、前記登録情報が秘匿化された秘匿登録情報を演算する秘匿登録情報演算手段と、
演算された前記秘匿登録情報を、前記記憶装置に対して送信する秘匿登録情報送信手段と、
を有し、
前記記憶装置は、
前記秘匿登録情報送信手段により送信された前記秘匿登録情報を受信する秘匿登録情報受信手段と、
受信した前記秘匿登録情報を記憶する秘匿登録情報記憶手段と、
を有し、
前記検索装置は、
前記検索情報に基づいて、前記検索情報を復元可能なｍ個の前記部分情報である検索部分情報を抽出する検索部分情報抽出手段と、
抽出されたｍ個の前記検索部分情報に基づいて、ｍ種類の数値である検索数値を要素とする集合である検索集合を演算する検索集合演算手段と、
演算された前記検索集合を記憶する検索集合記憶手段と、
ｍ種類の前記検索数値のうち、Ｌ種類の前記検索数値を除く（ｍ−Ｌ）種類の前記検索数値を要素とする前記検索集合の部分集合である検索部分集合を演算することにより、前記検索情報が秘匿化された秘匿検索情報を演算する秘匿検索情報演算手段と、
演算された前記秘匿検索情報を、前記記憶装置に対して送信する秘匿検索情報送信手段と、
を有し、
前記記憶装置は、
前記秘匿検索情報送信手段により送信された前記秘匿検索情報を受信する秘匿検索情報受信手段と、
受信した前記秘匿検索情報に含まれる前記検索数値が、記憶した前記各秘匿登録情報に含まれる前記登録数値または前記擬似数値と同値であるか否かを判別する検索数値判別手段と、
前記検索数値と同値となる前記登録数値の前記登録多項式点および前記擬似数値の前記擬似多項式点を抽出することにより、前記多項式点集合における前記検索数値の射影集合であって、前記多項式点集合の部分集合である多項式点部分集合を演算する多項式点部分集合演算手段と、
演算された前記多項式点部分集合の点が（ｃ−Ｌ）個以上であるか否かを判別する多項式点部分集合要素数判別手段と、
（ｃ−Ｌ）個以上の点を有する前記多項式点部分集合に対応する前記各秘匿登録情報を演算することにより、前記類似情報が秘匿化された秘匿類似情報を演算する秘匿類似情報演算手段と、
演算された前記秘匿類似情報を、前記検索装置に対して送信する秘匿類似情報送信手段と、
を有し、
前記検索装置は、
前記秘匿類似情報送信手段により送信された前記秘匿類似情報を受信する秘匿類似情報受信手段と、
記憶した前記検索集合に含まれる前記検索数値が、受信した前記各秘匿類似情報に含まれる前記登録数値または前記擬似数値と同値であるか否かを判別する検索数値判別手段と、
前記検索数値と同値となる前記登録数値および前記擬似数値を抽出する数値抽出手段と、
抽出された前記登録数値および前記擬似数値がｃ個以上であるか否かを判別することにより、前記各秘匿類似情報が、前記検索情報に対する前記類似情報として復元可能であるか否かを判別する類似情報復元判別手段と、
抽出された前記登録数値および前記擬似数値がｃ個以上である場合に、前記登録数値および前記擬似数値に基づいて、前記多項式を演算して前記類似情報を復元する類似情報復元手段と、
を有する
ことを特徴とする。 In order to solve the technical problem, the similar information search system according to the first aspect of the present invention provides:
A storage device for storing registration information which is information to be registered;
A registration device that is connected to the storage device so as to be able to transmit and receive information, and that registers the registration information to the storage device;
Information similar to the registration information that is the same as or similar to the search information that is the search target information among the stored registration information that is connected to the storage device so as to be able to transmit and receive information. A search device for searching;
A similar information retrieval system having
The natural numbers are c, d, k, L, n, m, q, and r, respectively, and d operations of insertion / deletion / replacement are performed on the operation unit information that is the unit information to be operated in the search information. The search information is converted into the similar information, and n pieces of partial information having q pieces of the operation unit information can be calculated for the registration information. m or more pieces of partial information having q pieces of the operation unit information can be calculated, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L, r ≧ n Are assumed to hold,
The registration device
Based on the registration information, a polynomial computing means for computing the polynomial that is a univariate (k-1) dimension and that can restore the registration information;
Registered partial information extracting means for extracting registered partial information, which is n pieces of partial information capable of restoring the registered information, based on the registered information;
A registered set calculation means for calculating a registered set, which is a set having n registered numeric values as elements, based on the extracted n pieces of registered partial information;
A registered substitution value calculating means for calculating a registered substitution value that is a numerical value of the polynomial into which the registered numeric value is substituted;
Pseudo numerical value calculating means for calculating (rn) types of pseudo numerical values which are numerical values other than the registered numerical values;
Pseudo-substitution value calculating means for calculating a pseudo-substitution value that is a numerical value other than the numerical value of the polynomial into which the pseudo-numeric value is substituted;
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point, and the pseudo numerical value and the pseudo substitution value corresponding to the pseudo numerical value are a set. When a point other than the polynomial is set as a pseudo-polynomial point, a polynomial point set that is a set of r points having n registered polynomial points and (r−n) pseudo-polynomial points is calculated. A secret registration information calculation means for calculating secret registration information in which the registration information is concealed;
Secret registration information transmitting means for transmitting the calculated secret registration information to the storage device;
Have
The storage device
Confidential registration information receiving means for receiving the confidential registration information transmitted by the confidential registration information transmitting means;
Secret registration information storage means for storing the received secret registration information;
Have
The search device includes:
Search partial information extraction means for extracting search partial information which is m pieces of partial information capable of restoring the search information based on the search information;
A search set calculation means for calculating a search set that is a set having m search numerical values as elements based on the extracted m pieces of search partial information;
Search set storage means for storing the calculated search set;
By calculating a search subset that is a subset of the search set having (m−L) types of the search numerical values excluding L types of the search numerical values among the m types of search numerical values, the search Secret search information calculation means for calculating secret search information in which information is concealed;
A secret search information transmitting means for transmitting the calculated secret search information to the storage device;
Have
The storage device
Secret search information receiving means for receiving the secret search information transmitted by the secret search information transmitting means;
Search numerical value determining means for determining whether the search numerical value included in the received confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
Polynomial point subset element number determining means for determining whether or not the calculated points of the polynomial point subset are (c−L) or more;
(CL) Concealed similarity information computing means for computing concealed similarity information in which the similar information is concealed by computing each concealment registration information corresponding to the polynomial point subset having at least (c−L) points ,
A secret similarity information transmitting means for transmitting the calculated secret similarity information to the search device;
Have
The search device includes:
A concealment similarity information receiving means for receiving the concealment similarity information transmitted by the concealment similarity information transmission means;
Search numerical value determining means for determining whether or not the search numerical value included in the stored search set is the same as the registered numerical value or the pseudo numerical value included in each received secret similar information;
Numerical value extraction means for extracting the registered numerical value and the pseudo numerical value that are the same as the search numerical value;
It is determined whether or not each of the secret similar information can be restored as the similar information with respect to the search information by determining whether or not the extracted registered numerical value and the pseudo numerical value are c or more. Similar information restoration discrimination means,
Similar information restoration means for computing the polynomial and restoring the similar information based on the registered numeric value and the pseudo numeric value when the extracted registered numeric value and the pseudo numeric value are c or more,
It is characterized by having.

請求項２に記載の発明は、請求項１に記載の類似情報検索システムにおいて、
前記自然数ｃ，ｋ，ｎについて、ｃ≧（ｎ＋ｋ）／２が成立するものとした場合に、
前記登録装置は、
前記登録情報に基づいて、（ｋ−１）次元で１変数の多項式であって、前記登録情報を復元するために必要な前記多項式上の点が｛（ｎ＋ｋ）／２｝個以上となる前記多項式を演算する多項式演算手段と、
を有する
ことを特徴とする。 The invention according to claim 2 is the similar information search system according to claim 1,
When c ≧ (n + k) / 2 holds for the natural numbers c, k, n,
The registration device
Based on the registration information, the polynomial is a one-variable polynomial in (k−1) dimensions, and the number of points on the polynomial necessary for restoring the registration information is {(n + k) / 2} or more. A polynomial calculation means for calculating a polynomial;
It is characterized by having.

前記技術的課題を解決するために、請求項３記載の発明の類似情報検索システムは、
登録対象の情報を登録情報とし、
前記登録情報と同一または類似する情報を類似情報とし、
自然数をそれぞれｄ，ｋ，ｎ，ｑ，ｒとし、
前記登録情報のうちの操作対象となる単位の情報である操作単位情報について、ｄ回の挿入・削除・置換の操作を行うことにより、前記登録情報が前記類似情報に変換され、且つ、前記登録情報について、ｑ個の前記操作単位情報を有する部分情報をｎ個以上演算可能であり、且つ、ｒ≧ｎが成立するものとし、
前記登録情報に基づいて抽出された前記登録情報を復元可能なｎ個の前記部分情報を登録部分情報とし、
抽出されたｎ個の前記登録部分情報に基づいて演算されたｎ種類の数値である登録数値を要素とする集合を登録集合とし、
前記登録情報に基づいて演算された（ｋ−１）次元で１変数の多項式であって、前記登録情報を復元可能な前記多項式に前記登録数値が代入されて演算された数値を登録代入値とし、
前記登録数値および前記登録数値に対応する前記登録代入値を一組とする前記多項式上の点を登録多項式点とし、
前記登録数値以外の数値である擬似数値および前記擬似数値に対応する擬似代入値であって、前記擬似数値が代入された前記多項式の数値以外の数値である前記擬似代入値を一組とする前記多項式以外の点を擬似多項式点とした場合に、
ｎ個の前記登録多項式点と、（ｒ−ｎ）個の前記擬似多項式点とを有するｒ個の点の集合である多項式点集合を、前記登録情報が秘匿化された秘匿登録情報として記憶する秘匿登録情報記憶手段と、
検索対象の情報を検索情報とし、
自然数をそれぞれｃ，Ｌ，ｍとし、
前記検索情報について、ｑ個の前記操作単位情報を有する部分情報をｍ個以上演算可能であり、且つ、ｃがｄ，ｑ，ｎ，ｍに基づいて演算され、且つ、ｃ≧Ｌ，ｍ≧Ｌがそれぞれ成立するものとし、
前記検索情報のうちの前記操作単位情報について、ｄ回の挿入・削除・置換の操作を行うことにより、前記検索情報が前記類似情報に変換され、且つ、前記検索情報に基づいて抽出された前記検索情報を復元可能なｍ個の前記部分情報を検索部分情報とし、
抽出されたｍ個の前記検索部分情報に基づいて演算されたｍ種類の数値である検索数値を要素とする集合を検索集合とし、
ｍ種類の前記検索数値のうち、Ｌ種類の前記検索数値を除く（ｍ−Ｌ）種類の前記検索数値を要素とする前記検索集合の部分集合である検索部分集合を、前記検索情報が秘匿化された秘匿検索情報とした場合に、
前記秘匿検索情報に含まれる前記検索数値が、記憶した前記各秘匿登録情報に含まれる前記登録数値または前記擬似数値と同値であるか否かを判別する検索数値判別手段と、
前記検索数値と同値となる前記登録数値の前記登録多項式点および前記擬似数値の前記擬似多項式点を抽出することにより、前記多項式点集合における前記検索数値の射影集合であって、前記多項式点集合の部分集合である多項式点部分集合を演算する多項式点部分集合演算手段と、
演算された前記多項式点部分集合の点が（ｃ−Ｌ）個以上であるか否かを判別する多項式点部分集合要素数判別手段と、
（ｃ−Ｌ）個以上の点を有する前記多項式点部分集合に対応する前記各秘匿登録情報を演算することにより、前記類似情報が秘匿化された秘匿類似情報を演算する秘匿類似情報演算手段と、
を備えたことを特徴とする。 In order to solve the technical problem, a similar information retrieval system according to claim 3 is provided.
Information to be registered is registered information,
Information that is the same as or similar to the registration information is similar information,
The natural numbers are d, k, n, q, r, respectively.
With respect to operation unit information that is information of a unit to be operated in the registration information, the registration information is converted into the similar information by performing d insertion / deletion / replacement operations, and the registration For information, it is assumed that n or more pieces of partial information having q pieces of operation unit information can be calculated, and r ≧ n holds.
The n pieces of partial information that can restore the registration information extracted based on the registration information are used as registration partial information.
A set having a registered numerical value, which is an n-type numerical value calculated based on the extracted n pieces of registered partial information, as a registered set,
A (k−1) -dimensional univariate polynomial calculated based on the registration information, and a numerical value calculated by substituting the registered numerical value for the polynomial that can restore the registration information is used as a registered substitution value. ,
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point,
A pseudo numerical value that is a numerical value other than the registered numerical value and a pseudo substituted value corresponding to the pseudo numerical value, and the pseudo substituted value that is a numerical value other than the numerical value of the polynomial to which the pseudo numerical value is substituted is a set. When a point other than a polynomial is a pseudo-polynomial point,
A polynomial point set, which is a set of r points each having n registration polynomial points and (r−n) pseudo-polynomial points, is stored as secret registration information in which the registration information is concealed. A secret registration information storage means;
The search target information is the search information,
Let natural numbers be c, L, m respectively.
For the search information, m or more pieces of partial information having q pieces of operation unit information can be calculated, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L is established,
With respect to the operation unit information in the search information, the search information is converted into the similar information by performing d insertion / deletion / replacement operations, and the search information is extracted based on the search information. The m pieces of partial information capable of restoring the search information are set as search partial information,
A set having as elements search numerical values that are m types of numerical values calculated based on the extracted m pieces of search partial information is defined as a search set.
Of the m types of search numerical values, the search information conceals a search subset that is a subset of the search set whose elements are the (m−L) types of search numerical values excluding the L types of search numerical values. If the search information is hidden,
Search numerical value determination means for determining whether the search numerical value included in the confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
Polynomial point subset element number determining means for determining whether or not the calculated points of the polynomial point subset are (c−L) or more;
(CL) Concealed similarity information computing means for computing concealed similarity information in which the similar information is concealed by computing each concealment registration information corresponding to the polynomial point subset having at least (c−L) points ,
It is provided with.

前記技術的課題を解決するために、請求項４記載の発明の類似情報検索プログラムは、
コンピュータを、
登録対象の情報を登録情報とし、
前記登録情報と同一または類似する情報を類似情報とし、
自然数をそれぞれｄ，ｋ，ｎ，ｑ，ｒとし、
前記登録情報のうちの操作対象となる単位の情報である操作単位情報について、ｄ回の挿入・削除・置換の操作を行うことにより、前記登録情報が前記類似情報に変換され、且つ、前記登録情報について、ｑ個の前記操作単位情報を有する部分情報をｎ個以上演算可能であり、且つ、ｒ≧ｎが成立するものとし、
前記登録情報に基づいて抽出された前記登録情報を復元可能なｎ個の前記部分情報を登録部分情報とし、
抽出されたｎ個の前記登録部分情報に基づいて演算されたｎ種類の数値である登録数値を要素とする集合を登録集合とし、
前記登録情報に基づいて演算された（ｋ−１）次元で１変数の多項式であって、前記登録情報を復元可能な前記多項式に前記登録数値が代入されて演算された数値を登録代入値とし、
前記登録数値および前記登録数値に対応する前記登録代入値を一組とする前記多項式上の点を登録多項式点とし、
前記登録数値以外の数値である擬似数値および前記擬似数値に対応する擬似代入値であって、前記擬似数値が代入された前記多項式の数値以外の数値である前記擬似代入値を一組とする前記多項式以外の点を擬似多項式点とした場合に、
ｎ個の前記登録多項式点と、（ｒ−ｎ）個の前記擬似多項式点とを有するｒ個の点の集合である多項式点集合を、前記登録情報が秘匿化された秘匿登録情報として記憶する秘匿登録情報記憶手段、
検索対象の情報を検索情報とし、
自然数をそれぞれｃ，Ｌ，ｍとし、
前記検索情報について、ｑ個の前記操作単位情報を有する部分情報をｍ個以上演算可能であれば、ｃがｄ，ｑ，ｎ，ｍに基づいて演算され、且つ、ｃ≧Ｌ，ｍ≧Ｌがそれぞれ成立するものとし、
前記検索情報のうちの前記操作単位情報について、ｄ回の挿入・削除・置換の操作を行うことにより、前記検索情報が前記類似情報に変換され、且つ、前記検索情報に基づいて抽出された前記検索情報を復元可能なｍ個の前記部分情報を検索部分情報とし、
抽出されたｍ個の前記検索部分情報に基づいて演算されたｍ種類の数値である検索数値を要素とする集合を検索集合とし、
ｍ種類の前記検索数値のうち、Ｌ種類の前記検索数値を除く（ｍ−Ｌ）種類の前記検索数値を要素とする前記検索集合の部分集合である検索部分集合を、前記検索情報が秘匿化された秘匿検索情報とした場合に、
前記秘匿検索情報に含まれる前記検索数値が、記憶した前記各秘匿登録情報に含まれる前記登録数値または前記擬似数値と同値であるか否かを判別する検索数値判別手段、
前記検索数値と同値となる前記登録数値の前記登録多項式点および前記擬似数値の前記擬似多項式点を抽出することにより、前記多項式点集合における前記検索数値の射影集合であって、前記多項式点集合の部分集合である多項式点部分集合を演算する多項式点部分集合演算手段、
演算された前記多項式点部分集合の点が（ｃ−Ｌ）個以上であるか否かを判別する多項式点部分集合要素数判別手段、
（ｃ−Ｌ）個以上の点を有する前記多項式点部分集合に対応する前記各秘匿登録情報を演算することにより、前記類似情報が秘匿化された秘匿類似情報を演算する秘匿類似情報演算手段、
として機能させることを特徴とする。 In order to solve the technical problem, a similar information search program according to a fourth aspect of the present invention provides:
Computer
Information to be registered is registered information,
Information that is the same as or similar to the registration information is similar information,
The natural numbers are d, k, n, q, r, respectively.
With respect to operation unit information that is information of a unit to be operated in the registration information, the registration information is converted into the similar information by performing d insertion / deletion / replacement operations, and the registration For information, it is assumed that n or more pieces of partial information having q pieces of operation unit information can be calculated, and r ≧ n holds.
The n pieces of partial information that can restore the registration information extracted based on the registration information are used as registration partial information.
A set having a registered numerical value, which is an n-type numerical value calculated based on the extracted n pieces of registered partial information, as a registered set,
A (k−1) -dimensional univariate polynomial calculated based on the registration information, and a numerical value calculated by substituting the registered numerical value for the polynomial that can restore the registration information is used as a registered substitution value. ,
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point,
A pseudo numerical value that is a numerical value other than the registered numerical value and a pseudo substituted value corresponding to the pseudo numerical value, and the pseudo substituted value that is a numerical value other than the numerical value of the polynomial to which the pseudo numerical value is substituted is a set. When a point other than a polynomial is a pseudo-polynomial point,
A polynomial point set, which is a set of r points each having n registration polynomial points and (r−n) pseudo-polynomial points, is stored as secret registration information in which the registration information is concealed. Secret registration information storage means,
The search target information is the search information,
Let natural numbers be c, L, m respectively.
If m or more pieces of partial information having q pieces of operation unit information can be calculated for the search information, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L Each holds,
With respect to the operation unit information in the search information, the search information is converted into the similar information by performing d insertion / deletion / replacement operations, and the search information is extracted based on the search information. The m pieces of partial information capable of restoring the search information are set as search partial information,
A set having as elements search numerical values that are m types of numerical values calculated based on the extracted m pieces of search partial information is defined as a search set.
Of the m types of search numerical values, the search information conceals a search subset that is a subset of the search set whose elements are the (m−L) types of search numerical values excluding the L types of search numerical values. If the search information is hidden,
Search numerical value determining means for determining whether the search numerical value included in the confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
A polynomial point subset element number discriminating means for discriminating whether or not there are (c−L) or more calculated points of the polynomial point subset;
(CL) Secret similar information calculation means for calculating secret similar information in which the similar information is concealed by calculating each secret registration information corresponding to the polynomial point subset having more than (c−L) points;
It is made to function as.

請求項１に記載の発明によれば、前記各装置間で送受信される各情報（登録情報、検索情報、類似情報）が秘匿化されているため、前記各装置のユーザ以外の第三者から前記各情報を秘匿することができる。また、請求項１に記載の発明によれば、前記記憶装置では、前記登録情報が前記秘匿登録情報として秘匿化されており、且つ、前記検索情報が前記秘匿検索情報として秘匿化されているため、前記記憶装置のユーザである管理者からも前記各情報を秘匿することができる。この結果、前記第三者および前記管理者から、前記各情報が秘匿された状態で、前記登録情報の所有者であって前記登録装置のユーザである登録者と、前記検索情報の知得者であって前記検索装置のユーザである検索者と、前記記憶装置の管理業務を委託された外部委託業者としての前記管理者とを有する前記ＤＡＳモデルを構成できる。 According to the first aspect of the present invention, since each piece of information (registration information, search information, similar information) transmitted / received between the devices is concealed, a third party other than the user of each device Each information can be kept secret. According to the first aspect of the present invention, in the storage device, the registration information is concealed as the secret registration information, and the search information is concealed as the secret search information. The information can be concealed from an administrator who is a user of the storage device. As a result, a registrant who is the owner of the registration information and is a user of the registration device in a state where the information is kept secret from the third party and the administrator, and an acquaintance of the search information The DAS model can be configured to include a searcher who is a user of the search device and the manager as an outsourcer entrusted with the management operation of the storage device.

また、請求項１に記載の発明によれば、前記登録装置は、前記登録者ごとの鍵を使用しなくても、前記登録情報を秘匿化できると共に、前記登録装置は、前記検索者ごとの鍵を使用しなくても、前記検索情報を秘匿化できる。この結果、前記各装置との間で送受信される各情報（登録情報、検索情報、類似情報）が、前記鍵を使用せずに秘匿化でき、従来公知の前記暗号化データベースにおける鍵管理の安全性（厳格性）やそれに伴う管理コスト等の問題を解決することができる。
また、請求項１に記載の発明によれば、前記秘匿類似情報を検索する際に、前記秘匿検索情報に含まれる検索数値と、前記各秘匿登録情報に含まれる各数値とを照合するため、暗号化された登録情報や検索情報を復号化してから照合を行う場合に比べ、効率的に前記秘匿類似情報を検索できる。 In addition, according to the first aspect of the present invention, the registration device can conceal the registration information without using a key for each registrant, and the registration device can be used for each searcher. The search information can be concealed without using a key. As a result, each information (registration information, search information, similar information) transmitted / received to / from each device can be concealed without using the key, and the key management safety in the conventionally known encryption database can be secured. It is possible to solve problems such as sex (rigidity) and management costs associated therewith.
Further, according to the invention of claim 1, when searching for the secret similar information, in order to collate the search numerical value included in the secret search information and each numerical value included in the respective secret registration information, The secret similar information can be searched more efficiently than in the case where the verification is performed after decrypting the encrypted registration information and search information.

請求項２に記載の発明によれば、｛（ｎ＋ｋ）／２｝個以上の前記登録多項式点が判明すれば前記多項式が復元できることが保証されており、且つ、ｃがｄ，ｑ，ｎ，ｍに基づいて演算され、且つ、ｃ≧（ｎ＋ｋ）／２，ｃ≧Ｌがそれぞれ成立する。このため、前記検索装置は、受信した前記各秘匿類似情報に含まれるｃ個以上の前記登録数値または前記擬似数値に、｛（ｎ＋ｋ）／２｝個以上の前記登録数値が含まれていれば、前記各秘匿類似情報から前記各類似情報を常に復元することができる。この結果、前記検索装置は、前記秘匿類似情報を復元する際に、抽出された前記登録数値および前記擬似数値がｃ個以上であるか否かを判別することにより、前記各秘匿類似情報が、前記検索情報に対する前記類似情報として復元可能であるか否かを適切に判別することができる。また、前記記憶装置は、前記秘匿類似情報を検索する際に、抽出された前記多項式点部分集合の点が（ｃ−Ｌ）個以上であるか否かを判別することにより、前記多項式点集合が、前記検索情報に対する前記類似情報として復元可能であるか否かを適切に判別することができる。 According to the invention described in claim 2, it is guaranteed that the polynomial can be restored if {(n + k) / 2} or more of the registered polynomial points are found, and c is d, q, n, It is calculated based on m, and c ≧ (n + k) / 2 and c ≧ L are established. For this reason, the search device may include {(n + k) / 2} or more of the registered numerical values in the c or more of the registered numerical values or the pseudo numerical values included in the received secret similar information. The similar information can be always restored from the secret similar information. As a result, when the search device restores the concealment similarity information, by determining whether or not the extracted registered numerical value and the pseudo numerical value are c or more, the respective concealment similar information is It is possible to appropriately determine whether or not the similar information with respect to the search information can be restored. Further, the storage device determines whether or not the number of extracted polynomial point subsets is (c−L) or more when searching for the secret similarity information, thereby determining the polynomial point set. However, it is possible to appropriately determine whether or not it can be restored as the similar information with respect to the search information.

請求項３に記載の発明によれば、記憶した前記秘匿登録情報を、前記登録情報に復元されずに秘匿化されたままの状態で検索できる。また、検索対象としての前記検索情報も、前記秘匿検索情報として秘匿化されたままの状態で前記検索が実行できる。したがって、前記検索情報の知得者である検索者以外の第三者から前記各情報（登録情報、検索情報、類似情報）を秘匿することができると共に、前記類似情報検索システムの管理者からも前記各情報を秘匿することができる。
また、請求項３に記載の発明によれば、前記検索者ごとの鍵を使用しなくても、前記検索情報を秘匿化できる。この結果、従来公知の前記暗号化データベースにおける鍵管理の安全性（厳格性）やそれに伴う管理コスト等の問題を解決することができる。
また、請求項３に記載の発明によれば、前記秘匿類似情報を検索する際に、前記秘匿検索情報に含まれる検索数値と、前記各秘匿登録情報に含まれる各数値とを照合するため、暗号化された登録情報や検索情報を復号化してから照合を行う場合に比べ、効率的に前記秘匿類似情報を検索できる。 According to the third aspect of the present invention, the stored secret registration information can be searched without being restored to the registration information. Further, the search can be executed in a state where the search information as a search target is kept secret as the secret search information. Accordingly, the information (registration information, search information, and similar information) can be concealed from a third party other than the searcher who is the acquirer of the search information, and also from the administrator of the similar information search system. Each information can be kept secret.
According to the invention of claim 3, the search information can be concealed without using a key for each searcher. As a result, it is possible to solve problems such as the security (strictness) of key management in the conventionally known encrypted database and the management cost associated therewith.
Further, according to the invention of claim 3, when searching for the secret similar information, in order to collate the search numerical value included in the secret search information and each numerical value included in the respective secret registration information, The secret similar information can be searched efficiently compared to the case where the verification is performed after decrypting the encrypted registration information and search information.

請求項４に記載の発明によれば、記憶した前記秘匿登録情報を、前記登録情報に復元されずに秘匿化されたままの状態で検索できる。また、検索対象としての前記検索情報も、前記秘匿検索情報として秘匿化されたままの状態で前記検索が実行できる。したがって、前記検索情報の知得者である検索者以外の第三者から前記各情報（登録情報、検索情報、類似情報）を秘匿することができると共に、前記類似情報検索システムの管理者からも前記各情報を秘匿することができる。
また、請求項４に記載の発明によれば、前記検索者ごとの鍵を使用しなくても、前記検索情報を秘匿化できる。この結果、従来公知の前記暗号化データベースにおける鍵管理の安全性（厳格性）やそれに伴う管理コスト等の問題を解決することができる。
また、請求項４に記載の発明によれば、前記秘匿類似情報を検索する際に、前記秘匿検索情報に含まれる検索数値と、前記各秘匿登録情報に含まれる各数値とを照合するため、暗号化された登録情報や検索情報を復号化してから照合を行う場合に比べ、効率的に前記秘匿類似情報を検索できる。 According to the fourth aspect of the present invention, the stored secret registration information can be searched without being restored to the registration information. Further, the search can be executed in a state where the search information as a search target is kept secret as the secret search information. Accordingly, the information (registration information, search information, and similar information) can be concealed from a third party other than the searcher who is the acquirer of the search information, and also from the administrator of the similar information search system. Each information can be kept secret.
According to the invention of claim 4, the search information can be concealed without using a key for each searcher. As a result, it is possible to solve problems such as the security (strictness) of key management in the conventionally known encrypted database and the management cost associated therewith.
According to the invention of claim 4, when searching for the secret similar information, in order to collate the search numerical value included in the secret search information and each numerical value included in the respective secret registration information, The secret similar information can be searched efficiently compared to the case where the verification is performed after decrypting the encrypted registration information and search information.

次に図面を参照しながら、本発明の実施の形態の具体例（以下、実施例と記載する）を説明するが、本発明は以下の実施例に限定されるものではない。
なお、以下の図面を使用した説明において、理解の容易のために説明に必要な部材以外の図示は適宜省略されている。 Next, specific examples of embodiments of the present invention (hereinafter referred to as examples) will be described with reference to the drawings, but the present invention is not limited to the following examples.
In the following description using the drawings, illustrations other than members necessary for the description are omitted as appropriate for easy understanding.

図１は本発明の実施例１の類似情報検索システムの全体説明図である。
図１において、本発明の実施例１の類似情報検索システムＳは、登録対象の情報（データ）である登録情報を記憶する記憶装置の一例としてのデータベースサーバ（類似情報検索装置）ＤＳを有する。また、前記類似情報検索システムＳは、通信回線の一例としてのインターネット１を介して前記データベースサーバＤＳと情報の送受信が可能に接続され、前記データベースサーバＤＳに対して、前記登録情報を登録させる（格納させる）登録装置の一例としての登録用クライアントパソコンＰＣａを有する。 FIG. 1 is an overall explanatory diagram of a similar information search system according to a first embodiment of the present invention.
1, the similar information search system S according to the first embodiment of the present invention includes a database server (similar information search device) DS as an example of a storage device that stores registration information that is information (data) to be registered. Further, the similar information search system S is connected to the database server DS through the Internet 1 as an example of a communication line so that information can be transmitted and received, and causes the database server DS to register the registration information ( A registration client personal computer PCa as an example of a registration device.

さらに、前記類似情報検索システムＳは、前記インターネット１を介して前記データベースサーバＤＳと情報の送受信が可能に接続され、前記データベースサーバＤＳに対して、記憶された前記登録情報のうち、検索対象の情報である検索情報と同一または類似する前記登録情報である類似情報を検索させる検索装置の一例としての検索用クライアントパソコンＰＣｂを有する。
実施例１の前記データベースサーバＤＳおよび前記各クライアントパソコンＰＣａ，ＰＣｂは、コンピュータ本体Ｈ１、ディスプレイＨ２、入力装置であるキーボードＨ３、マウスＨ４、図示しないハードディスクドライブ、ＤＶＤ（Digital Versatile Disc）ドライブ等により構成されている。 Further, the similar information search system S is connected to the database server DS through the Internet 1 so as to be able to send and receive information. It has a search client personal computer PCb as an example of a search device that searches for similar information that is the registered information that is the same as or similar to the search information that is information.
The database server DS and each of the client personal computers PCa and PCb of the first embodiment are configured by a computer main body H1, a display H2, a keyboard H3 as an input device, a mouse H4, a hard disk drive (not shown), a DVD (Digital Versatile Disc) drive, and the like. Has been.

（実施例１の制御部の説明）
図２は本発明の実施例１の類似情報検索システムを構成する各装置の機能をブロック図（機能ブロック図）で示した説明図である。
図２において、前記データベースサーバＤＳおよび前記各クライアントパソコンＰＣａ，ＰＣｂのコンピュータ本体Ｈ１は、外部との信号の入出力および入出力信号レベルの調節等を行うＩ／Ｏ（入出力インターフェース）、必要な起動処理を行うためのプログラムおよびデータ等が記憶されたＲＯＭ（リードオンリーメモリ、記録媒体）、必要なデータ及びプログラムを一時的に記憶するためのＲＡＭ（ランダムアクセスメモリ、記録媒体）、ＲＯＭ等に記憶された起動プログラムに応じた処理を行うＣＰＵ（中央演算処理装置）、ならびにクロック発振器等を有しており、前記ＲＯＭ及びＲＡＭ等に記憶されたプログラムを実行することにより種々の機能を実現することができる。
前記構成の前記データベースサーバＤＳおよび前記各クライアントパソコンＰＣａ，ＰＣｂは、前記ハードディスクやＲＯＭ等に記憶されたプログラムを実行することにより種々の機能を実現することができる。 (Description of the control part of Example 1)
FIG. 2 is an explanatory diagram showing the function of each device constituting the similar information retrieval system of the first embodiment of the present invention in a block diagram (functional block diagram).
In FIG. 2, the database server DS and the computer main body H1 of each of the client personal computers PCa and PCb have an I / O (input / output interface) that performs input / output of signals to / from the outside and adjustment of input / output signal levels, etc. ROM (read-only memory, recording medium) that stores programs and data for performing startup processing, RAM (random access memory, recording medium), ROM, etc. for temporarily storing necessary data and programs It has a CPU (Central Processing Unit) that performs processing according to the stored startup program, a clock oscillator, and the like, and implements various functions by executing programs stored in the ROM, RAM, and the like. be able to.
The database server DS and the client personal computers PCa and PCb configured as described above can realize various functions by executing programs stored in the hard disk, ROM, or the like.

（登録用クライアントパソコンＰＣａの制御部の説明）
前記登録用クライアントパソコンＰＣａのハードディスクドライブには、前記登録用クライアントパソコンＰＣａの基本動作を制御する基本ソフト（オペレーティングシステム）ＯＳや、アプリケーションプログラムとしての秘匿登録情報送信プログラムＡＰ１、その他の図示しないソフトウェアが記憶されている。 (Description of control unit of registration client personal computer PCa)
The hard disk drive of the registration client personal computer PCa includes a basic software (operating system) OS that controls the basic operation of the registration client personal computer PCa, a secret registration information transmission program AP1 as an application program, and other software (not shown). It is remembered.

（秘匿登録情報送信プログラムＡＰ１）
前記秘匿登録情報送信プログラムＡＰ１は、下記の機能手段（プログラムモジュール）を有する。 (Secret registration information transmission program AP1)
The secret registration information transmission program AP1 has the following functional means (program modules).

図３は実施例１の登録画像の説明図である。
ＣＡ１：登録画像表示手段
登録画像表示手段ＣＡ１は、図３に示す、前記登録情報を前記データベースサーバＤＳに登録させるための登録画像２を前記ディスプレイＨ２に表示する。実施例１の前記登録画像２は、前記登録情報を入力するための登録情報入力部２ａと、入力された前記登録情報を前記データベースサーバＤＳに登録するための登録ボタン２ｂとを有する。なお、実施例１では、前記登録情報入力部２ａには、図３に示すように、文字列情報により構成された前記登録情報ｓ（例えば、塩基配列の文字列情報である「ＡＧＣＣＧＧＡＡＧＧＣＣ」）が入力される。 FIG. 3 is an explanatory diagram of a registered image according to the first embodiment.
CA1: Registered image display means The registered image display means CA1 displays a registered image 2 shown in FIG. 3 for registering the registration information in the database server DS on the display H2. The registration image 2 according to the first embodiment includes a registration information input unit 2a for inputting the registration information, and a registration button 2b for registering the input registration information in the database server DS. In Example 1, as shown in FIG. 3, the registration information input unit 2a includes the registration information s (for example, “AGCCCGGAAGGGCC” which is character string information of a base sequence) configured by character string information. Entered.

ＣＡ２：登録判別手段
登録判別手段ＣＡ２は、前記登録情報ｓを前記データベースサーバＤＳに登録するか否かを判別する。実施例１の前記登録判別手段ＣＡ２は、前記登録情報入力部２ａに前記登録情報ｓが入力されて前記登録ボタン２ｂが入力されたか否かを判別することにより、前記登録情報ｓを前記データベースサーバＤＳに登録するか否かを判別する。
ここで、実施例１の前記類似情報検索システムＳでは、類似文字列検索手法であるｑ−ｇｒａｍの技術と、生体認証やパスワードの復元等に利用されている曖昧照合法であるファジーボールト（Fuzzy Vault、Fuzzy Vault Scheme）の技術とを組み合わせることにより、前記登録情報ｓが秘匿化された状態で登録・検索等の各処理が実行される。 CA2: Registration Discriminating Unit The registration discriminating unit CA2 discriminates whether or not to register the registration information s in the database server DS. The registration determination unit CA2 according to the first embodiment determines whether the registration information s is input to the registration information input unit 2a and the registration button 2b is input, so that the registration information s is stored in the database server. It is determined whether or not to register with the DS.
Here, in the similar information search system S according to the first embodiment, the q-gram technique, which is a similar character string search method, and the fuzzy vault (Fuzzy vault, which is an ambiguous collation method used for biometric authentication, password restoration, etc.) By combining with the technology of Vault and Fuzzy Vault Scheme, each process such as registration and search is executed in a state where the registration information s is kept secret.

（ｑ−ｇｒａｍを用いた類似文字列検索手法について）
ここで、類似文字列検索手法とは、ある文字列に対して、文字の挿入・削除・置換の各編集操作の回数である編集距離が、所定の値以下になる文字列を類似文字列として検出する手法をいう。すなわち、２つの文字列を同一にするために必要な前記各編集操作の最小値である前記編集距離に基づいて、データベース内の全ての文字列から前記類似文字列を検索する手法をいう。
また、ｑ−ｇｒａｍとは、０を除いた自然数をｑとした場合に、元の文字列の長さｑとなる部分文字列のことをいう。例えば、元の文字列を「ＡＢＣＤＥＦＧ」とし、ｑ＝４とした場合、前記ｑ−ｇｒａｍは、「ＡＢＣＤ」，「ＢＣＤＥ」，「ＣＤＥＦ」，「ＤＥＦＧ」の４つである。 (About similar character string search method using q-gram)
Here, the similar character string search method refers to a character string in which the edit distance, which is the number of character insertion / deletion / replacement operations, is less than or equal to a predetermined value for a character string. This is a detection method. That is, it refers to a method of searching for the similar character string from all the character strings in the database based on the edit distance that is the minimum value of the editing operations necessary to make two character strings the same.
Further, q-gram refers to a partial character string having the length q of the original character string, where q is a natural number excluding 0. For example, when the original character string is “ABCDEFG” and q = 4, the q-gram is four of “ABCD”, “BCDE”, “CDEF”, and “DEFG”.

なお、０を除いた自然数をそれぞれｄ，ｎ，ｍとし、長さｎの第１の文字列と長さｍの第２の文字列との前記編集距離をｄとし、前記自然数ｎ，ｍのうちのいずれか大きい値をｍａｘ（ｎ，ｍ）とした場合に、前記第１の文字列と前記第２の文字列とは、少なくとも、｛ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１｝個の前記ｑ−ｇｒａｍを共通に持つことが保証されている（例えば、非特許文献１０参照）。
したがって、ｑ−ｇｒａｍを用いた類似文字列検索手法は、まず、前記文字列をｓ_ｑｇとした場合に、予め設定された前記自然数ｎ，ｍ，ｑ，ｄの値に基づいて、前記ｑ−ｇｒａｍの共有数の最小値（ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１）を演算し、前記文字列ｓ_ｑｇと、前記データベース内の各文字列との各共有数を演算することにより、共有数が前記最小値以下となる前記各文字列を前記類似文字列の候補から排除する、いわゆる、フィルタリング処理を実行する。そして、前記文字列ｓ_ｑｇと、前記類似文字列の候補となった前記各文字列との前記編集距離を実際に演算することにより、最終的な解である前記類似文字列を演算する、いわゆる、洗練化処理を実行する。 The natural numbers excluding 0 are d, n, and m, respectively, and the edit distance between the first character string of length n and the second character string of length m is d, and the natural numbers n, m When any one of the values is max (n, m), the first character string and the second character string are at least {max (n, m) − (d−1) q −1} pieces of the q-gram are guaranteed in common (for example, see Non-Patent Document 10).
Therefore, in the similar character string search method using q-gram, first, when the character string is set to s _qg , based on the values of the natural numbers n, m, q, and d set in advance, The minimum value (max (n, m) − (d−1) q−1) of the shared number of gram is calculated, and each shared number between the character string s _qg and each character string in the database is calculated. Thus, a so-called filtering process is performed in which each character string having the number of shares less than or equal to the minimum value is excluded from the similar character string candidates. Then, by actually calculating the edit distance between the character string s _qg and each character string that is a candidate for the similar character string, the similar character string that is the final solution is calculated. Execute the refinement process.

この結果、ｑ−ｇｒａｍを用いた類似文字列検索手法は、総当りの類似文字列検索手法に比べ、処理を高速化できる。すなわち、長さｎの第１の文字列と長さｍの第２の文字列との前記編集距離の計算について、計算量を評価するためのランダウの記号（ランダウの記法、Ｏ-記法）を用いた場合、総当りの類似文字列検索手法では、前記計算量がＯ（ｎｍ）時間となる。すなわち、ｎ＝ｍとした場合、Ｏ（ｎ^２）時間であるため、計算量がｎの二乗関数的に大きくなる。これに対して、前記ｑ−ｇｒａｍを用いた類似文字列検索手法では、前記計算量がＯ（ｎ＋ｍ）時間となる。このため、前記フィルタリング処理を高速化できる。すなわち、ｎ＝ｍとした場合、Ｏ（２ｎ）時間であるため、計算量がｎの線形関数的に大きくなり、計算量を低減できる。 As a result, the similar character string search method using q-gram can speed up processing compared to the brute force similar character string search method. That is, for the calculation of the edit distance between the first character string of length n and the second character string of length m, Landau symbols (Landau notation, O-notation) for evaluating the amount of calculation are used. When used, in the brute force similar character string search method, the calculation amount is O (nm) time. That is, when n = m, since the time is O (n ² ), the amount of calculation increases as a square function of n. On the other hand, in the similar character string search method using the q-gram, the calculation amount is O (n + m) time. For this reason, the filtering process can be speeded up. That is, when n = m, since it is O (2n) time, the calculation amount increases in a linear function of n, and the calculation amount can be reduced.

（ファジーボールトについて）
また、ファジーボールトとは、誤り訂正符号を用いて、秘匿化された情報どうしの曖昧な照合を実行する手法である。ここで、誤り訂正符号とは、冗長性（redundancy）が付加された情報の集合としての符号であって、前記冗長性が前記情報に含まれる誤りの訂正に用いられる符号のことである。このため、前記誤り訂正符号であれば、例えば、送信中にノイズが混入した場合でも、前記冗長性に基づいて、前記ノイズが除去された前記情報を受信できる。
前記ファジーボールトでは、秘密情報を施錠する施錠処理と、施錠された前記秘密情報を開錠する開錠処理とが実行される。 (About fuzzy vault)
The fuzzy vault is a technique for executing ambiguous collation between concealed information using an error correction code. Here, the error correction code is a code as a set of information to which redundancy is added, and the redundancy is used for correcting an error included in the information. Therefore, with the error correction code, for example, even when noise is mixed during transmission, the information from which the noise has been removed can be received based on the redundancy.
In the fuzzy vault, a locking process for locking the secret information and an unlocking process for unlocking the locked secret information are executed.

まず、０を除いた自然数をそれぞれｋ，ｔ，ｐ，ｒとし、前記自然数ｐを位数とする有限体（finite field）をＦとし、前記有限体Ｆのｋ次拡大体をＦ^ｋとし、前記有限体Ｆのｔ次拡大体をＦ^ｔとし、前記有限体Ｆのｒ次拡大体をＦ^ｒとし、前記ｋ次拡大体Ｆ^ｋの部分体としての秘密情報をｓ_ｆｖとし（ｓ_ｆｖ∈Ｆ^ｋ）、前記ｔ次拡大体Ｆ^ｔの部分体としての第１の集合をＡとし（Ａ∈Ｆ^ｔ）、前記ｒ次拡大体Ｆ^ｒの部分体としての集合、いわゆる、ボールト（vault）をＲとし（Ｒ∈Ｆ^ｒ）、ｒ＞ｔとした場合に、前記秘密情報ｓ_ｆｖと、前記第１の集合Ａとに基づいて、前記ボールトＲを演算する前記施錠処理を実行する。
そして、前記ｔ次拡大体Ｆ^ｔの部分体としての第２の集合をＢとし（Ｂ∈Ｆ^ｔ）、前記第１の集合Ａと前記第２の集合Ｂとが十分近い場合、例えば、前記各集合Ａ，Ｂの各要素について、前記各要素の値が８割以上同一である場合に、前記ボールトＲと、前記第２の集合Ｂとに基づいて、前記秘密情報ｓ_ｆｖを演算する前記開錠処理を実行する。 First, let k, t, p, r be natural numbers excluding 0, F be a finite field with the natural number p being the order, and F ^k be a k-th order extension of the finite field F; The t-order extension field of the finite field F is F ^t , the r-order extension field of the finite field F is F ^r, and the secret information as a subfield of the ^k-th extension field F ^k is s _fv (s _fv ∈ F ^k ), the first set as a sub-field of the t-order extension field F ^t is A (A∈F ^t ), and the set as a sub-body of the r-order extension field F ^r , the so-called vault When R is R (RεF ^r ) and r> t, the locking process for calculating the vault R is executed based on the secret information s _fv and the first set A.
If the second set as a sub-field of the t-th order extension field F ^t is B (B∈F ^t ), and the first set A and the second set B are sufficiently close, Calculating the secret information s _fv based on the vault R and the second set B when the value of each element is equal to 80% or more for each element of each set A, B Execute unlocking process.

なお、体（field）とは、加減乗除の四則演算が定義された集合であって、前記集合の要素である元どうしの前記四則演算が前記集合内で閉じている（演算結果が元として集合に含まれる）場合の前記集合のことをいう。また、有限体とは、前記集合の元の数（位数）が、有限個である体のことであり、前記有限体は、ガロア体（Galois field）とも呼ばれ、特に位数が素数の場合には素体（prime field）とも呼ばれる。また、拡大体とは、前記体を部分集合として含む体のことであり、例えば、実数体（実数全体の集合）は、有理数体（有理数全体の集合）の拡大体である。また、有限体のｋ次拡大体とは、加減乗除の四則演算が定義された、前記有限体の元を係数とする（ｋ−１）次以下の多項式を元とする集合（多項式集合）のことをいう。さらに、部分体とは、前記体の部分集合であり、前記体で定義されている前記四則演算が成立する前記部分集合のことをいう。
なお、体、有限体、拡大体、部分体等については、例えば、非特許文献６に記載されており、公知であるため、詳細な説明を省略する。 A field is a set in which four arithmetic operations of addition, subtraction, multiplication, and division are defined, and the four arithmetic operations of elements that are elements of the set are closed in the set (the operation result is a set based on the original). In the case of the above). A finite field is a field in which the original number (order) of the set is a finite number, and the finite field is also called a Galois field. In some cases, it is also called a prime field. An extension field is a field including the field as a subset. For example, a real number field (a set of all real numbers) is an extension field of a rational number field (a set of all rational numbers). Further, a k-th order extension field of a finite field is a set (polynomial set) in which four arithmetic operations of addition, subtraction, multiplication, and division are defined, and an element of the finite field is a coefficient (k-1) or lower polynomial. That means. Furthermore, a partial body is a subset of the body, and refers to the subset in which the four arithmetic operations defined in the body are established.
In addition, about a body, a finite field, an expansion body, a partial body, etc., it describes in the nonpatent literature 6, for example, Since it is well-known, detailed description is abbreviate | omitted.

（ＲＳ符号を用いたファジーボールトについて）
前記ファジーボールトでは、前記施錠処理および前記開錠処理を実行するために、前記誤り訂正符号として、誤り訂正能力が高いとされている、ＲＳ符号（Reed-Solomon符号、リード・ソロモン符号）が利用される場合がある。ここで、前記ＲＳ符号とは、前記情報がビットのブロックに分割されて表現され、前記ブロック単位の誤り訂正ができる符号である。なお、前記ブロックは、具体的には、前記有限体（Ｆ）のｋ次拡大体（Ｆ^ｋ）の元である前記多項式上の点である。すなわち、前記ＲＳ符号では、所定の数の前記多項式上の点を要素とする集合が、いわゆる、符号語としてビットで表現される。なお、ｎ個の前記多項式上の点からなる集合を前記符号語として表現する場合、前記ＲＳ符号は、特に、（ｎ，ｋ）符号（（ｎ，ｋ）ＲＳ符号）という。
なお、前記ＲＳ符号等については、例えば、非特許文献６等に記載されており、公知であるため、詳細な説明を省略する。 (About fuzzy vault using RS code)
In the fuzzy vault, an RS code (Reed-Solomon code, Reed-Solomon code), which has high error correction capability, is used as the error correction code in order to execute the locking process and the unlocking process. May be. Here, the RS code is a code in which the information is expressed by being divided into bit blocks, and error correction can be performed in units of blocks. Note that the block is specifically a point on the polynomial that is an element of the k-th order extension field (F ^k ) of the finite field (F). That is, in the RS code, a set whose elements are a predetermined number of points on the polynomial is expressed by bits as a so-called code word. When a set of n points on the polynomial is expressed as the code word, the RS code is particularly referred to as an (n, k) code ((n, k) RS code).
Note that the RS code and the like are described in, for example, Non-Patent Document 6 and the like and are well known, and thus detailed description thereof is omitted.

前記ＲＳ符号を用いたファジーボールトでは、前記施錠処理において、まず、ｎ＝ｔ，ｒ＞ｎとした場合に、ｋ個の要素からなる前記秘密情報ｓ_ｆｖ（ｓ_ｆｖ∈Ｆ^ｋ）と、ｎ個の要素からなる前記第１の集合Ａ（Ａ∈Ｆ^ｎ）とに基づいて、前記多項式上のｎ個の点からなる前記（ｎ，ｋ）符号を演算する。次に、前記（ｎ，ｋ）符号に対して、前記多項式上の点以外のランダムな（ｒ−ｎ）点からなる集合であるチャフ（chaff）を前記（ｎ，ｋ）符号に付与する。そして、前記（ｎ，ｋ）符号と、前記チャフとによって構成されたｒ個の点からなる集合が前記ボールトＲ（Ｒ∈Ｆ^ｒ）として出力される。
また、前記ＲＳ符号を用いたファジーボールトでは、前記開錠処理において、前記第１の集合Ａのｎ個以下の要素を有する前記第２の集合Ｂ（Ｂ∈Ｆ^ｔ）と、ｒ個の点からなる前記ボールトＲとに基づいて、前記多項式を復元することにより、前記秘密情報ｓ_ｆｖを演算する。 In the fuzzy vault using the RS code, in the locking process, first, when n = t, r> n, the secret information s _fv (s _fv εF ^k ) including ^k elements and n Based on the first set A (AεF ⁿ ) composed of elements, the (n, k) code composed of n points on the polynomial is calculated. Next, a chaff that is a set of random (rn) points other than the points on the polynomial is added to the (n, k) code. A set of r points formed by the (n, k) code and the chaff is output as the vault R (RεF ^r ).
In the fuzzy vault using the RS code, in the unlocking process, the second set B (BεF ^t ) having n elements or less of the first set A and r points The secret information s _fv is calculated by restoring the polynomial based on the vault R consisting of

なお、前記ＲＳ符号では、前記（ｎ，ｋ）符号における（ｎ−ｋ）／２個以下の点の誤りを訂正できることが保証されている（例えば、非特許文献６、７参照）。すなわち、前記（ｎ，ｋ）符号における｛（ｎ＋ｋ）／２｝個以上の点が判明すれば前記多項式の復元に成功することが知られている（ｎ−｛（ｎ−ｋ）／２｝＝（ｎ＋ｋ）／２）。この結果、前記ＲＳ符号を用いたファジーボールトでは、前記各集合Ａ，Ｂの各要素について、｛（ｎ＋ｋ）／２｝個以上の要素が同一である場合には、前記多項式が復元でき、前記秘密情報ｓ_ｆｖを演算できることが保証されている。
したがって、前記ＲＳ符号を用いたファジーボールトでは、前記チャフを含む前記ボールトＲから前記（ｎ，ｋ）符号を特定する問題と、前記（ｎ，ｋ）符号から前記多項式を復元する問題、いわゆる、多項式復元問題との困難性によって、前記秘密情報ｓ_ｆｖの秘匿化についての安全性が保証されている。
なお、前記ＲＳ符号を用いたファジーボールトについては、例えば、非特許文献７、８等に記載されており、公知であるため、詳細な説明を省略する。 In the RS code, it is guaranteed that errors of (n−k) / 2 or less points in the (n, k) code can be corrected (for example, see Non-Patent Documents 6 and 7). That is, it is known that if the {n + k) / 2} or more points in the (n, k) code are found, the polynomial can be successfully restored (n − {(n−k) / 2}). = (N + k) / 2). As a result, in the fuzzy vault using the RS code, for each element of the sets A and B, when {(n + k) / 2} or more elements are the same, the polynomial can be restored, It is guaranteed that the secret information s _fv can be calculated.
Therefore, in the fuzzy vault using the RS code, the problem of specifying the (n, k) code from the vault R including the chaff, and the problem of restoring the polynomial from the (n, k) code, so-called, Due to the difficulty with the polynomial restoration problem, the security of the confidential information s _fv is secured.
Note that the fuzzy vault using the RS code is described in, for example, Non-Patent Documents 7 and 8, and is well known, and thus detailed description thereof is omitted.

ＣＡ３：多項式演算手段
多項式演算手段ＣＡ３は、前記登録情報ｓに基づいて、（ｋ−１）次元で１変数の多項式であって、前記登録情報ｓを復元するために必要な前記多項式上の点が｛（ｎ＋ｋ）／２｝個以上となる前記多項式を演算する。実施例１の前記多項式演算手段ＣＡ３は、前記多項式の変数をｘとし、前記多項式のｋ個の係数をｓ_０〜ｓ_ｋ−１とし、前記多項式をｆ（ｘ）として以下の式（１）に示すものとした場合、前記係数ｓ_０〜ｓ_ｋ−１を演算することにより、前記多項式ｆ（ｘ）を演算する。
ｆ（ｘ）＝ｓ_０＋ｓ_１ｘ＋ｓ_２ｘ^２＋…＋ｓ_ｋ−２ｘ^ｋ−２＋ｓ_ｋ−１ｘ^ｋ−１…（１）
すなわち、実施例１の前記多項式演算手段ＣＡ３は、前記登録情報ｓを前記秘密情報ｓ_ｆｖとみなして（ｓ＝ｓ_ｆｖ）、前記登録情報ｓに基づいて、前記ＲＳ符号を用いたファジーボールトにおける前記多項式ｆ（ｘ）の係数ｓ_０〜ｓ_ｋ−１を演算する。 CA3: Polynomial calculation means The polynomial calculation means CA3 is a (k−1) -dimensional univariate polynomial based on the registration information s, and is a point on the polynomial necessary to restore the registration information s. The polynomial is calculated such that becomes {(n + k) / 2} or more. The polynomial calculation means CA3 according to the first embodiment uses the following equation (1), where x is a variable of the polynomial, s ₀ to s _k-1 are _k coefficients of the polynomial, and f (x) is the polynomial. When calculating the coefficient f (x), the coefficients s _{0 to} s _k−1 are calculated.
f (x) = s ₀ + s ₁ x + s ₂ x ² +... + s _k−2 x ^k−2 + s _k−1 x ^k−1 (1)
That is, the polynomial calculation unit CA3 according to the first embodiment regards the registration information s as the secret information s _fv (s = s _fv ), and uses the RS code based on the registration information s. The coefficients s _{0 to} s _k−1 of the polynomial f (x) are calculated.

実施例１の前記登録情報ｓは、具体的には、まず、前記登録情報ｓを前記文字列ｓ_ｑｇとみなすことにより（ｓ＝ｓ_ｆｖ＝ｓ_ｑｇ）、（ｋ−１）個の部分文字列を抽出する。ここで、前記登録情報ｓの長さ（文字数）を‖ｓ‖とした場合、（‖ｓ‖−ｑ＋１）個のｑ−ｇｒａｍが抽出されることが知られている（例えば、非特許文献６等参照）。すなわち、前記登録情報ｓからｎ個のｑ−ｇｒａｍが抽出されるとした場合には、ｑ＝‖ｓ‖−ｎ＋１が成立する。よって、前記部分文字列をｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}とした場合、長さ（‖ｓ‖−（ｋ−１）＋１）の前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}が抽出される。そして、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}を、前記有限体Ｆの元としての前記係数ｓ_０〜ｓ_ｋ−１（ｓ_０，ｓ_１，…，ｓ_ｋ−２，ｓ_ｋ−１∈Ｆ）に対応付けることにより、前記多項式ｆ（ｘ）を演算する。例えば、ｓ_ｋ−１＝１とし、且つ、予め設定された双方向関数の一例としての可逆関数により、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}を、前記有限体Ｆの元となる前記係数ｓ_０〜ｓ_ｋ−２に変換する。 Specifically, the registration information s of the first embodiment is as follows. First, the registration information s is regarded as the character string s _qg (s = s _fv = s _qg ), and (k−1) partial characters. Extract columns. Here, it is known that (文字 s‖−q + 1) q-grams are extracted when the length (number of characters) of the registration information s is ‖s‖ (for example, Non-Patent Document 6). Etc.). That is, when n q-grams are extracted from the registration information s, q = ‖s‖-n + 1 holds. Thus, if the partial string was _{_{s k (1) ~s k (}} k-1), the substring _{s k (1)} of the length (‖s‖- (k-1) +1 ) ~s _{k (k-1)} is extracted. Then, the partial character strings s _{k (1) to} s _{k (} _k−1) are converted into the coefficients s _{0 to} s _k−1 (s ₀ , s ₁ ,..., S _k− as the elements of the finite field F. ₂ , s _k−1 εF), the polynomial f (x) is calculated. For example, s _k-1 = 1 and the partial character strings s _{k (1) to} s _{k (k−1)} are converted into the finite field F by a reversible function as an example of a preset bidirectional function. Are converted into the coefficients s _{0 to} s _{k−2 that} are the basis of the above.

ここで、可逆関数とは、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}から前記係数ｓ_０〜ｓ_ｋ−１に変換可能、且つ、前記係数ｓ_０〜ｓ_ｋ−１から前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}に変換可能な関数である。例えば、前記登録情報ｓを「ＡＧＣＣＧＧＡＡＧＧＣＣ」とし、ｋ＝１０とし、長さ４（１２−（１０−１）＋１＝４）の前記部分文字列ｓ_ｋ（１）を「ＡＧＣＣ」とし、ｓ_０＝５とした場合に、前記可逆関数とは、「ＡＧＣＣ」→５が成立する対応関係、いわゆる、写像（map）のことである。また、この場合、前記可逆関数をＴとし、前記可逆関数Ｔの逆関数をＴ^−１とし、前記部分文字列ｓ_ｋ（１）を入力した前記可逆関数Ｔの出力値をＴ（ｓ_ｋ（１））とし、前記係数ｓ_０を入力した前記逆関数Ｔ^−１の出力値をＴ^−１（ｓ_０）とした場合に、Ｔ（ｓ_ｋ（１））＝ｓ_０＝５、Ｔ^−１（ｓ_０）＝Ｔ^−１（５）＝ｓ_ｋ（１）が成立する。 Here, the reversible function, the substring _{_{s k (1) ~s k (}} k-1) from the convertible to the coefficient _s 0 _{~s k-1,} and the coefficient _s 0 _{~s k-1} To the partial character string s _{k (1) to} s _{k (} _k−1) . For example, the registration information s is “AGCCCGGAAGGGCC”, k = 10, the partial character string s _{k (1)} of length 4 (12− (10−1) + 1 = 4 ₎ is “AGCC”, and s ₀ = 5, the reversible function is a correspondence relationship that establishes “AGCC” → 5, that is, a so-called map. In this case, the reversible function is T, the inverse function of the reversible function T is T- ^1, and the output value of the reversible function T to which the partial character string s _{k (1)} is input is T (s _{k ( 1)),} and the output value of the inverse function ^{T -1} input the coefficient _{s 0} in case of a ^{_{_{T -1 (s 0), T}}} (s k (1)) = s 0 = 5, T - ¹ (s ₀ ) = T ⁻¹ (5) = s _{k (1)} is established.

なお、実施例１では、前記登録情報ｓのｑ−ｇｒａｍを抽出するために、前記自然数ｑの値が予め設定されている。よって、前記登録情報入力部２ａに前記登録情報ｓが入力された場合、前記自然数ｎの値が、ｎ＝‖ｓ‖−ｑ＋１として設定されている。また、実施例１では、前記登録情報ｓと前記検索情報との前記編集距離ｄについての最大値がｄ_ｍａｘ（ｄ≦ｄ_ｍａｘ）として予め設定されている。さらに、実施例１では、前記自然数（次元数）ｋの値が、以下の式（２）を前記自然数ｋについて解いた式（２）′により設定されている。
（ｎ＋ｋ）／２＝ｎ−（ｄ_ｍａｘ−１）ｑ−１ …（２）
ｋ＝ｎ−２（ｄ_ｍａｘ−１）ｑ−２ …（２）′ In Example 1, the value of the natural number q is set in advance in order to extract the q-gram of the registration information s. Therefore, when the registration information s is input to the registration information input unit 2a, the value of the natural number n is set as n = ‖s‖−q + 1. In the first embodiment, the maximum value for the edit distance d between the registration information s and the search information is preset as d _max (d ≦ d _max ). Furthermore, in the first embodiment, the value of the natural number (dimension number) k is set by the equation (2) ′ obtained by solving the following equation (2) for the natural number k.
(N + k) / 2 = n− (d _max −1) q−1 (2)
k = n−2 (d _max −1) q−2 (2) ′

ＣＡ４：登録部分情報抽出手段
登録部分情報抽出手段ＣＡ４は、前記登録情報ｓに基づいて、前記登録情報ｓを復元可能なｎ個の部分情報である登録部分情報を抽出する。実施例１の前記登録部分情報抽出手段ＣＡ４は、前記登録情報ｓを前記文字列ｓ_ｑｇとみなすことにより（ｓ＝ｓ_ｆｖ＝ｓ_ｑｇ）、ｎ個の前記ｑ−ｇｒａｍを前記登録部分情報として抽出する。実施例１の前記登録部分情報抽出手段ＣＡ４は、前記登録部分情報をｓ_ｑ１〜ｓ_ｑｎとした場合に、前記多項式演算手段ＣＡ３と同様に、ｎ個（ｎ＝‖ｓ‖−ｑ＋１）の部分文字列である前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎを抽出する。例えば、ｑ＝４とした場合、前記登録情報ｓ（「ＡＧＣＣＧＧＡＡＧＧＣＣ」）の登録部分情報ｓ_ｑ１〜ｓ_ｑｎは、ｓ_ｑ１（「ＡＧＣＣ」），ｓ_ｑ２「ＧＣＣＧ」，…，ｓ_ｑｎ＝ｓ_ｑ９「ＧＧＣＣ」の９つである（ｎ＝１２−４＋１＝９）。
ＣＡ５：登録集合演算手段
登録集合演算手段ＣＡ５は、前記登録部分情報抽出手段ＣＡ４により抽出されたｎ個の前記登録部分情報に基づいて、ｎ種類の数値である登録数値を要素とする集合である登録集合を演算する。すなわち、実施例１の前記登録集合演算手段ＣＡ５は、前記ＲＳ符号を用いたファジーボールトにおける前記第１の集合Ａの一例としての前記登録集合Ａを演算する（Ａ∈Ｆ^ｎ）。 CA4: Registered partial information extracting means The registered partial information extracting means CA4 extracts registered partial information which is n pieces of partial information capable of restoring the registered information s based on the registered information s. The registered part information extraction unit CA4 according to the first embodiment regards the registration information s as the character string s _qg (s = s _fv = s _qg ), and uses n q-grams as the registered part information. Extract. The registered part information extracting unit CA4 according to the first embodiment has n (n = ‖s‖−q + 1) parts as in the polynomial calculating unit CA3 when the registered part information is s _q1 to s _qn. The registered partial information s _{q1 to} s _qn that are character strings are extracted. For example, when q = 4, the registration partial information s _{q1 to} s _qn of the registration information s (“AGCCCGGAAGGGCC”) is s _q1 (“AGCC”), s _q2 “GCCG”,..., S _qn = s _q9 There are nine “GGCC” (n = 12−4 + 1 = 9).
CA5: Registered set calculation means The registered set calculation means CA5 is a set having n registered numeric values as elements based on the n pieces of registered partial information extracted by the registered partial information extracting means CA4. Compute the registered set. That is, the registered set calculation means CA5 according to the first embodiment calculates the registered set A as an example of the first set A in the fuzzy vault using the RS code (AεF ⁿ ).

なお、実施例１の前記登録集合演算手段ＣＡ５は、予め設定された一方向関数、いわゆる、ハッシュ関数により、前記登録集合Ａを演算する。前記登録集合演算手段ＣＡ５は、具体的には、前記ハッシュ関数をＨとし、前記ハッシュ関数Ｈによる前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎの出力値をＨ（ｓ_ｑ１）〜Ｈ（ｓ_ｑｎ）とし、前記登録数値をａ_１〜ａ_ｎとした場合に（ａ_１，ａ_２，…，ａ_ｎ∈Ｆ）、Ｈ（ｓ_ｑ１）＝ａ_１，Ｈ（ｓ_ｑ２）＝ａ_２，…，Ｈ（ｓ_ｑｎ）＝ａ_ｎを演算することにより、前記登録集合Ａを演算する（Ａ＝（ａ_１，ａ_２，…，ａ_ｎ））。 The registered set calculation means CA5 according to the first embodiment calculates the registered set A using a preset one-way function, a so-called hash function. Specifically, the registered set calculation means CA5 sets the hash function as H, and outputs the registered partial information s _{q1 to} s _qn by the hash function H as H (s _q1 ) to H (s _qn ). , when the registration numerical values and _{_{_{_{a 1 ~a n (a 1,}}}} a 2, ..., a n ∈F), H (s q1) = a 1, H (s q2) = a 2, ..., H _(s qn) = by calculating the _{a n,} computing the registration set _{_{a (a = (a 1,}} a 2, ..., a n)).

ＣＡ６：登録代入値演算手段
登録代入値演算手段ＣＡ６は、前記登録数値ａ_１〜ａ_ｎが代入された前記多項式ｆ（ｘ）の数値である登録代入値を演算する。実施例１の前記登録代入値演算手段ＣＡ６は、前記式（１）について、ｘ＝ａ_１，ｘ＝ａ_２，…，ｘ＝ａ_ｎとした場合の前記多項式ｆ（ｘ）の値である前記登録代入値ｆ（ａ_１），ｆ（ａ_２），…，ｆ（ａ_ｎ）を演算する。
ＣＡ７：擬似数値演算手段
擬似数値演算手段ＣＡ７は、前記登録数値ａ_１〜ａ_ｎ以外の数値である（ｒ−ｎ）種類の擬似数値を演算する。実施例１の前記擬似数値演算手段ＣＡ７は、前記擬似数値をａ_ｎ＋１〜ａ_ｒとした場合に、前記有限体Ｆのｐ個の要素（元）のうち、前記登録数値ａ_１〜ａ_ｎ以外の（ｒ−ｎ）個の要素を抽出することにより、前記擬似数値ａ_ｎ＋１〜ａ_ｒを演算する（ａ_ｎ＋１，ａ_ｎ＋２，…，ａ_ｒ∈Ｆ）。 CA6: Registration assignment value calculating means registration assignment value calculating means CA6 calculates numerical a registered assignment value of the registration numerical value a ₁ ~a _n is imputed the polynomial f (x). The registration assignment value calculating means CA6 of Example 1, the formula for _{(1), x = a 1} , x = a 2, ..., is the value of the polynomial f (x) in the case of the x = _{a n} The registered substitution values f (a ₁ ), f (a ₂ ),..., F (a _n ) are calculated.
CA7: Pseudo math unit pseudo math unit CA7 calculates a is a number other than the registration numerical _{_{a 1 ~a n (r-n}} ) types of pseudo-values. Said pseudo math unit CA7 of Example 1, the pseudo number in case of the _{a n} + 1 ~a _r, of the p elements of the finite field F (original), other than the registered numerical _a 1 ~a _n The pseudo numerical values a _{n + 1 to} a _r are calculated by extracting (r−n) elements of (a _{n + 1} , a _{n + 2} ,..., A _r εF).

ＣＡ８：擬似代入値演算手段
擬似代入値演算手段ＣＡ８は、前記擬似数値が代入された前記多項式ｆ（ｘ）の数値以外の数値である擬似代入値を演算する。実施例１の前記擬似代入値演算手段ＣＡ８は、１からｒまで値をとる変数をｉ，ｊとし、前記擬似数値ａ_ｎ＋１〜ａ_ｒに対応する前記擬似代入値をｆ′（ａ_ｎ＋１）〜ｆ′（ａ_ｒ）とした場合に、ｆ′（ａ_ｉ）≠ｆ（ａ_ｉ）且つｆ′（ａ_ｉ）≠ｆ（ａ_ｊ）且つｆ′（ａ_ｉ）≠ｆ′（ａ_ｊ）となる前記擬似代入値をｆ′（ａ_ｎ＋１）〜ｆ′（ａ_ｒ）を演算する。 CA8: Pseudo-assignment value calculation means The pseudo-assignment value calculation means CA8 calculates a pseudo-assignment value that is a numerical value other than the numerical value of the polynomial f (x) to which the pseudo-numeric value is assigned. The pseudo-assignment value calculation means CA8 according to the first embodiment sets i, j as variables that take values from 1 to r, and sets the pseudo-assignment values corresponding to the pseudo-values a _{n + 1 to} a _r to f ′ (a _{n + 1} ) ˜ When f ′ (a _r ), f ′ (a _i ) ≠ f (a _i ) and f ′ (a _i ) ≠ f (a _j ) and f ′ (a _i ) ≠ f ′ (a _j ) F ′ (a _{n + 1} ) to f ′ (a _r ) are calculated from the pseudo-assigned values.

ＣＡ９：秘匿登録情報演算手段
秘匿登録情報演算手段ＣＡ９は、前記登録数値ａ_１〜ａ_ｎおよび前記登録数値ａ_１〜ａ_ｎに対応する前記登録代入値ｆ（ａ_１），ｆ（ａ_２），…，ｆ（ａ_ｎ）を一組とする前記多項式上の点を登録多項式点とし、前記擬似数値ａ_ｎ＋１〜ａ_ｒおよび前記擬似数値ａ_ｎ＋１〜ａ_ｒに対応する前記擬似代入値ｆ′（ａ_ｎ＋１）〜ｆ′（ａ_ｒ）を一組とする前記多項式以外の点を擬似多項式点とした場合に、ｎ個の前記登録多項式点と、（ｒ−ｎ）個の前記擬似多項式点とを有するｒ個の点の集合である多項式点集合を演算することにより、前記登録情報が秘匿化された秘匿登録情報を演算する。すなわち、実施例１の前記秘匿登録情報演算手段ＣＡ９は、前記ＲＳ符号を用いたファジーボールトにおける前記（ｎ，ｋ）符号の一例としての前記登録多項式点と、前記チャフの一例としての前記擬似多項式点とを有する前記ボールトＲの一例としての前記多項式点集合Ｒ（秘匿登録情報Ｒ）を演算する。 CA9: confidential registration information calculating means confidential registration information calculating means CA9, the registration numerical _a 1 ~a _n and the registration numerical value _a 1 ~a corresponding to _n the registered value substituted into _{_{f (a 1), f (}} a 2) , ..., f _{(a n)} of a point registration polynomial point on the polynomial to a set, the pseudo numerical _{a n} + 1 ~a _r and the pseudo numerical _{a n} + 1 ~a corresponding to _r the pseudo assignment value f ' When the points other than the polynomial having a set of (a _{n + 1} ) to f ′ (a _r ) are pseudo-polynomial points, n registered polynomial points and (r−n) pseudo-polynomial points The secret registration information in which the registration information is concealed is calculated by calculating a polynomial point set, which is a set of r points including: That is, the secret registration information calculation unit CA9 according to the first embodiment includes the registration polynomial point as an example of the (n, k) code in the fuzzy vault using the RS code and the pseudo polynomial as an example of the chaff. The polynomial point set R (secret registration information R) as an example of the vault R having points is calculated.

実施例１の前記秘匿登録情報演算手段ＣＡ９は、前記登録多項式点を（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））とし、前記擬似多項式点を（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））とした場合に、以下の式（３）に示す前記多項式点集合Ｒを演算する。
Ｒ＝（（ａ_１，ｆ（ａ_１）），
（ａ_２，ｆ（ａ_２）），…，
（ａ_ｎ，ｆ（ａ_ｎ）），
（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１）），
（ａ_ｎ＋２，ｆ′（ａ_ｎ＋２）），…，
（ａ_ｒ，ｆ′（ａ_ｒ））） …（３）
なお、実施例１の前記秘匿登録情報演算手段ＣＡ９では、前記秘匿登録情報Ｒは、前記式（３）に示す前記多項式点集合Ｒの各数値ａ_１〜ａ_ｒについて、昇順に並び替えられた状態で出力される（演算される）。 The secret registration information calculation means CA9 according to the first embodiment sets the registration polynomial points as (a ₁ , f (a ₁ )) to (a _n , f (a _n )), and sets the pseudo polynomial points as (a _{n + 1} , When it is assumed that f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )), the polynomial point set R shown in the following equation (3) is calculated.
R = ((a ₁ , f (a ₁ )),
_{_{(A 2, f (a 2}} )), ...,
_{_{(A n, f (a n}} )),
(A _{n + 1} , f ′ (a _{n + 1} )),
(A _{n + 2} , f ′ (a _{n + 2} )),...
( _Ar , f '( _ar ))) (3)
In the secret registration information calculation unit CA9 according to the first embodiment, the secret registration information R is rearranged in ascending order with respect to the numerical values a _{1 to} a _r of the polynomial point set R shown in the formula (3). It is output (calculated) in the state.

ＣＡ10：秘匿登録情報送信手段
秘匿登録情報送信手段ＣＡ10は、前記秘匿登録情報演算手段ＣＡ９により演算された前記秘匿登録情報Ｒを、前記データベースサーバＤＳに対して送信する。
ＣＡ11：終了判別手段
終了判別手段ＣＡ11は、前記秘匿登録情報送信プログラムＡＰ１を終了する入力がされたか否かを判別する。 CA10: Secret registration information transmission means Secret registration information transmission means CA10 transmits the secret registration information R calculated by the secret registration information calculation means CA9 to the database server DS.
CA11: End determination means The end determination means CA11 determines whether or not an input for ending the secret registration information transmission program AP1 has been made.

（検索用クライアントパソコンＰＣｂの制御部の説明）
前記検索用クライアントパソコンＰＣｂのハードディスクドライブには、前記検索用クライアントパソコンＰＣｂの基本動作を制御する基本ソフト（オペレーティングシステム）ＯＳや、アプリケーションプログラムとしての秘匿検索情報送信プログラムＡＰ２、秘匿類似情報復元プログラムＡＰ３、その他の図示しないソフトウェアが記憶されている。 (Description of control unit of search client personal computer PCb)
The hard disk drive of the search client personal computer PCb includes a basic software (operating system) OS for controlling the basic operation of the search client personal computer PCb, a secret search information transmission program AP2 as an application program, and a secret similar information restoration program AP3. Other software (not shown) is stored.

（秘匿検索情報送信プログラムＡＰ２）
前記秘匿検索情報送信プログラムＡＰ２は、下記の機能手段（プログラムモジュール）を有する。 (Secret search information transmission program AP2)
The secret search information transmission program AP2 includes the following functional means (program modules).

図４は実施例１の登録画像の説明図である。
ＣＢ１：検索画像表示手段
検索画像表示手段ＣＢ１は、図４に示す、前記検索情報と同一または類似する前記類似情報を前記データベースサーバＤＳに検索させるための検索画像３を前記ディスプレイＨ２に表示する。実施例１の前記登録画像３は、前記検索情報をｔとし、前記類似情報をｔ′とした場合に、前記検索情報ｔ（例えば、「ＧＧＣＣＡＧＧＧＣＡＣＣ」）を入力するための検索情報入力部３ａと、入力された前記検索情報ｔの類似情報ｔ′を、秘匿化した状態で、前記データベースサーバＤＳに検索させる処理、いわゆる、秘匿類似情報検索処理を開始させるための検索開始ボタン３ｂと、検索結果としての前記類似情報ｔ′（例えば、「ＧＧＣＣＧＧＧＧＴＧＣＡ」等）を表示するための類似情報出力部３ｃとを有する。 FIG. 4 is an explanatory diagram of a registered image according to the first embodiment.
CB1: Search image display means The search image display means CB1 displays a search image 3 shown in FIG. 4 for causing the database server DS to search for the similar information that is the same as or similar to the search information. The registered image 3 according to the first embodiment includes a search information input unit 3a for inputting the search information t (for example, “GGCCAGGGCACC”) when the search information is t and the similar information is t ′. A search start button 3b for starting the so-called secret similar information search process, which is a process for causing the database server DS to search the similar information t ′ of the input search information t in a concealed state, and a search result And a similar information output unit 3c for displaying the similar information t ′ (for example, “GGCCGGGGTGCA” or the like).

ＣＢ２：検索判別手段
検索判別手段ＣＢ２は、前記データベースサーバＤＳに前記類似情報検索処理を開始させるか否かを判別する。実施例１の前記検索判別手段ＣＢ２は、前記検索情報入力部３ａに前記検索情報ｔが入力されて前記検索開始ボタン３ｂが入力されたか否かを判別することにより、前記データベースサーバＤＳに前記秘匿類似情報検索処理を開始させるか否かを判別する。
ＣＢ３：検索部分情報抽出手段
検索部分情報抽出手段ＣＢ３は、前記検索情報ｔに基づいて、前記検索情報ｔを復元可能なｍ個の前記部分情報である検索部分情報を抽出する。実施例１の前記検索部分情報抽出手段ＣＢ３は、前記検索情報ｔを前記文字列ｓ_ｑｇとみなすことにより（ｔ＝ｓ_ｑｇ）、ｍ個の前記ｑ−ｇｒａｍを前記検索部分情報として抽出する。実施例１の前記検索部分情報抽出手段ＣＢ３は、前記登録部分情報抽出手段ＣＡ４と同様に、前記検索部分情報をｔ_ｑ１〜ｔ_ｑｍとし、前記検索情報ｔの長さ（文字数）を‖ｔ‖とした場合に、ｍ個（ｍ＝‖ｔ‖−ｑ＋１）の部分文字列である前記検索部分情報ｔ_ｑ１〜ｔ_ｑｍを抽出する。例えば、ｑ＝４とした場合、前記検索情報ｔ（「ＧＧＣＣＡＧＧＧＣＡＣＣ」）の検索部分情報ｔ_ｑ１〜ｇ_ｑｍは、ｔ_ｑ１（「ＧＧＣＣ」），ｔ_ｑ２（「ＧＣＣＡ」），…，ｔ_ｑｍ＝ｔ_ｑ９（「ＣＡＣＣ」）の９つである（ｍ＝１２−４＋１＝９）。 CB2: Search Discriminating Unit The search discriminating unit CB2 discriminates whether or not the database server DS starts the similar information search process. The search discriminating means CB2 according to the first embodiment determines whether the search information input unit 3a has input the search information t and the search start button 3b, thereby determining whether the database server DS has the secret. It is determined whether or not to start the similar information search process.
CB3: Search Partial Information Extraction Unit The search partial information extraction unit CB3 extracts search partial information which is m pieces of partial information capable of restoring the search information t based on the search information t. The search part information extraction unit CB3 of Example 1 extracts m pieces of the q-grams as the search part information by regarding the search information t as the character string s _qg (t = s _qg ). Similar to the registered part information extraction unit CA4, the search part information extraction unit CB3 of Embodiment 1 sets the search part information to t _q1 to t _qm, and sets the length (number of characters) of the search information t to ‖t‖. In this case, the search partial information t _{q1 to} t _qm which are m (m = ‖t‖−q + 1) partial character strings are extracted. For example, when q = 4, the search partial information t _{q1 to} g _qm of the search information t (“GGCCAGGGCACC”) is t _q1 (“GGCC”), t _q2 (“GCCA”),..., T _qm = There are nine _tq9 (“CACC”) (m = 12−4 + 1 = 9).

ＣＢ４：検索集合演算手段
検索集合演算手段ＣＢ４は、前記検索部分情報抽出手段ＣＢ３により抽出されたｍ個の前記検索部分情報ｔ_ｑ１〜ｔ_ｑｍに基づいて、ｍ種類の数値である検索数値を要素とする集合である検索集合を演算する。すなわち、実施例１の前記検索集合演算手段ＣＢ４は、前記ＲＳ符号を用いたファジーボールトにおける前記第２の集合Ｂの一例としての前記検索集合Ｂを演算する（Ｂ∈Ｆ^ｍ）。
なお、実施例１の前記検索集合演算手段ＣＢ４は、前記ハッシュ関数Ｈにより、前記検索集合Ｂを演算する。前記検索集合演算手段ＣＢ４は、具体的には、前記ハッシュ関数Ｈによる前記検索部分情報ｔ_ｑ１〜ｔ_ｑｍの出力値をＨ（ｔ_ｑ１）〜Ｈ（ｔ_ｑｍ）とし、前記検索数値をｂ_１〜ｂ_ｍとした場合に（ｂ_１，ｂ_２，…，ｂ_ｍ∈Ｆ）、Ｈ（ｔ_ｑ１）＝ｂ_１，Ｈ（ｔ_ｑ２）＝ｂ_２，…，Ｈ（ｔ_ｑｍ）＝ｂ_ｍを演算することにより、前記検索集合Ｂを演算する（Ｂ＝（ｂ_１，ｂ_２，…，ｂ_ｍ））。なお、実施例１の前記検索集合演算手段ＣＢ４では、前記検索集合Ｂは、前記検索数値ｂ_１〜ｂ_ｍが、昇順に並び替えられた状態で出力される（演算される）。 CB4: Search set calculation means The search set calculation means CB4 uses m pieces of search numerical values as elements based on the m pieces of search partial information t _{q1 to} t _qm extracted by the search partial information extraction means CB3. A search set that is a set is calculated. That is, the search set calculation means CB4 of the first embodiment calculates the search set B as an example of the second set B in the fuzzy vault using the RS code (BεF ^m ).
The search set calculation means CB4 according to the first embodiment calculates the search set B using the hash function H. Specifically, the search set calculation means CB4 uses H (t _q1 ) to H (t _qm ) as output values of the search partial information t _{q1 to} t _qm by the hash function H, and sets the search numerical value to b _1. ˜b _m (b ₁ , b ₂ ,..., B _m εF), H (t _q1 ) = b ₁ , H (t _q2 ) = b ₂ ,..., H (t _qm ) = b _m To calculate the search set B (B = (b ₁ , b ₂ ,..., B _m )). In the search set calculation means CB4 of the first embodiment, the search set B is output (calculated) in a state in which the search numerical values b _{1 to} b _m are rearranged in ascending order.

ＣＢ５：検索集合記憶手段
検索集合記憶手段ＣＢ５は、前記検索集合演算手段ＣＢ４により演算された前記検索集合Ｂ（Ｂ＝（ｂ_１，ｂ_２，…，ｂ_ｍ））を記憶する。
ＣＢ６：秘匿検索情報演算手段
秘匿検索情報演算手段ＣＢ６は、自然数をＬとし、Ｌ＜ｍが成立するとした場合に、ｍ種類の前記検索数値ｂ_１〜ｂ_ｍのうち、Ｌ種類の前記検索数値を除く（ｍ−Ｌ）種類の前記検索数値（ｂ_１〜ｂ_ｍ）を要素とする前記検索集合Ｂの部分集合である検索部分集合を演算することにより、前記検索情報ｔが秘匿化された秘匿検索情報を演算する。実施例１の前記秘匿検索情報演算手段ＣＢ６は、前記検索部分集合をＢ^＊とした場合に、前記検索集合Ｂの検索数値ｂ_１〜ｂ_ｍからＬ種類の検索数値が除かれた、（ｍ−Ｌ）種類の前記検索数値（ｂ_１〜ｂ_ｍ）を要素とする前記検索部分集合（秘匿検索情報）Ｂ^＊を演算する。例えば、Ｌ＝２とし、変数ｉを１〜３およびｍ以外の数値とし、前記検索数値ｂ_１〜ｂ_ｍのうちのｂ_２およびｂ_ｉの２種類の数値が除かれた場合、（ｂ_１，０，ｂ_３，…，ｂ_ｉ−１，０，ｂ_ｉ＋１，…，ｂ_ｍ）→（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）＝Ｂ^＊を演算する。 CB5: Search Set Storage Unit The search set storage unit CB5 stores the search set B (B = (b ₁ , b ₂ ,..., B _m )) calculated by the search set calculation unit CB4.
CB6: Secret search information calculation means The secret search information calculation means CB6, when natural number is L and L <m is established, among the m kinds of search numerical values b _{1 to} b _m , the L types of search numerical values. The search information t is concealed by calculating a search subset that is a subset of the search set B having (m−L) types of search numerical values (b _{1 to} b _m ) as elements. Secret search information is calculated. The secret search information calculation means CB6 according to the first embodiment is configured such that when the search subset is B ^* , L types of search numerical values are removed from the search numerical values b _{1 to} b _{m of the} search set B (m -L) The search subset (secret search information) B ^* having the search numerical values (b _{1 to} b _m ) of the types as elements is calculated. For example, when L = 2, the variable i is a numerical value other than 1 to 3 and m, and two types of numerical values b ₂ and b _i out of the search numerical values b _{1 to} b _m are removed, (b ₁ _{_{, 0, b 3, ...,}} b i-1, 0, b i + 1, ..., b m) → (b 1, b 3, ..., b i-1, b i + 1, ..., a b m) = ^{B *} Calculate.

ＣＢ７：秘匿検索情報送信手段
秘匿検索情報送信手段ＣＢ７は、前記秘匿検索情報演算手段ＣＢ６により演算された前記秘匿検索情報Ｂ^＊を、前記データベースサーバＤＳに対して送信する。
ＣＢ８：終了判別手段
終了判別手段ＣＢ８は、前記秘匿検索情報送信プログラムＡＰ２を終了する入力がされたか否かを判別する。 CB7: Secret Search Information Transmission Unit The secret search information transmission unit CB7 transmits the secret search information B ^* calculated by the secret search information calculation unit CB6 to the database server DS.
CB8: End determination means The end determination means CB8 determines whether or not an input to end the confidential search information transmission program AP2 has been made.

（秘匿類似情報復元プログラムＡＰ３）
また、前記秘匿類似情報復元プログラムＡＰ３は、下記の機能手段（プログラムモジュール）を有する。 (Secret similarity information restoration program AP3)
The secret similar information restoration program AP3 includes the following functional means (program modules).

ＣＢ９：秘匿類似情報受信手段
秘匿類似情報受信手段ＣＢ９は、後述する秘匿類似情報送信手段ＣＤ８により送信された前記類似情報ｔ′が秘匿化された秘匿類似情報を受信する。実施例１の前記秘匿類似情報受信手段ＣＢ９では、前記秘匿類似情報として、前記データベースサーバＤＳに登録済の複数の前記多項式点集合Ｒの一例としてのＭ個の候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭを要素とする集合である解候補集合Ｒ_Ａ（Ｒ_Ａ＝（Ｒ_Ａ１，Ｒ_Ａ２，…，Ｒ_ＡＭ））を前記秘匿類似情報として受信する。なお、前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭおよび前記解候補集合Ｒ_Ａについては後述する。 CB9: Secret Similarity Information Receiving Unit The secret similar information receiving unit CB9 receives the secret similar information in which the similar information t ′ transmitted by the secret similar information transmitting unit CD8 described later is concealed. In the secret similar information receiving unit CB9 according to the first embodiment, M candidate polynomial point sets R _{A1 to} R _AM as an example of the plurality of polynomial point sets R registered in the database server DS as the secret similar information. _A candidate solution set R _A (R _A = (R _A1 , R _A2 ,..., R _AM )) is received as the secret similar information. The candidate polynomial point sets R _{A1 to} R _AM and the solution candidate set R _A will be described later.

ＣＢ10：候補多項式点集合演算手段
候補多項式点集合演算手段ＣＢ10は、検索数値判別手段ＣＢ10Aと、数値抽出手段ＣＢ10Bとを有し、前記類似情報ｔ′を演算可能な多項式点集合の候補としての候補多項式点集合を演算する。実施例１の前記候補多項式点集合演算手段ＣＢ10は、前記各候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭの部分集合となる前記候補多項式点部分集合をＱ_１〜Ｑ_Ｍとした場合に、前記検索集合記憶手段ＣＢ５に記憶した前記検索集合Ｂに含まれる検索数値ｂ_１〜ｂ_ｍと、前記秘匿類似情報受信手段ＣＢ９により受信した前記秘匿類似情報（解候補集合Ｒ_Ａ）に含まれる各候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭとに基づいて、前記各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍを演算する。 CB10: Candidate Polynomial Point Set Calculation Means Candidate polynomial point set calculation means CB10 has search numerical value determination means CB10A and numerical value extraction means CB10B, and is a candidate as a candidate polynomial point set capable of calculating the similarity information t ′. Compute a set of polynomial points. The candidate polynomial point set computing means CB10 of the first embodiment uses the search set when the candidate polynomial point subsets that are subsets of the candidate polynomial point sets R _{A1 to} R _AM are Q _{1 to} Q _M. Search numerical values b _{1 to} b _m included in the search set B stored in the storage unit CB5, and candidate polynomial points included in the secret similar information (solution candidate set _RA ) received by the secret similar information receiving unit CB9 The candidate polynomial point subsets Q _{1 to} Q _M are calculated based on the sets R _{A1 to} R _AM .

ＣＢ10A：検索数値判別手段
検索数値判別手段ＣＢ10Aは、前記検索数値ｂ_１〜ｂ_ｍが、前記各候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭに含まれる登録数値ａ_１〜ａ_ｎまたは前記擬似数値ａ_ｎ＋１〜ａ_ｒと同値であるか否かを判別する。実施例１の前記検索数値判別手段ＣＢ10Aは、ｍ個の前記検索数値ｂ_１〜ｂ_ｍが、前記各数値ａ_１〜ａ_ｒと同値であるか否かを、Ｍ個の前記各候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭについて全て判別する。なお、前記検索数値判別手段ＣＢ10Aでは、ｍ個の前記検索数値ｂ_１〜ｂ_ｍと、ｒ個の前記各数値ａ_１〜ａ_ｒとが昇順に並べられているため、例えば、ｂ_１＝ａ_３が成立する場合には、検索数値ｂ_２は数値ａ_４以降の数値と比較してゆくことにより、処理を高速化できる。
ＣＢ10B：数値抽出手段
数値抽出手段ＣＢ10Bは、前記検索数値ｂ_１〜ｂ_ｍと同値となる前記登録数値ａ_１〜ａ_ｎおよび前記擬似数値ａ_ｎ＋１〜ａ_ｒを抽出することにより、同値となる各数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））を抽出する。 CB10A: Search numerical determination means searches numerical discriminating means CB10A, the search numerical _b 1 ~b _m is registered numerical included to the each candidate polynomial point set _{_R} A1 _~R _AM _a 1 ~a _n or the pseudo numeric _{a n + 1} it is determined whether or not the ~a _r and equivalent. The search numerical value discriminating means CB10A according to the first embodiment determines whether the m search numerical values b _{1 to} b _m are equal to the numerical values a _{1 to} a _r or not. All of the sets R _{A1 to} R _AM are determined. In the search numerical value determining means CB10A, the m search numerical values b _{1 to} b _m and the r numerical values a _{1 to} a _r are arranged in ascending order, for example, b ₁ = a _{When 3} is established, the search numerical value b ₂ is compared with the numerical values after the numerical value a ₄ , thereby speeding up the processing.
CB10B: Numerical extractor numerical extracting means CB10B by extracting the registration numerical _a 1 ~a _n and the pseudo numerical _{a n} + 1 ~a _r serving as the search numerical _b 1 ~b _m and equivalence, each becomes equivalent Polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )), (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f corresponding to the numerical values a _{1 to} a _r ′ ( _Ar )) is extracted.

実施例１の前記数値抽出手段ＣＢ10Bは、ｉ＝１，２，…，ｍとし、ｊ＝１，２，…，Ｍとし、変数をｘ_ｉ，ｙ_ｉとし、変数ｘ_ｉ，ｙ_ｉにより構成された点を（ｘ_ｉ，ｙ_ｉ）とした場合に、まず、前記検索情報Ｂに含まれるｉ番目の検索数値ｂ_ｉが、ｊ番目の候補多項式点集合Ｒ_Ａｊに含まれる数値ａ_１〜ａ_ｒのいずれかと同値である場合には、前記同値の数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））を点（ｘ_ｉ，ｙ_ｉ）にセットする（代入する）。例えば、ｂ_ｉ＝ａ_１となる場合には、点（ｘ_ｉ，ｙ_ｉ）に多項式点（ａ_１，ｆ（ａ_１））をセットする。また、ｉ番目の検索数値ｂ_ｉが、ｊ番目の候補多項式点集合Ｒ_Ａｊの数値ａ_１〜ａ_ｒと同値でない場合には、点（ｘ_ｉ，ｙ_ｉ）に何もセットしない。例えば、ｂ_ｉ≠ａ_１，ａ_２，…，ａ_ｒとなる場合には、点（ｘ_ｉ，ｙ_ｉ）に点（ｏ，ｏ）をセットする（（ｘ_ｉ，ｙ_ｉ）＝（ｏ，ｏ）＝φ）。 The numerical value extraction means CB10B according to the first embodiment includes i = 1, 2,..., M, j = 1, 2,..., M, variables x _i and y _i, and variables x _i and y _i. When the determined point is (x _i , y _i ), first, the i-th search numerical value b _i included in the search information B is converted into numerical values a ₁ to n included in the j-th candidate polynomial point set R _Aj. If it is either the same value of a _r is a polynomial point corresponding to the equivalence of numbers _{_{_{_{a 1 ~a r (a 1,}}}} f (a 1)) ~ (a n, f (a n)), (a _{n + 1} , f ′ (a _{n + 1} )) to ( _ar , f ′ ( _ar )) are set (assigned) to the point (x _i , y _i ). For example, when b _i = a ₁ , a polynomial point (a ₁ , f (a ₁ )) is set at the point (x _i , y _i ). If the i-th search numerical value b _i is not the same as the numerical values a _{1 to} a _{r of} the j-th candidate polynomial point set R _Aj , nothing is set at the point (x _i , y _i ). For example, when b _i ≠ a ₁ , a ₂ ,..., A _r , the point (o, o) is set to the point (x _i , y _i ) ((x _i , y _i ) = (o , O) = φ).

すなわち、以下の式（４）に示すように、ｊ番目の候補多項式点集合Ｒ_Ａｊについてのｉ番目の検索数値ｂ_ｉの射影を演算する。
（ｂ_ｉ，ｏ）
（ｘ_ｉ，ｙ_ｉ）←−−−−−Ｒ_Ａｊ …（４）
そして、前記射影である点（ｘ_ｉ，ｙ_ｉ）を、ｊ番目の候補多項式点部分集合Ｑ_ｊの要素とする（Ｑ_ｊ←Ｑ_ｊ∪（ｘ_ｉ，ｙ_ｉ））。
したがって、実施例１の前記候補多項式点集合演算手段ＣＢ10は、変数ｉ，ｊの全ての値について、上記の処理を繰り返して、全ての候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭの候補多項式点部分集合Ｑ_１〜Ｑ_Ｍを演算する。 That is, as shown in the following equation (4), the projection of the i-th search numerical value b _{i for} the j-th candidate polynomial point set R _Aj is calculated.
(B _i , o)
(X _i , y _i ) ← −−−−− R _Aj (4)
Then, the point (x _i , y _i ) that is the projection is an element of the j-th candidate polynomial point subset Q _j (Q _j ← Q _j ∪ (x _i , y _i )).
Therefore, the candidate polynomial point set computing means CB10 of the first embodiment repeats the above processing for all values of the variables i and j, and sets the candidate polynomial point subsets of all candidate polynomial point sets R _{A1 to} R _AM. to calculate the Q ₁ ~Q _M.

ＣＢ11：解集合演算手段
解集合演算手段ＣＢ11は、類似情報復元判別手段ＣＢ11Aを有し、前記類似情報ｔ′を復元可能な各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍのみを要素とする集合である解集合を演算する。実施例１の前記解集合演算手段ＣＢ11は、０を除いた自然数をｃとし、前記各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍの要素数、すなわち、抽出された多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））の総数をそれぞれ‖Ｑ_１‖〜‖Ｑ_Ｍ‖とし、前記解集合をＲ_Ｂとした場合に、前記要素数‖Ｑ_１‖〜‖Ｑ_Ｍ‖がｃ個以上であると判別された前記各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍのみを要素とする前記解集合Ｒ_Ｂを演算する。 CB11: solution set calculation means The solution set calculation means CB11 is a set having similar information restoration discriminating means CB11A and having only candidate polynomial point subsets Q _{1 to} Q _M that can restore the similar information t ′ as elements. Compute a set of solutions. The solution set calculation means CB11 of the first embodiment uses c as a natural number excluding 0, and the number of elements of each of the candidate polynomial point subsets Q _{1 to} Q _M , that is, extracted polynomial points (a ₁ , f ( a ₁ )) to (a _n , f (a _n )), (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )) are respectively expressed as ‖Q ₁ ‖ to ‖Q and _M ‖, the solution set in case of the R _B, the number of elements ‖Q ₁ ‖~‖Q _M ‖ is only discriminated each candidate polynomials point subset Q ₁ to Q _M is c or more The solution set R _B is computed with the element as the element.

ＣＢ11A：類似情報復元判別手段
類似情報復元判別手段ＣＢ11Aは、０を除いた自然数をｃとした場合に、前記数値抽出手段ＣＢ10Aにより抽出された前記登録数値ａ_１〜ａ_ｎおよび前記擬似数値ａ_ｎ＋１〜ａ_ｒがｃ個以上であるか否かを判別することにより、前記各秘匿類似情報が、前記検索情報ｂ_１〜ｂ_ｍに対する前記類似情報ｔ′として復元可能であるか否かを判別する。実施例１の前記類似情報復元判別手段ＣＢ11Aは、前記要素数‖Ｑ_１‖〜‖Ｑ_Ｍ‖がｃ個以上であるか否かを判別することにより、抽出された前記数値ａ_１〜ａ_ｒがｃ個以上であるか否かを判別する。なお、実施例１の前記類似情報復元判別手段ＣＢ12では、前記自然数ｃの値が、以下の式（５）により設定されている。
ｃ＝ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１ …（５） CB11A: Similar information restoring determination means similar information restoring determination means CB11A is a natural number excluding 0 when is c, the registration numerical values extracted by said numerical extracting means CB10A a ₁ ~a _n and the pseudo numerical a _{n + 1} It is determined whether or not each secret similar information can be restored as the similar information t ′ with respect to the search information b _{1 to} b _m by determining whether or not ~ _ar is c or more. . The similar information restoration discriminating means CB11A according to the _first embodiment discriminates whether or not the number of elements ‖Q ₁ ‖ to 以上 Q _M ‖ is c or more, thereby extracting the extracted numerical values a _{1 to} a _r. Whether or not is greater than or equal to c. In the similar information restoration determination means CB12 of the first embodiment, the value of the natural number c is set by the following equation (5).
c = max (n, m) − (d−1) q−1 (5)

ＣＢ12：類似情報復元手段
類似情報復元手段ＣＢ12は、前記解集合演算手段ＣＢ11により演算された前記解集合Ｒ_Ｂに含まれる前記登録数値ａ_１〜ａ_ｎおよび前記擬似数値ａ_ｎ＋１〜ａ_ｒに基づいて、前記多項式ｆ（ｘ）を演算して前記類似情報ｔ′を復元する。なお、実施例１の前記類似情報復元手段ＣＢ12では、前記解集合Ｒ_Ｂの各候補多項式点部分集合（Ｑ_１〜Ｑ_Ｍ）に含まれる前記登録数値ａ_１〜ａ_ｎに対応する登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））が｛（ｎ＋ｋ）／２｝個以上既知である場合には、従来公知の誤り訂正符号の復号化アルゴリズムに基づいて、前記解集合Ｒ_Ｂから前記多項式ｆ（ｘ）を復元できる。すなわち、ｋ個の前記係数ｓ_０〜ｓ_ｋ−１が前記解集合Ｒ_Ｂの各候補多項式点部分集合（Ｑ_１〜Ｑ_Ｍ）ごとに演算される。このため、前記類似情報復元手段ＣＢ12では、まず、前記復号化アルゴリズムに基づいて、前記解集合Ｒ_Ｂの各候補多項式点部分集合（Ｑ_１〜Ｑ_Ｍ）を順番に前記多項式ｆ（ｘ）に復元する（係数ｓ_０〜ｓ_ｋ−１を演算する）。そして、前記多項式ｆ（ｘ）を復元できた場合には、演算された前記係数ｓ_０〜ｓ_ｋ−１については、前記逆関数Ｔ^−１により、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}に逆変換することにより、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}に基づく前記類似情報ｔ′（ｔ′＝ｓ）を演算する。 CB12: Similar information restoring means similar information restoring means CB12 is based on the registration numerical _a 1 ~a _n and the pseudo numerical _{a n} + 1 ~a _r included in computed by the solution set operation means CB11 the solution set _{R B} Then, the similarity information t ′ is restored by calculating the polynomial f (x). In the similar information restoring means CB12 Example 1, the solution set _R each candidate polynomials point subset _(Q 1 to Q _M) in the registration numerical value _a 1 ~a corresponding registers polynomial point to _n included in _B When (a ₁ , f (a ₁ )) to (a _n , f (a _n )) are known to be {(n + k) / 2} or more, based on a conventionally known error correction code decoding algorithm Thus, the polynomial f (x) can be restored from the solution set R _B. That, k-number of the coefficients _s 0 _{~s k-1} is calculated for each candidate polynomials point subset of the solution set _{_{_{R B (Q 1 ~Q M)}}} . Therefore, in the similarity information restoring means CB12, firstly, on the basis of the decoding algorithm, the candidate polynomial point subset of the solution set R _B a _(Q 1 _~Q _M) to said sequentially polynomial f (x) Restore (calculate coefficients s _{0 to} s _k-1 ). Then, when the polynomial f (x) can be restored, the partial characters s _{k (1) to} s are calculated by the inverse function T ^{−1 for} the calculated coefficients s _{0 to} s _k−1. _The similarity information t ′ (t ′ = s) based on the partial character strings s _{k (1) to} s _{k (k−1} ) is calculated by performing inverse conversion to _{k (k−1)} .

この場合、例えば、ｊ番目の候補多項式点部分集合Ｑ_ｊから前記多項式ｆ（ｘ）を復元できない場合には、前記候補多項式点部分集合Ｑ_ｊには｛（ｎ＋ｋ）／２｝個の前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））が存在しなかったものとみなして、次のｊ＋１番目の候補多項式点部分集合Ｑ_ｊ＋１の復元処理を実行する。すなわち、｛（ｎ＋ｋ）／２｝個の前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））が存在するか否かの判別ができないため、前記多項式ｆ（ｘ）を復元できた場合のみ、前記類似情報ｔ′を復元する。したがって、例えば、前記多項式ｆ（ｘ）を復元できる候補多項式点部分集合Ｑ_ｊが前記解集合Ｒ_Ｂに１個も含まれていない場合には、前記類似情報ｔ′を１つも復元せずに終了する場合もあり得る。 In this case, for example, j-th when the candidate polynomial point subset Q _j can not restore the polynomial f (x), said a candidate polynomial point subset _{Q j {(n + k)} / 2} pieces of the registration It is assumed that the polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) do not exist, and the restoration process of the next j + 1-th candidate polynomial point subset Q _{j + 1} is executed. To do. That is, since it cannot be determined whether there are {(n + k) / 2} registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )), the polynomial The similar information t ′ is restored only when f (x) can be restored. Therefore, for example, when the candidate set of polynomial points Q _j that can restore the polynomial f (x) is not included in the solution set R _B , the similarity information t ′ is not restored. It may be terminated.

なお、実施例１の前記類似情報復元手段ＣＢ12では、実装が容易であり且つ効率的な前記復号化アルゴリズムの一例としてのＢＭアルゴリズム（Berlekamp-Masseyアルゴリズム、バーレカンプ・マッシィ法、ＢＭ法）により、前記多項式ｆ（ｘ）を演算する。なお、前記ＢＭアルゴリズムについては、例えば、非特許文献６、７等に記載されており、公知であるため、詳細な説明を省略する。
ＣＢ13：編集距離演算手段
編集距離演算手段ＣＢ13は、前記検索情報入力部３ａに入力された前記検索情報ｔと、前記類似情報復元手段ＣＢ12により復元された前記類似情報ｔ′との前記編集距離ｄを演算する。実施例１の前記編集距離演算手段ＣＢ13は、前記検索情報ｔと、復元された前記解集合Ｒ_Ｂの各候補多項式点部分集合に対応する各類似情報ｔ′との各編集距離ｄを全て演算する。 In the similar information restoring means CB12 of the first embodiment, the BM algorithm (Berlekamp-Massey algorithm, Balecamp Massy method, BM method) as an example of the decoding algorithm that is easy to implement and efficient can be used. The polynomial f (x) is calculated. Note that the BM algorithm is described in, for example, Non-Patent Documents 6 and 7, and is well known, and thus detailed description thereof is omitted.
CB13: Edit distance calculation means The edit distance calculation means CB13 is the edit distance d between the search information t input to the search information input unit 3a and the similarity information t 'restored by the similarity information restoration means CB12. Is calculated. The edit distance calculation means CB13 of Example 1, all said search information t, the respective edit distance d between each similarity information t 'corresponding to each candidate polynomial point subset of the restored the solution set R _B operation To do.

ＣＢ14：類似情報表示手段
類似情報表示手段ＣＢ14は、編集距離判別手段ＣＢ14Aを有し、前記検索情報入力部３ａに入力された前記検索情報ｔと、前記類似情報復元手段ＣＢ12により復元された前記類似情報ｔ′との前記編集距離ｄが予め設定された前記最大値ｄ_ｍａｘ以下である場合に（ｄ≦ｄ_ｍａｘ）、前記各類似情報ｔ′を前記類似情報出力部３ｃに表示する（図４参照）。実施例１の前記類似情報表示手段ＣＢ14は、前記編集距離ｄが前記最大値ｄ_ｍａｘ以下となる、前記解集合Ｒ_Ｂの各候補多項式点部分集合に対応する各類似情報ｔ′を一覧表示する。
ＣＢ14A：編集距離判別手段
編集距離判別手段ＣＢ14Aは、前記検索情報ｔと、前記解集合Ｒ_Ｂの各候補多項式点部分集合に対応する各類似情報ｔ′との各編集距離ｄが、前記最大値ｄ_ｍａｘ以下であるか否かを判別する。 CB14: Similar information display means The similar information display means CB14 has an edit distance discrimination means CB14A, and the search information t input to the search information input unit 3a and the similar information restored by the similar information restoration means CB12. When the edit distance d with respect to the information t ′ is equal to or less than the preset maximum value d _max (d ≦ d _max ), the similar information t ′ is displayed on the similar information output unit 3c (FIG. 4). reference). The similar information display means CB14 of the first embodiment displays a list of each piece of similar information t ′ corresponding to each candidate polynomial point subset of the solution set R _{B in which} the edit distance d is equal to or less than the maximum value d _max. .
CB14A: Edit distance discriminating means edit distance discriminating means CB14A, said the search information t, the edit distance d between each similarity information t 'corresponding to each candidate polynomial point subset of the solution set R _B is, the maximum value It is determined whether or not d _max or less.

（データベースサーバＤＳの制御部の説明）
前記データベースサーバＤＳのハードディスクドライブには、前記データベースサーバＤＳの基本動作を制御する基本ソフト（オペレーティングシステム）ＯＳや、アプリケーションプログラムとしての秘匿類似情報検索プログラム（類似情報検索プログラム）ＡＰ４、その他の図示しないソフトウェアが記憶されている。 (Description of the control unit of the database server DS)
In the hard disk drive of the database server DS, a basic software (operating system) OS for controlling basic operations of the database server DS, a secret similar information search program (similar information search program) AP4 as an application program, and the like are not shown. Software is stored.

（秘匿類似情報検索プログラムＡＰ４）
前記秘匿類似情報検索プログラムＡＰ４は、下記の機能手段（プログラムモジュール）を有する。 (Secret similarity information search program AP4)
The secret similar information search program AP4 has the following functional means (program modules).

ＣＤ１：秘匿登録情報受信手段
秘匿登録情報受信手段ＣＤ１は、前記秘匿登録情報送信手段ＣＡ10により送信された前記秘匿登録情報Ｒを受信する。
ＣＤ２：秘匿登録情報記憶手段
秘匿登録情報記憶手段ＣＤ２は、前記秘匿登録情報受信手段ＣＤ１により受信した前記秘匿登録情報Ｒを記憶する。実施例１の前記秘匿登録情報記憶手段ＣＤ２は、０を除いた自然数をＮとし、受信した前記秘匿登録情報ＲがＮ番目に記憶される秘匿登録情報とし、記憶される前記秘匿登録情報（ボールト、多項式点集合）をＲ_１，Ｒ_２，…，Ｒ_Ｎ，…とした場合に、受信した前記秘匿登録情報ＲをＲ_Ｎとして記憶する。 CD1: Secret registration information receiving means Secret registration information receiving means CD1 receives the secret registration information R transmitted by the secret registration information transmitting means CA10.
CD2: Secret registration information storage means Secret registration information storage means CD2 stores the secret registration information R received by the secret registration information receiving means CD1. The secret registration information storage means CD2 of the first embodiment sets the natural number excluding 0 as N, and the received secret registration information R as the secret registration information stored Nth, and stores the secret registration information (vault , a set polynomial point) _R _1, R 2, ..., _{R N,} when ... and storing the confidential registration information R received as _{R N.}

ＣＤ３：秘匿検索情報受信手段
秘匿検索情報受信手段ＣＤ３は、前記秘匿検索情報送信手段ＣＢ７により送信された前記秘匿検索情報Ｂ^＊（例えば、Ｂ^＊＝（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）、Ｌ＝２）を受信する。
ＣＤ４：検索数値判別手段
検索数値判別手段ＣＤ４は、前記秘匿登録情報受信手段ＣＤ１により受信した前記秘匿検索情報Ｂ^＊に含まれる（ｍ−Ｌ）種類の前記検索数値（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）が、前記秘匿登録情報記憶手段ＣＤ２に記憶した前記各秘匿登録情報（各多項式点集合）Ｒ_１〜Ｒ_Ｎに含まれる前記登録数値ａ_１〜ａ_ｎまたは前記擬似数値ａ_ｎ＋１〜ａ_ｒと同値であるか否かを判別する。実施例１の前記検索数値判別手段ＣＤ４は、前記検索数値判別手段ＣＢ10Aと同様に、前記検索数値（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）が、前記各多項式点集合Ｒ_１〜Ｒ_Ｎに含まれる数値ａ_１〜ａ_ｒと同値であるか否かを、Ｎ個の前記各多項式点集合Ｒ_１〜Ｒ_Ｎについて全て判別する。 CD3: Secret search information receiving means Secret search information receiving means CD3 is the secret search information B ^* (for example, B ^* = (b ₁ , b ₃ ,..., B _i−) transmitted by the secret search information transmitting means CB7. ₁ , b _{i + 1} ,..., B _m ), L = 2).
CD4: Search numerical value determining means The search numerical value determining means CD4 is the (m−L) types of search numerical values (b ₁ , b ₃ ,...) Included in the confidential search information B ^* received by the confidential registration information receiving means CD1. , B _i−1 , b _{i + 1} ,..., B _m ) are registered numerical values a ₁ included in the respective secret registration information (each set of polynomial points) R _{1 to} R _N stored in the secret registration information storage means CD2. ~a _n or determining whether said is a pseudo numerical _{a n} + 1 ~a _r and equivalent. The search numerical determination unit CD4 of Example 1, similar to the search numerical determination means CB10A, the search numerical _{_{_{(b 1, b 3, ...}}} , b i-1, b i + 1, ..., b m) is the It is determined for each of the _N polynomial point sets R _{1 to} R _N whether or not they are the same as the numerical values a _{1 to} a _r included in the polynomial point sets R _{1 to} R _N.

ＣＤ５：多項式点部分集合演算手段
多項式点部分集合演算手段ＣＤ５は、（ｍ−Ｌ）種類の前記検索数値（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）と同値となる前記登録数値ａ_１〜ａ_ｎの前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））および前記擬似数値ａ_ｎ＋１〜ａ_ｒの前記擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））を抽出することにより、前記多項式点集合（Ｒ_１〜Ｒ_Ｎ）における前記検索数値（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）の前記射影の集合であって、前記多項式点集合Ｒ_１〜Ｒ_Ｎの部分集合である多項式点部分集合を演算する。実施例１の前記多項式点部分集合演算手段ＣＤ５は、前記各多項式点集合Ｒ_１〜Ｒ_Ｎに対応する前記各多項式点部分集合をＱ_１ ^＊〜Ｑ_Ｎ ^＊とした場合に、前記各多項式点集合Ｒ_１〜Ｒ_Ｎから、（ｍ−Ｌ）種類の前記検索数値（ｂ_１，ｂ_３，…，ｂ_ｉ−１，ｂ_ｉ＋１，…，ｂ_ｍ）と同値となる数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））が抽出された前記各多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊を演算する。 CD5: Polynomial point subset computing means Polynomial point subset computing means CD5 includes (m−L) types of search numerical values (b ₁ , b ₃ ,..., B _i−1 , b _{i + 1} ,..., B _m ). the pseudo polynomial of the registration numerical value _a 1 the registration polynomial point _{_{_{~a n (a 1, f (}}} a 1)) ~ (a n, f (a n)) and the pseudo numerical _{a n} + 1 ~a _r to be equivalent By extracting the points (a _{n + 1} , f ′ (a _{n + 1} )) to ( _ar , f ′ ( _ar )), the search numerical values (b ₁ , b) in the polynomial point set (R _{1 to} R _N ) are extracted. ₃ ,..., B _i−1 , b _{i + 1} ,..., B _m ), and a polynomial point subset that is a subset of the polynomial point sets R _{1 to} R _N is calculated. The polynomial point subset computing means CD5 of the first embodiment uses the polynomial point subsets when the polynomial point subsets corresponding to the polynomial point sets R _{1 to} R _N are Q ₁ ^{* to} Q _N ^*. From the sets R _{1 to} R _N , numerical values a _{1 to} a _r that have the same values as the (m−L) types of search numerical values (b ₁ , b ₃ ,..., B _i−1 , b _{i + 1} ,..., B _m ). Polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )), (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )) The polynomial point subsets Q ₁ ^{* to} Q _N ^* are extracted.

具体的には、前記多項式点部分集合演算手段ＣＤ５は、前記数値抽出手段ＣＢ10Bと同様に、ｉ＝１，２，…，（ｍ−Ｌ）とし、ｊ＝１，２，…，Ｎとし、（ｍ−Ｌ）種類の前記検索数値をｂ_１〜ｂ_ｍ−Ｌとした場合に、まず、前記検索情報Ｂに含まれるｉ番目の検索数値ｂ_ｉが、ｊ番目の多項式点集合Ｒ_ｊに含まれる数値ａ_１〜ａ_ｒのいずれかと同値である場合には、前記同値の数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））を点（ｘ_ｉ，ｙ_ｉ）にセットする（代入する）。例えば、ｂ_ｉ＝ａ_１となる場合には、点（ｘ_ｉ，ｙ_ｉ）に多項式点（ａ_１，ｆ（ａ_１））をセットする。また、ｉ番目の検索数値ｂ_ｉが、ｊ番目の多項式点集合Ｒ_ｊの数値ａ_１〜ａ_ｒと同値でない場合には、点（ｘ_ｉ，ｙ_ｉ）に何もセットしない。例えば、ｂ_ｉ≠ａ_１，ａ_２，…，ａ_ｒとなる場合には、点（ｘ_ｉ，ｙ_ｉ）に点（ｏ，ｏ）をセットする（（ｘ_ｉ，ｙ_ｉ）＝（ｏ，ｏ）＝φ）。 Specifically, the polynomial point subset calculation means CD5 sets i = 1, 2,... (M−L), j = 1, 2,..., N, similarly to the numerical value extraction means CB10B. When (m−L) types of search numerical values are b _{1 to} b _m−L , first, the i-th search numerical value b _i included in the search information B is changed to the j-th polynomial point set R _j . If the same value as either a numerical value _a 1 ~a _r included are polynomial point corresponding to the equivalence of numbers _{_{_{_{a 1 ~a r (a 1,}}}} f (a 1)) ~ (a n, f (a _{_{_{n)), (a n +}}} 1, f '(a n + 1)) ~ (a r, f' (a r)) the point _(x i, (substituted set to _{y i)).} For example, when b _i = a ₁ , a polynomial point (a ₁ , f (a ₁ )) is set at the point (x _i , y _i ). If the i-th search numerical value b _i is not equivalent to the numerical values a _{1 to} a _{r of} the j-th polynomial point set R _j , nothing is set at the point (x _i , y _i ). For example, when b _i ≠ a ₁ , a ₂ ,..., A _r , the point (o, o) is set to the point (x _i , y _i ) ((x _i , y _i ) = (o , O) = φ).

すなわち、以下の式（４）′に示すように、ｊ番目の多項式点集合Ｒ_ｊについてのｉ番目の検索数値ｂ_ｉの射影を演算する。
（ｂ_ｉ，ｏ）
（ｘ_ｉ，ｙ_ｉ）←−−−−−Ｒ_ｊ …（４）′
そして、前記射影である点（ｘ_ｉ，ｙ_ｉ）を、ｊ番目の多項式点部分集合Ｑ_ｊ ^＊の要素とする（Ｑ_ｊ ^＊←Ｑ_ｊ ^＊∪（ｘ_ｉ，ｙ_ｉ））。
したがって、実施例１の前記多項式点部分集合演算手段ＣＤ５は、変数ｉ，ｊの全ての値について、上記の処理を繰り返して、全ての多項式点集合Ｒ_１〜Ｒ_Ｎの多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊を演算する。 That is, as shown in the following equation (4) ′, the projection of the i-th search numerical value b _{i for} the j-th polynomial point set R _j is calculated.
(B _i , o)
(X _i , y _i ) ← −−−−− R _j (4) ′
Then, the point (x _i , y _i ) that is the projection is an element of the j-th polynomial point subset Q _j ^* (Q _j ^* ← Q _j ^* _ｊ (x _i , y _i )).
Therefore, the polynomial point subset computing means CD5 of the first embodiment repeats the above processing for all values of the variables i and j, and the polynomial point subset Q _{1 of} all the polynomial point sets R _{1 to} R _N. ^* ~ Q _N ^* is calculated.

ＣＤ６：多項式点部分集合要素数判別手段
多項式点部分集合要素数判別手段ＣＤ６は、前記多項式点部分集合演算手段ＣＤ５により演算された前記各多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊の点が（ｃ−Ｌ）個以上であるか否かを判別する。実施例１の前記多項式点部分集合要素数判別手段ＣＤ６は、前記類似情報復元判別手段ＣＢ11Aと同様に、前記各多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊の多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））の総数をそれぞれ‖Ｑ_１ ^＊‖〜‖Ｑ_Ｎ ^＊‖とした場合に、前記要素数‖Ｑ_１ ^＊‖〜‖Ｑ_Ｎ ^＊‖が（ｃ−Ｌ）個以上であるか否かを判別する。 CD6: Polynomial point subset element number discriminating means Polynomial point subset element number discriminating means CD6 has the points of the polynomial point subsets Q ₁ ^{* to} Q _N ^* calculated by the polynomial point subset arithmetic means CD5 ( c-L) It is determined whether or not the number is greater than or equal to. The polynomial point subset element number discriminating means CD6 of the first embodiment is similar to the similar information restoration discriminating means CB11A in that the polynomial points (a ₁ , f (a) of the respective polynomial point subsets Q ₁ ^{* to} Q _N ^* are used. _{_{_{_{1)) ~ (a n,}}}} f (a n)), (a n + 1, f '(a n + 1)) ~ (a r, f' (a r) the total number of) each ‖Q ₁ ^* ‖~‖Q in case of the _N ^* ‖, the number of elements ‖Q ₁ ^{^*} ‖~‖Q _N ^* ‖ it is determined whether or not (c-L) or more.

ＣＤ７：秘匿類似情報演算手段
秘匿類似情報演算手段ＣＤ７は、（ｃ−Ｌ）個以上の点を有する前記各多項式点部分集合（Ｑ_１ ^＊〜Ｑ_Ｎ ^＊）に対応する前記各秘匿登録情報（Ｒ_１〜Ｒ_Ｎ）を演算することにより、前記類似情報が秘匿化された秘匿類似情報を演算する。実施例１の前記秘匿類似情報演算手段では、０を除いた自然数をＭとし、Ｍ≦Ｎとし、（ｃ−Ｌ）個以上の点を有する前記各多項式点部分集合（Ｑ_１ ^＊〜Ｑ_Ｎ ^＊）に対応するＭ個の前記秘匿登録情報（Ｒ_１〜Ｒ_Ｎ）を前記類似情報ｔ′が秘匿化された可能性がある候補の多項式点集合である候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭとし、前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭを要素とする集合である解候補集合をＲ_Ａとした場合に（Ｒ_Ａ＝（Ｒ_Ａ１，Ｒ_Ａ２，…，Ｒ_ＡＭ））、前記秘匿類似情報である前記解候補集合Ｒ_Ａを演算する。
ＣＤ８：秘匿類似情報送信手段
秘匿類似情報送信手段ＣＤ８は、前記秘匿類似情報演算手段ＣＤ７により演算された前記秘匿類似情報（解候補集合Ｒ_Ａ）を、前記検索用クライアントパソコンＰＣｂに対して送信する。 CD7: Concealed similarity information computing means Concealed similarity information computing means CD7 is configured to provide each of the secret registration information (Q ₁ ^{* to} Q _N ^* ) corresponding to each of the polynomial point subsets (Q ₁ ^{* to} Q _N ^* ) having (c−L) or more points. By calculating R _{1 to} R _N ), secret similar information in which the similar information is concealed is calculated. In the secret similar information calculation means of the first embodiment, each of the polynomial point subsets (Q ₁ ^{* to} Q _N ) has a natural number excluding 0, M, M ≦ N, and (c−L) or more points. ^* ) M pieces of the secret registration information (R _{1 to} R _N ) corresponding to the candidate polynomial point sets R _{A1 to} R _AM which are candidate polynomial point sets for which the similar information t ′ may be concealed. And the candidate candidate point set R _{A1 to} R _AM as a set of solution candidates, where R _A is (R _A = (R _A1 , R _A2 ,..., R _AM )), the secret similarity The solution candidate set _RA which is information is calculated.
CD8: Secret similar information transmitting means Secret similar information transmitting means CD8 transmits the secret similar information (solution candidate set R _A ) calculated by the secret similar information calculating means CD7 to the search client personal computer PCb. .

（実施例１のフローチャートの説明）
次に、実施例１の前記各プログラムＡＰ１〜ＡＰ４の処理の流れをフローチャートを使用して説明する。 (Description of Flowchart of Example 1)
Next, the processing flow of each of the programs AP1 to AP4 according to the first embodiment will be described using a flowchart.

（実施例１の秘匿登録情報送信プログラムＡＰ１の秘匿登録情報送信処理のフローチャートの説明）
図５は実施例１の秘匿登録情報送信プログラムの秘匿登録情報送信処理のフローチャートである。
図５のフローチャートの各ＳＴ（ステップ）の処理は、前記登録用クライアントパソコンＰＣａの制御部のＲＯＭ等に記憶されたプログラムに従って行われる。また、この処理は前記制御部の他の各種処理と並行してマルチタスクで実行される。 (Explanation of Flowchart of Secret Registration Information Transmission Processing of Secret Registration Information Transmission Program AP1 of Embodiment 1)
FIG. 5 is a flowchart of the secret registration information transmission process of the secret registration information transmission program according to the first embodiment.
The processing of each ST (step) in the flowchart of FIG. 5 is performed according to a program stored in the ROM or the like of the control unit of the registration client personal computer PCa. This process is executed in a multitasking manner in parallel with other various processes of the control unit.

図５に示すフローチャートは、前記秘匿登録情報送信プログラムＡＰ１が起動した場合に開始される。
図５のＳＴ１０１において、登録画像２（図３参照）を表示する。そして、ＳＴ１０２に移る。
ＳＴ１０２において、登録情報入力部２ａに登録情報ｓの入力があったか否かを判別する。イエス（Ｙ）の場合はＳＴ１０３に移り、ノー（Ｎ）の場合はＳＴ１０４に移る。
ＳＴ１０３において、登録情報ｓの入力に応じて登録画像２を更新する。そして、ＳＴ１０２に戻る。
ＳＴ１０４において、登録ボタン２ｂが入力されたか否かを判別する。ノー（Ｎ）の場合はＳＴ１０５に移り、イエス（Ｙ）の場合はＳＴ１０６に移る。
ＳＴ１０５において、秘匿登録情報送信プログラムＡＰ１を終了する入力がされたか否かを判別する。ノー（Ｎ）の場合はＳＴ１０２に戻り、イエス（Ｙ）の場合は前記秘匿登録情報送信処理を終了する。 The flowchart shown in FIG. 5 is started when the secret registration information transmission program AP1 is activated.
In ST101 of FIG. 5, the registered image 2 (see FIG. 3) is displayed. Then, the process proceeds to ST102.
In ST102, it is determined whether or not registration information s has been input to registration information input unit 2a. If yes (Y), the process proceeds to ST103, and, if no (N), the process proceeds to ST104.
In ST103, the registered image 2 is updated according to the input of the registration information s. Then, the process returns to ST102.
In ST104, it is determined whether or not the registration button 2b has been input. If no (N), the process moves to ST105, and if yes (Y), the process moves to ST106.
In ST105, it is determined whether or not an input to end the secret registration information transmission program AP1 has been made. If no (N), the process returns to ST102, and if yes (Y), the secret registration information transmission process is terminated.

ＳＴ１０６において、登録情報入力部２ａに登録情報ｓが表示されているか否かを判別する。イエス（Ｙ）の場合はＳＴ１０７に移り、ノー（Ｎ）の場合はＳＴ１０２に戻る。
ＳＴ１０７において、登録情報ｓから（ｋ−１）次元１変数多項式ｆの係数ｓ_０〜ｓ_ｋ−１（式（１）参照）を演算する。すなわち、ｓ_ｋ−１＝１とし、且つ、予め設定された可逆関数Ｔにより、登録情報ｓから抽出された部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}を、係数ｓ_０〜ｓ_ｋ−２に変換する。そして、ＳＴ１０８に移る。
ＳＴ１０８において、登録情報ｓから登録部分情報ｓ_ｑ１〜ｓ_ｑｎを抽出する。そして、ＳＴ１０９に移る。
ＳＴ１０９において、登録部分情報ｓ_ｑ１〜ｓ_ｑｎと、ハッシュ関数Ｈとにより、登録集合Ａを演算する（Ａ＝（ａ_１，ａ_２，…，ａ_ｎ）、Ｈ（ｓ_ｑ１）＝ａ_１，Ｈ（ｓ_ｑ２）＝ａ_２，…，Ｈ（ｓ_ｑｎ）＝ａ_ｎ）。そして、ＳＴ１１０に移る。 In ST106, it is determined whether or not registration information s is displayed in registration information input unit 2a. If yes (Y), the process proceeds to ST107, and, if no (N), the process returns to ST102.
In ST107, the coefficients s _{0 to} s _k-1 (see Expression (1)) of the (k-1) -dimensional one-variable polynomial f are calculated from the registration information s. That is, s _k−1 = 1 and partial character strings s _{k (1) to} sk _(k−1) extracted from the registration information s by a reversible function T set in advance are used as coefficients s ₀ to Convert to s _k-2 . Then, the process proceeds to ST108.
In ST 108, it extracts the registration component information _s q1 _{~s qn} from the registration information s. Then, the process proceeds to ST109.
In ST 109, the registration component information _s q1 _{~s qn,} by the hash function H, calculates the registration set _{_{_{A (A = (a 1,}}} a 2, ..., a n), H (s q1) = a 1, H (s _q2 ) = a ₂ ,..., H (s _qn ) = a _n ). Then, the process proceeds to ST110.

ＳＴ１１０において、変数ｘ_ｉの値を要素とする集合をＸとした場合に、以下の（１）〜（３）の処理を実行し、ＳＴ１１１に移る。
（１）変数ｉ，ｘ_ｉ，ｙ_ｉに、１，０，０をセットする（ｉ＝１，ｘ_ｉ＝０，ｙ_ｉ＝０）。
（２）集合Ｘにφをセットする。すなわち、集合Ｘを空集合とする（Ｘ←φ）。
（３）多項式点集合Ｒにφをセットする。すなわち、多項式点集合Ｒを空集合とする（Ｒ←φ）。
ＳＴ１１１において、次の（１）〜（３）の処理を実行し、ＳＴ１１２に移る。
（１）ｉ番目の登録数値ａ_ｉおよび登録数値ａ_ｉに対応する登録代入値ｆ（ａ_ｉ）を一組とする登録多項式点（ａ_ｉ，ｆ（ａ_ｉ））を演算して、点（ｘ_ｉ，ｙ_ｉ）にセットする（（ｘ_ｉ，ｙ_ｉ）←（ａ_ｉ，ｆ（ａ_ｉ）））。
（２）変数（点（ｘ_ｉ，ｙ_ｉ）のｘ座標の値）ｘ_ｉを、集合Ｘの要素とする（Ｘ←Ｘ∪｛ｘｉ｝）。
（３）点（ｘ_ｉ，ｙ_ｉ）を、多項式点集合Ｒの要素とする（Ｒ←Ｒ∪（ｘ_ｉ，ｙ_ｉ）） In ST110, when the set having the value of variable x _i as an element is X, the following processes (1) to (3) are executed, and the process proceeds to ST111.
(1) Set 1, 0, 0 to variables i, x _i , y _i (i = 1, x _i = 0, y _i = 0).
(2) Set φ to the set X. That is, the set X is an empty set (X ← φ).
(3) Set φ to the polynomial point set R. That is, the polynomial point set R is an empty set (R ← φ).
In ST111, the following processes (1) to (3) are executed, and the process proceeds to ST112.
(1) A registration polynomial point (a _i , f (a _i )) having a set of the registration substitution value f (a _i ) corresponding to the i-th registration value a _i and the registration value a _i is calculated, _(x i, _{y i)} is set to _{_{((x i, y i)}} ← (a i, f (a i))).
(2) Variable (value of x coordinate of point (x _i , y _i )) x _i is an element of set X (X ← X （{xi}).
(3) Let the point (x _i , y _i ) be an element of the polynomial point set R (R ← R∪ (x _i , y _i ))

ＳＴ１１２において、変数ｉに＋１を加算する（ｉ＝ｉ＋１）。そして、ＳＴ１１３に移る。
ＳＴ１１３において、変数ｉが登録多項式点（ａ_ｉ，ｆ（ａ_ｉ））の個数であるｎより大きくなったか否かを判別する。イエス（Ｙ）の場合はＳＴ１１４に移り、ノー（Ｎ）の場合はＳＴ１１１に戻る。
ＳＴ１１４において、次の（１），（２）の処理を実行し、ＳＴ１１５に移る。
（１）ｉ番目の擬似数値ａ_ｉおよび擬似数値ａ_ｉに対応する擬似代入値ｆ′（ａ_ｉ）を一組とする擬似多項式点（ａ_ｉ，ｆ′（ａ_ｉ））を演算して、点（ｘ_ｉ，ｙ_ｉ）にセットする（（ｘ_ｉ，ｙ_ｉ）←（ａ_ｉ，ｆ′（ａ_ｉ））、ｘ_ｉ∈Ｆ−Ｘ、ｙ_ｉ∈Ｆ−｛ｆ（ｘ_ｉ）｝）。
（２）点（ｘ_ｉ，ｙ_ｉ）を、多項式点集合Ｒの要素とする（Ｒ←Ｒ∪（ｘ_ｉ，ｙ_ｉ））） In ST112, +1 is added to the variable i (i = i + 1). Then, the process proceeds to ST113.
In ST113, it is determined whether or not the variable i is larger than n, which is the number of registered polynomial points (a _i , f (a _i )). If yes (Y), the process proceeds to ST114, and, if no (N), the process returns to ST111.
In ST114, the following processes (1) and (2) are executed, and the process moves to ST115.
(1) A pseudo-polynomial point (a _i , f ′ (a _i )) having a set of pseudo-assignment values f ′ (a _i ) corresponding to the i-th pseudo-value a _i and the pseudo-value a _i is calculated. , point _(x i, _{y i)} is set to _{_{((x i, y i)}} ← (a i, f '(a i)), x i ∈F-X, y i ∈F- {f (x i )}).
(2) The point (x _i , y _i ) is an element of the polynomial point set R (R ← R∪ (x _i , y _i )))

ＳＴ１１５において、変数ｉが多項式点集合Ｒに含まれる点の個数であるｒになったか否かを判別する。ノー（Ｎ）の場合はＳＴ１１６に移り、イエス（Ｙ）の場合はＳＴ１１７に移る。
ＳＴ１１６において、変数ｉに＋１を加算する（ｉ＝ｉ＋１）。そして、ＳＴ１１４に戻る。
ＳＴ１１７において、多項式点集合Ｒの要素（ｘ_ｉ，ｙ_ｉ）を変数ｘ_ｉ（ｉ＝１，２，…，ｒ）の昇順に並び替える。
ＳＴ１１８において、秘匿登録情報である多項式点集合ＲをデータベースサーバＤＳに送信する。そして、ＳＴ１０２に戻る。 In ST115, it is determined whether or not the variable i has reached r which is the number of points included in the polynomial point set R. If no (N), the process moves to ST116, and if yes (Y), the process moves to ST117.
In ST116, +1 is added to the variable i (i = i + 1). Then, the process returns to ST114.
In ST117, the elements (x _i , y _i ) of the polynomial point set R are rearranged in ascending order of the variables x _i (i = 1, 2,..., R).
In ST118, the polynomial point set R which is confidential registration information is transmitted to the database server DS. Then, the process returns to ST102.

（実施例１の秘匿検索情報送信プログラムＡＰ２の秘匿検索情報送信処理のフローチャートの説明）
図６は実施例１の秘匿検索情報送信プログラムの秘匿検索情報送信処理のフローチャートである。
図６のフローチャートの各ＳＴ（ステップ）の処理は、前記検索用クライアントパソコンＰＣｂの制御部のＲＯＭ等に記憶されたプログラムに従って行われる。また、この処理は前記制御部の他の各種処理と並行してマルチタスクで実行される。 (Explanation of Flowchart of Secret Search Information Transmission Process of Secret Search Information Transmission Program AP2 of Embodiment 1)
FIG. 6 is a flowchart of the confidential search information transmission process of the confidential search information transmission program according to the first embodiment.
The processing of each ST (step) in the flowchart of FIG. 6 is performed according to a program stored in the ROM or the like of the control unit of the search client personal computer PCb. This process is executed in a multitasking manner in parallel with other various processes of the control unit.

図６に示すフローチャートは、前記秘匿検索情報送信プログラムＡＰ２が起動した場合に開始される。
図６のＳＴ２０１において、検索画像３（図４参照）を表示する。そして、ＳＴ２０２に移る。
ＳＴ２０２において、検索情報入力部３ａに検索情報ｔの入力があったか否かを判別する。イエス（Ｙ）の場合はＳＴ２０３に移り、ノー（Ｎ）の場合はＳＴ２０４に移る。
ＳＴ２０３において、検索情報ｔの入力に応じて検索画像３を更新する。そして、ＳＴ２０２に戻る。
ＳＴ２０４において、検索開始ボタン３ｂが入力されたか否かを判別する。ノー（Ｎ）の場合はＳＴ２０５に移り、イエス（Ｙ）の場合はＳＴ２０６に移る。 The flowchart shown in FIG. 6 is started when the secret search information transmission program AP2 is activated.
In ST201 of FIG. 6, the search image 3 (see FIG. 4) is displayed. Then, the process proceeds to ST202.
In ST202, it is determined whether or not search information input unit 3a has input search information t. If yes (Y), the process proceeds to ST203, and, if no (N), the process proceeds to ST204.
In ST203, the search image 3 is updated according to the input of the search information t. Then, the process returns to ST202.
In ST204, it is determined whether or not the search start button 3b has been input. If no (N), the process moves on to ST205, and if yes (Y), the process moves on to ST206.

ＳＴ２０５において、秘匿検索情報送信プログラムＡＰ２を終了する入力がされたか否かを判別する。ノー（Ｎ）の場合はＳＴ２０２に戻り、イエス（Ｙ）の場合は前記秘匿検索情報送信処理を終了する。
ＳＴ２０６において、検索情報入力部３ａに検索情報ｔが表示されているか否かを判別する。イエス（Ｙ）の場合はＳＴ２０７に移り、ノー（Ｎ）の場合はＳＴ２０２に戻る。
ＳＴ２０７において、検索情報ｔから検索部分情報ｔ_ｑ１〜ｔ_ｑｍを抽出する。そして、ＳＴ２０８に移る。
ＳＴ２０８において、検索部分情報ｔ_ｑ１〜ｔ_ｑｍと、ハッシュ関数Ｈとにより、検索集合Ｂを演算して記憶する（Ｂ＝（ｂ_１，ｂ_２，…，ｂ_ｍ）、Ｈ（ｔ_ｑ１）＝ｂ_１，Ｈ（ｔ_ｑ２）＝ｂ_２，…，Ｈ（ｔ_ｑｎ）＝ｂ_ｍ）。そして、ＳＴ２０９に移る。 In ST205, it is determined whether or not an input for terminating the confidential search information transmission program AP2 has been made. If no (N), the process returns to ST202, and if yes (Y), the confidential search information transmission process is terminated.
In ST206, it is determined whether or not the search information t is displayed in the search information input unit 3a. If yes (Y), the process proceeds to ST207, and, if no (N), the process returns to ST202.
In ST207, and it extracts a search portion information _t q1 _{~t qm} from the search information t. Then, the process proceeds to ST208.
In ST208, the search part information _{_t} q1 _{~t qm,} by the hash function H, and calculates and stores the retrieval data set _{_{B (B = (b 1,}} b 2, ..., b m), H (t q1) = b ₁ , H (t _q2 ) = b ₂ ,..., H (t _qn ) = b _m ). Then, the process proceeds to ST209.

ＳＴ２０９において、検索集合Ｂの検索数値ｂ_１〜ｂ_ｍのうちのＬ種類の検索数値を除いて、（ｍ−Ｌ）種類の検索数値ｂ_１〜ｂ_ｍ−Ｌを要素とする検索部分集合Ｂ^＊を演算する（Ｂ^＊＝（ｂ_１，ｂ_２，…，ｂ_ｍ−Ｌ））。そして、ＳＴ２１０に移る。
ＳＴ２１０において、秘匿検索情報である検索部分集合Ｂ^＊をデータベースサーバＤＳに送信する。そして、ＳＴ２１１に移る。
ＳＴ２１１において、データベースサーバＤＳから送信された各秘匿類似情報（解候補集合Ｒ_Ａ、Ｒ_Ａ＝（Ｒ_Ａ１，Ｒ_Ａ２，…，Ｒ_ＡＭ））から対応する各類似情報ｔ′を復元する秘匿類似情報復元処理（後述する図８のフローチャート参照）を実行する。そして、ＳＴ２０２に戻る。 In ST209, except for L types of search figures in the search numerical _b 1 ~b _m search set B, (m-L) types of search numerical _b 1 _{~b m-L} and the element searching the subset B ^* Is calculated (B ^* = (b ₁ , b ₂ ,..., B _m−L )). Then, the process proceeds to ST210.
In ST210, search subset B ^* which is confidential search information is transmitted to database server DS. Then, the process proceeds to ST211.
In ST211, the secret similarity that restores the corresponding similar information t ′ from each secret similar information (solution candidate set R _A , R _A = (R _A1 , R _A2 ,..., R _AM )) transmitted from the database server DS. Information restoration processing (see the flowchart of FIG. 8 described later) is executed. Then, the process returns to ST202.

（実施例１の秘匿類似情報検索プログラムＡＰ４の秘匿類似情報検索処理のフローチャートの説明）
図７は実施例１の秘匿類似情報検索プログラムの秘匿類似情報検索処理のフローチャートである。
図７のフローチャートの各ＳＴ（ステップ）の処理は、前記データベースサーバＤＳの制御部のＲＯＭ等に記憶されたプログラムに従って行われる。また、この処理は前記制御部の他の各種処理と並行してマルチタスクで実行される。 (Explanation of Flowchart of Secret Similarity Information Retrieval Process of Secret Similarity Information Search Program AP4 of Example 1)
FIG. 7 is a flowchart of the secret similar information search process of the secret similar information search program according to the first embodiment.
The processing of each ST (step) in the flowchart of FIG. 7 is performed according to a program stored in the ROM or the like of the control unit of the database server DS. This process is executed in a multitasking manner in parallel with other various processes of the control unit.

図７に示すフローチャートは、前記秘匿類似情報検索プログラムＡＰ４が起動した場合に開始される。
図７のＳＴ３０１において、秘匿登録情報である多項式点集合Ｒを受信したか否かを判別する。イエス（Ｙ）の場合はＳＴ３０２に移り、ノー（Ｎ）の場合はＳＴ３０３に移る。
ＳＴ３０２において、受信した多項式点集合Ｒを秘匿登録情報Ｒ_Ｎとして記憶する。そして、ＳＴ３０１に戻る。
ＳＴ３０３において、秘匿検索情報である検索部分集合Ｂ^＊を受信したか否かを判別する。イエス（Ｙ）の場合はＳＴ３０４に移り、ノー（Ｎ）の場合はＳＴ３０１に戻る。 The flowchart shown in FIG. 7 is started when the secret similar information search program AP4 is activated.
In ST301 of FIG. 7, it is determined whether or not a polynomial point set R that is confidential registration information has been received. If yes (Y), the process proceeds to ST302, and, if no (N), the process proceeds to ST303.
In ST 302, it stores the polynomial point set R received as confidential registration information _{R N.} Then, the process returns to ST301.
In ST303, it is determined whether or not search subset B ^* , which is confidential search information, has been received. If yes (Y), the process proceeds to ST304, and, if no (N), the process returns to ST301.

ＳＴ３０４において、以下の（１）〜（３）の処理を実行し、ＳＴ３０５に移る。
（１）変数ｉ，ｊ，ｘ_ｉ，ｙ_ｉに、１，１，０，０をセットする（ｉ＝１，ｊ＝１，ｘ_ｉ＝０，ｙ_ｉ＝０）。
（２）各多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊にφをセットする。すなわち、各多項式点部分集合Ｑ_１ ^＊〜Ｑ_Ｎ ^＊を空集合とする（Ｑ_１ ^＊←φ，Ｑ_２ ^＊←φ，…，Ｑ_Ｎ ^＊←φ）。
（３）解候補集合Ｒ_Ａにφをセットする。すなわち、解候補集合Ｒ_Ａを空集合とする（Ｒ_Ａ←φ）。 In ST304, the following processes (1) to (3) are executed, and the process proceeds to ST305.
(1) Set 1, 1, 0, 0 to variables i, j, x _i , y _i (i = 1, j = 1, x _i = 0, y _i = 0).
(2) Set φ to each polynomial point subset Q ₁ ^{* to} Q _N ^* . That is, each polynomial point subset Q ₁ ^{* to} Q _N ^* is an empty set (Q ₁ ^* ← φ, Q ₂ ^* ← φ,..., Q _N ^* ← φ).
(3) Set φ to the solution candidate set _RA . That is, the solution candidate set R _A is an empty set (R _A ← φ).

ＳＴ３０５において、以下の（１），（２）の処理を実行し、ＳＴ３０６に移る。
（１）ｊ番目の秘匿登録情報Ｒ_ｊについて、ｉ番目の検索数値ｂ_ｉの射影を演算する（式（４）′参照）。すなわち、秘匿検索情報Ｂ^＊に含まれるｉ番目の検索数値ｂ_ｉが、ｊ番目の秘匿登録情報Ｒ_ｊに含まれる数値ａ_１〜ａ_ｒと同値である場合には、点（ｘ_ｉ，ｙ_ｉ）に、同値の数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））をセットし、ｂ_ｉ≠ａ_１，ａ_２，…，ａ_ｒとなる場合には、点（ｘ_ｉ，ｙ_ｉ）に、点（ｏ，ｏ）をセットする（（ｘ_ｉ，ｙ_ｉ）＝（ｏ，ｏ）＝φ）。
（２）点（ｘ_ｉ，ｙ_ｉ）を、ｊ番目の多項式点部分集合Ｑ_ｊ ^＊の要素とする（Ｑ_ｊ ^＊←Ｑ_ｊ ^＊∪（ｘ_ｉ，ｙ_ｉ））。 In ST305, the following processes (1) and (2) are executed, and the process proceeds to ST306.
(1) The projection of the i-th search numerical value b _i is calculated for the j-th secret registration information R _j (see Expression (4) ′). That is, when the i-th search numerical value b _i included in the confidential search information B ^* is the same value as the numerical values a _{1 to} a _r included in the j-th confidential registration information R _j , the point (x _i , y _i ), polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )), (a _{n + 1} , f ′ (a _{n + 1} )) corresponding to the same number a _{1 to} a _r ˜ ( _ar , f ′ ( _ar )) are set, and when b _i ≠ a ₁ , a ₂ ,..., A _r , the point (x _i , y _i ) is changed to the point (o, o ) Is set ((x _i , y _i ) = (o, o) = φ).
(2) Let the point (x _i , y _i ) be an element of the j-th polynomial point subset Q _j ^* (Q _j ^* ← Q _j ^* ∪ (x _i , y _i )).

ＳＴ３０６において、変数ｉが秘匿検索情報Ｂ^＊に含まれる点の個数である（ｍ−Ｌ）になったか否かを判別する。ノー（Ｎ）の場合はＳＴ３０７に移り、イエス（Ｙ）の場合はＳＴ３０８に移る。
ＳＴ３０７において、変数ｉに＋１を加算する（ｉ＝ｉ＋１）。そして、ＳＴ３０５に戻る。
ＳＴ３０８において、ｊ番目の多項式点部分集合Ｑ_ｊ ^＊の点が（ｃ−Ｌ）個以上であるか否かを判別する（‖Ｑ_ｊ ^＊‖≧ｃ−Ｌ）。イエス（Ｙ）の場合はＳＴ３０９に移り、ノー（Ｎ）の場合はＳＴ３１０に移る。
ＳＴ３０９において、ｊ番目の秘匿登録情報Ｒ_ｊを、解候補集合Ｒ_Ａの要素とする（Ｒ_Ａ←Ｒ_Ａ∪Ｒ_ｊ）。そして、ＳＴ３１０に移る。 In ST306, it is determined whether or not the variable i has reached (m−L), which is the number of points included in the confidential search information B ^* . If no (N), the process proceeds to ST307, and if yes (Y), the process proceeds to ST308.
In ST307, +1 is added to the variable i (i = i + 1). Then, the process returns to ST305.
In ST308, it is determined whether or not there are (c−L) or more points in the j-th polynomial point subset Q _j ^* (‖Q _j ^* ‖ ≧ c−L). If yes (Y), the process proceeds to ST309, and, if no (N), the process proceeds to ST310.
In ST309, the j-th secret registration information R _j is set as an element of the solution candidate set _RA (R _A ← R _A ∪R _j ). Then, the process proceeds to ST310.

ＳＴ３１０において、変数ｊが秘匿登録情報Ｒ_１〜Ｒ_Ｎの個数であるＮになったか否かを判別する。ノー（Ｎ）の場合はＳＴ３１１に移り、イエス（Ｙ）の場合はＳＴ３１２に移る。
ＳＴ３１１において、次の（１），（２）の処理を実行し、ＳＴ３０５に戻る。
（１）変数ｉに１をセットする（ｉ＝１）。
（２）変数ｊに＋１を加算する（ｊ＝ｊ＋１）。
ＳＴ３１２において、秘匿類似情報である解候補集合Ｒ_Ａ（Ｒ_Ａ＝（Ｒ_Ａ１，Ｒ_Ａ２，…，Ｒ_ＡＭ），Ｍ≦Ｎ）を検索用クライアントパソコンＰＣｂに送信する。そして、ＳＴ３０１に戻る。 In ST 310, the variable j is determined whether it is N is the number of confidential registration information _R 1 to R _N. If no (N), the process moves on to ST311, and if yes (Y), the process moves on to ST312.
In ST311, the following processes (1) and (2) are executed, and the process returns to ST305.
(1) Set 1 to variable i (i = 1).
(2) Add +1 to the variable j (j = j + 1).
In ST312, the solution candidate set R _A (R _A = (R _A1 , R _A2 ,..., R _AM ), M ≦ N), which is confidential similar information, is transmitted to the search client personal computer PCb. Then, the process returns to ST301.

（実施例１の秘匿類似情報復元プログラムＡＰ３の秘匿類似情報復元処理のフローチャートの説明）
図８は実施例１の秘匿類似情報復元プログラムの秘匿類似情報復元処理のフローチャートであり、図６のＳＴ２１１のサブルーチンの説明図である。 (Explanation of Flowchart of Secret Similarity Information Restoration Process of Secret Similarity Information Restoration Program AP3 of Embodiment 1)
FIG. 8 is a flowchart of the secret similar information restoration process of the secret similar information restoration program according to the first embodiment, and is an explanatory diagram of the subroutine of ST211 in FIG.

図８のＳＴ４０１において、秘匿類似情報である解候補集合Ｒ_Ａを受信したか否かを判別する。イエス（Ｙ）の場合はＳＴ４０２に移り、ノー（Ｎ）の場合はＳＴ４０１を繰り返す。
ＳＴ４０２において、以下の（１）〜（３）の処理を実行し、ＳＴ４０３に移る。
（１）変数ｉ，ｊ，ｘ_ｉ，ｙ_ｉに、１，１，０，０をセットする（ｉ＝１，ｊ＝１，ｘ_ｉ＝０，ｙ_ｉ＝０）。
（２）各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍにφをセットする。すなわち、各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍを空集合とする（Ｑ_１←φ，Ｑ_２←φ，…，Ｑ_Ｍ←φ）。
（３）解集合Ｒ_Ｂにφをセットする。すなわち、解集合Ｒ_Ｂを空集合とする（Ｒ_Ｂ←φ）。 In ST401 of FIG. 8, it is determined whether or not a solution candidate set _RA that is confidential similar information is received. If yes (Y), the process transfers to ST402, and, if no (N), ST401 is repeated.
In ST402, the following processes (1) to (3) are executed, and the process proceeds to ST403.
(1) Set 1, 1, 0, 0 to variables i, j, x _i , y _i (i = 1, j = 1, x _i = 0, y _i = 0).
(2) Set φ to each candidate polynomial point subset Q _{1 to} Q _M. That is, each candidate polynomial point subset Q _{1 to} Q _M is an empty set (Q ₁ ← φ, Q ₂ ← φ,..., Q _M ← φ).
(3) sets φ to disassembly _{R B.} That is, the solution set R _B is an empty set (R _B ← φ).

ＳＴ４０３において、以下の（１），（２）の処理を実行し、ＳＴ４０４に移る。
（１）ｊ番目の候補多項式点集合Ｒ_Ａｊについて、ｉ番目の検索数値ｂ_ｉの射影を演算する（式（４）参照）。すなわち、検索情報Ｂに含まれるｉ番目の検索数値ｂ_ｉが、ｊ番目の候補多項式点集合Ｒ_Ａｊに含まれる数値ａ_１〜ａ_ｒと同値である場合には、点（ｘ_ｉ，ｙ_ｉ）に、同値の数値ａ_１〜ａ_ｒに対応する多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ）），（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））をセットし、ｂ_ｉ≠ａ_１，ａ_２，…，ａ_ｒとなる場合には、点（ｘ_ｉ，ｙ_ｉ）に、点（ｏ，ｏ）をセットする（（ｘ_ｉ，ｙ_ｉ）＝（ｏ，ｏ）＝φ）。
（２）点（ｘ_ｉ，ｙ_ｉ）を、ｊ番目の候補多項式点部分集合Ｑ_ｊの要素とする（Ｑ_ｊ←Ｑ_ｊ∪（ｘ_ｉ，ｙ_ｉ））。 In ST403, the following processes (1) and (2) are executed, and the process proceeds to ST404.
(1) For the j-th candidate polynomial point set R _Aj , the projection of the i-th search numerical value b _i is calculated (see Expression (4)). That is, when the i-th search numerical value b _i included in the search information B is the same value as the numerical values a _{1 to} a _r included in the j-th candidate polynomial point set R _Aj , the point (x _i , y _i ), Polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )), (a _{n + 1} , f ′ (a _{n + 1} )) corresponding to the numerical values a _{1 to} a _r of the same value If ( _ar , f ′ ( _ar )) is set and b _i ≠ a ₁ , a ₂ ,..., A _r , then the point (x _i , y _i ) is changed to the point (o, o) Is set ((x _i , y _i ) = (o, o) = φ).
(2) The point (x _i , y _i ) is set as an element of the j-th candidate polynomial point subset Q _j (Q _j ← Q _j ∪ (x _i , y _i )).

ＳＴ４０４において、変数ｉが検索情報Ｂに含まれる点の個数であるｍになったか否かを判別する。ノー（Ｎ）の場合はＳＴ４０５に移り、イエス（Ｙ）の場合はＳＴ４０６に移る。
ＳＴ４０５において、変数ｉに＋１を加算する（ｉ＝ｉ＋１）。そして、ＳＴ４０３に戻る。
ＳＴ４０６において、ｊ番目の候補多項式点部分集合Ｑ_ｊの点がｃ個以上であるか否かを判別する（‖Ｑ_ｊ‖≧ｃ）。イエス（Ｙ）の場合はＳＴ４０７に移り、ノー（Ｎ）の場合はＳＴ４０８に移る。
ＳＴ４０７において、ｊ番目の候補多項式点部分集合Ｑ_ｊを、解集合Ｒ_Ｂの要素とする（Ｒ_Ｂ←Ｒ_Ｂ∪Ｒ_Ａｊ）。そして、ＳＴ４０８に移る。 In ST404, it is determined whether or not the variable i has reached m which is the number of points included in the search information B. If no (N), the process moves to ST405, and if yes (Y), the process moves to ST406.
In ST405, +1 is added to the variable i (i = i + 1). Then, the process returns to ST403.
In ST406, it is determined whether or not there are c or more points in the j-th candidate polynomial point subset Q _j (‖Q _j ‖ ≧ c). If yes (Y), the process proceeds to ST407, and, if no (N), the process proceeds to ST408.
In ST407, j-th candidate polynomial point subset Q _j is set as an element of solution set R _B (R _B ← R _B ∪R _Aj ). Then, the process proceeds to ST408.

ＳＴ４０８において、変数ｊが秘匿登録情報Ｒ_１〜Ｒ_Ｎの個数であるＮになったか否かを判別する。ノー（Ｎ）の場合はＳＴ４０９に移り、イエス（Ｙ）の場合はＳＴ４１０に移る。
ＳＴ４０９において、次の（１），（２）の処理を実行し、ＳＴ４０３に戻る。
（１）変数ｉに１をセットする（ｉ＝１）。
（２）変数ｊに＋１を加算する（ｊ＝ｊ＋１）。
ＳＴ４１０において、解集合Ｒ_Ｂに含まれる各候補多項式点部分集合Ｑ_ｊ（‖Ｑｊ‖≧ｃ））から、ＢＭアルゴリズムによって前記各多項式ｆ（ｘ）を復元し（係数ｓ_０〜ｓ_ｋ−１を演算し）、前記逆関数Ｔ^−１によって各類似情報ｔ′（ｔ′＝ｓ）を復元する。そして、ＳＴ４１１に移る。
ＳＴ４１１において、復元が成功し、且つ、編集距離ｄが最大値ｄ_ｍａｘ以下となる各類似情報ｔ′を類似情報出力部３ｂに一覧表示する（図４参照）。 In ST408, it is determined whether or not the variable j has reached _N , which is the number of the secret registration information R _{1 to} R _N. If no (N), the process moves to ST409, and if yes (Y), the process moves to ST410.
In ST409, the following processes (1) and (2) are executed, and the process returns to ST403.
(1) Set 1 to variable i (i = 1).
(2) Add +1 to the variable j (j = j + 1).
In ST410, the respective polynomials f (x) are restored by the BM algorithm from the respective candidate polynomial point subsets Q _j (‖Qj‖ ≧ c) included in the solution set R _B (coefficients s _{0 to} s _k−1). And the similar information t ′ (t ′ = s) is restored by the inverse function T ⁻¹ . Then, the process proceeds to ST411.
In ST411, the similar information t ′ whose restoration is successful and the editing distance d is equal to or less than the maximum value d _max is displayed in a list on the similar information output unit 3b (see FIG. 4).

（実施例１の作用）
前記構成を備えた実施例１の前記類似情報検索システムＳでは、前記登録用クライアントパソコンＰＣａにおいて、前記登録画像２の登録情報入力部２ａに前記登録情報ｓの入力があり、且つ、前記登録ボタン２ｂが入力された場合に、前記登録情報ｓが秘匿化された前記秘匿登録情報（多項式点集合Ｒ）を演算して、前記データベースサーバＤＳに送信する前記秘匿登録情報送信処理が実行される（図３、図５のＳＴ１０１〜ＳＴ１１８参照）。
また、実施例１の前記類似情報検索システムＳでは、前記検索用クライアントパソコンＰＣｂにおいて、前記検索画像３の検索情報入力部３ａに前記検索情報ｔの入力があり、且つ、前記検索開始ボタン３ｂが入力された場合に、前記検索情報ｔが秘匿化された前記秘匿検索情報（検索部分集合Ｂ^＊）を演算して、前記データベースサーバＤＳに送信する前記秘匿検索情報送信処理が実行される（図４、図６のＳＴ２０１〜ＳＴ２１０参照）。 (Operation of Example 1)
In the similar information search system S according to the first embodiment having the above-described configuration, the registration information input unit 2a of the registration image 2 has the input of the registration information s and the registration button in the registration client personal computer PCa. When 2b is input, the secret registration information transmission process is performed in which the secret registration information (polynomial point set R) in which the registration information s is concealed is calculated and transmitted to the database server DS ( (Refer to ST101 to ST118 in FIGS. 3 and 5).
In the similar information search system S according to the first embodiment, the search client personal computer PCb includes the search information input unit 3a of the search image 3 and the search start button 3b. When the search information t is input, the secret search information transmission process is executed in which the secret search information (search subset B ^* ) in which the search information t is concealed is calculated and transmitted to the database server DS (see FIG. 4, see ST201 to ST210 in FIG.

また、実施例１の前記類似情報検索システムＳでは、前記データベースサーバＤＳにおいて、前記秘匿登録情報（多項式点集合Ｒ）を受信した場合には、前記秘匿登録情報Ｒ_Ｎ（Ｎ＝１，２，…）として記憶する（図７のＳＴ３０１，ＳＴ３０２参照）。また、前記秘匿検索情報（検索部分集合Ｂ^＊）を受信した場合には、前記検索情報ｔおよび前記類似情報ｔ′を秘匿化した状態で検索する前記秘匿類似情報検索処理が実行される（図７のＳＴ３０３〜ＳＴ３１２参照）。
そして、実施例１の前記類似情報検索システムＳでは、前記検索用クライアントパソコンＰＣｂにおいて、検索検索結果としての前記秘匿類似情報（解候補集合Ｒ_Ａ）を受信し、前記類似情報ｔ′を復元して前記類似情報出力部３ｂに表示する前記秘匿類似情報復元処理が実行される（図６のＳＴ２１１、図８のＳＴ４０１〜ＳＴ４１１参照）。 In the similar information search system S of the first embodiment, when the secret registration information (polynomial point set R) is received by the database server DS, the secret registration information R _N (N = 1, 2, ... (See ST301 and ST302 in FIG. 7). Further, when the secret search information (search subset B ^* ) is received, the secret similar information search process for searching the search information t and the similar information t ′ in a concealed state is executed (FIG. 7 ST303 to ST312).
In the similar information search system S of the first embodiment, the search client personal computer PCb receives the secret similar information (solution candidate set R _A ) as a search search result, and restores the similar information t ′. Then, the secret similar information restoration process displayed on the similar information output unit 3b is executed (see ST211 in FIG. 6 and ST401 to ST411 in FIG. 8).

したがって、実施例１の前記類似情報検索システムＳでは、前記インターネット１を介して、前記各クライアントパソコンＰＣａ，ＰＣｂと、前記データベースサーバＤＳとの間で送受信される各情報Ｒ，Ｂ^＊，Ｒ_Ａは、前記登録情報ｓ、前記検索情報ｔ、前記類似情報ｔ′が秘匿化された情報である。このため、前記各クライアントパソコンＰＣａ，ＰＣｂおよび前記データベースサーバＤＳのユーザ以外の第三者から前記各情報ｓ，ｔ，ｔ′を秘匿することができる。
また、前記秘匿類似情報検索処理では、前記登録情報ｓ、前記検索情報ｔ、前記類似情報ｔ′を復元することなく、秘匿化したままの状態で実行される。すなわち、前記秘匿類似情報検索処理では、受信した前記秘匿検索情報（検索部分集合Ｂ^＊）に含まれる検索数値ｂ_１〜ｂ_ｍ−Ｌと、記憶した前記各秘匿登録情報Ｒ_１〜Ｒ_Ｎに含まれる数値ａ_１〜ａ_ｒとが同値であるか否かを判別するため（図７のＳＴ３０５参照）、復元された各情報ｓ，ｔ，ｔ′を特定できないようになっている。 Therefore, in the similar information retrieval system S of the first embodiment, the information R, B ^* , R _A transmitted / received between the client personal computers PCa, PCb and the database server DS via the Internet 1. Is information in which the registration information s, the search information t, and the similar information t ′ are concealed. Therefore, the information s, t, t ′ can be kept secret from third parties other than the users of the client personal computers PCa, PCb and the database server DS.
Further, the secret similar information search process is executed in a concealed state without restoring the registration information s, the search information t, and the similar information t ′. That is, the in concealment similar information retrieval processing, a retrieval numerical b ₁ ~b _m-L included in the received secure search information (search subset B ^*), the each confidential registration information R ₁ to R _N stored In order to determine whether or not the contained numerical values a _{1 to} a _r are the same value (see ST305 in FIG. 7), the restored information s, t, and t ′ cannot be specified.

なお、前記データベースサーバＤＳ上では、記憶した前記秘匿登録情報（多項式点集合Ｒ_１〜Ｒ_Ｎ）と、受信した前記秘匿検索情報（検索部分集合Ｂ^＊）とを、前記登録情報ｓと、前記検索情報ｔとに復元できないようになっている。すなわち、前記多項式点集合Ｒには、前記チャフの一例としての前記擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））が含まれている。このため、前記登録集合Ａを知らない限り、前記（ｎ，ｋ）符号の一例としての登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））を完全に特定できない。なお、前記登録集合Ａに十分近い前記検索集合Ｂを知得した場合には（Ｂ≒Ａ）、前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））の一部を特定でき、｛（ｎ＋ｋ）／２｝個以上の点を特定できれば、前記多項式点集合Ｒから前記登録情報ｓを復元できる。よって、前記検索集合Ｂを知得していない限り、前記多項式点集合Ｒから前記登録情報ｓを復元できない。また、前記検索部分集合Ｂ^＊は、前記検索集合ＢからＬ種類の検索数値が除かれている、例えば、Ｂ＝（ｂ_１，ｂ_２，…，ｂ_ｍ），Ｂ^＊＝（ｂ_１，ｂ_２，…，ｂ_ｍ−Ｌ）とした場合、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍが除かれている（図６のＳＴ２０９参照）。このため、前記検索集合Ｂを知らない限り、前記検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍを特定できず、前記検索部分集合Ｂ^＊から前記検索情報ｔを復元できない。 On the database server DS, the stored secret registration information (polynomial point set R _{1 to} R _N ) and the received secret search information (search subset B ^* ) are stored in the registration information s, The search information t cannot be restored. In other words, the polynomial point set R includes the pseudo-polynomial points (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )) as an example of the chaff. For this reason, unless the registration set A is known, the registration polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) as an example of the (n, k) code are completely set. It can not be identified. When the search set B sufficiently close to the registration set A is acquired (B≈A), the registration polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n ) ) Can be specified, and {(n + k) / 2} or more points can be specified, the registration information s can be restored from the polynomial point set R. Therefore, unless the search set B is known, the registration information s cannot be restored from the polynomial point set R. The search subset B ^* is obtained by removing L types of search numerical values from the search set B. For example, B = (b ₁ , b ₂ ,..., B _m ), B ^* = (b ₁ , b ₂ ,..., b _m−L ), L types of search numerical values b _{m−L + 1 to} b _m are excluded (see ST209 in FIG. 6). Therefore, unless the search set B is known, the search numerical values b _{m−L + 1 to} b _m cannot be specified, and the search information t cannot be restored from the search subset B ^* .

したがって、実施例１の前記類似情報検索システムＳでは、前記データベースサーバＤＳのユーザである管理者からも前記各情報ｓ，ｔ，ｔ′を秘匿することができる。
この結果、実施例１の前記類似情報検索システムＳは、前記第三者および前記管理者から、前記各情報ｓ，ｔ，ｔ′が秘匿された状態で、前記登録情報ｓの所有者である登録者と、前記検索情報ｔの知得者である検索者と、前記データベースサーバＤＳの管理業務を委託された外部委託業者としての前記管理者とを有する前記ＤＡＳモデルを構成できる。すなわち、前記登録用クライアントパソコンＰＣａのユーザである前記登録者と、前記検索用クライアントパソコンＰＣｂのユーザである前記検索者と、前記管理者とにより、前記ＤＡＳモデルを構成できる。 Therefore, in the similar information search system S of the first embodiment, the information s, t, and t ′ can be kept secret from an administrator who is a user of the database server DS.
As a result, the similar information search system S according to the first embodiment is the owner of the registration information s in a state where the information s, t, and t ′ are concealed from the third party and the administrator. The DAS model having a registrant, a searcher who is an acquirer of the search information t, and the manager as an outsourcer entrusted with management work of the database server DS can be configured. That is, the DAS model can be configured by the registrant who is the user of the registration client personal computer PCa, the searcher who is the user of the search client personal computer PCb, and the administrator.

また、前記構成を備えた実施例１の前記類似情報検索システムＳでは、前記登録情報ｓを秘匿化して登録する際に、まず、ｑ−ｇｒａｍの抽出処理により、前記登録情報ｓから前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}や前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎが抽出される（図５のＳＴ１０７，ＳＴ１０８参照）。次に、前記可逆関数Ｔや前記ハッシュ関数Ｈにより、前記多項式ｆ（ｘ）の係数ｓ_０〜ｓ_ｋ−１や前記登録集合（第１の集合）Ａ（Ａ＝（ａ_１，ａ_２，…，ａ_ｎ））が演算される（図５のＳＴ１０７，ＳＴ１０９参照）。そして、前記ファジーボールトにおける施錠処理と同様に、前記（ｎ，ｋ）符号の一例としての前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））と、前記チャフの一例としての前記擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））とを有する前記多項式点集合Ｒが演算される（図５のＳＴ１１０〜ＳＴ１１７）。したがって、前記抽出処理と、前記各関数Ｔ，Ｈと、前記施錠処理とを組み合わせることにより、前記登録者ごとの鍵を使用しなくても、前記登録情報ｓを秘匿化できる。 In the similar information search system S according to the first embodiment having the above-described configuration, when the registration information s is concealed and registered, first, the partial characters are extracted from the registration information s by q-gram extraction processing. The columns s _{k (1) to} s _{k (} _k−1) and the registered partial information s _{q1 to} s _qn are extracted (see ST107 and ST108 in FIG. 5). Next, by the reversible function T and the hash function H, the coefficients s _{0 to} s _{k−1 of the} polynomial f (x) and the registered set (first set) A (A = (a ₁ , a ₂ , _{..., a} n)) is calculated (see ST 107, ST 109 in FIG. 5). Then, similarly to the locking process in the fuzzy vault, the registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) as an example of the (n, k) code, The polynomial point set R having the pseudo-polynomial points (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )) as an example of the chaff is calculated (ST110 in FIG. 5). ~ ST117). Therefore, by combining the extraction process, the functions T and H, and the locking process, the registration information s can be concealed without using a key for each registrant.

また、前記検索情報ｔの検索を行う場合、まず、前記登録集合Ａと同様に、前記抽出処理と、前記ハッシュ関数Ｈと、前記施錠処理とにより、前記検索集合（第２の集合）Ｂ（Ｂ＝（ｂ_１，ｂ_２，…，ｂ_ｍ））が演算される（図６のＳＴ２０７，ＳＴ２０８参照）。そして、ｍ種類の検索数値ｂ_１〜ｂ_ｍからＬ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍが除かれた前記検索部分集合Ｂ^＊が演算される（図６のＳＴ２０９参照）。したがって、前記抽出処理と、前記ハッシュ関数Ｈと、前記施錠処理とを組み合わせることにより、前記検索者ごとの鍵を使用しなくても、前記検索情報ｔを秘匿化できる。
この結果、実施例１の前記類似情報検索システムＳでは、前記インターネット１を介して前記データベースサーバＤＳが送受信する全ての情報Ｒ，Ｂ^＊，Ｒ_Ａ（Ｒ_Ａ＝（Ｒ_Ａ１，Ｒ_Ａ２，…，Ｒ_ＡＭ））が、前記鍵を使用せずに演算できる。なお、前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭは、受信した前記多項式点集合Ｒを記憶した前記多項式点集合Ｒ_１〜Ｒ_Ｎの全部または一部である。したがって、従来公知の暗号アルゴリズムで使用される鍵を、ユーザ（登録者、検索者）ごとに設定しなくても、前記第三者および前記管理者から前記各情報ｓ，ｔ，ｔ′を秘匿できる。よって、前記暗号化データベース（図９参照）における鍵管理の安全性（厳格性）やそれに伴う管理コスト等の問題を解決することができる。 When searching for the search information t, first, similarly to the registered set A, the search set (second set) B () is obtained by the extraction process, the hash function H, and the locking process. B = (b ₁ , b ₂ ,..., B _m )) is calculated (see ST207 and ST208 in FIG. 6). Then, m type L type from the search numerical _b 1 ~b _m of search numerical _{b m-L} + 1 wherein ~b _m is removed searched subset ^{B *} are calculated (see ST209 of FIG. 6). Therefore, by combining the extraction process, the hash function H, and the locking process, the search information t can be concealed without using a key for each searcher.
As a result, in the similar information retrieval system S of the first embodiment, all information R, B ^* , R _A (R _A = (R _A1 , R _A2 ,...) Transmitted and received by the database server DS via the Internet 1. , R _AM )) can be calculated without using the key. The candidate polynomial point sets R _{A1 to} R _AM are all or part of the polynomial point sets R _{1 to} R _N storing the received polynomial point set R. Accordingly, the information s, t, t ′ is concealed from the third party and the administrator without setting a key used in a conventionally known encryption algorithm for each user (registrant, searcher). it can. Therefore, it is possible to solve problems such as security (strictness) of key management in the encrypted database (see FIG. 9) and management costs associated therewith.

（各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍから各類似情報ｔ′への復元について）
ここで、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍが除かれた前記検索部分集合Ｂ^＊によって検索された前記解候補集合Ｒ_Ａから前記類似情報ｔ′が確実に復元できるか否かが問題となる。すなわち、前記解候補集合Ｒ_Ａの各候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭから演算された各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍに対応する各類似情報ｔ′が演算できるか否かが問題となる。 (Restoration from each candidate polynomial point subset Q _{1 to} Q _M to each similar information t ′)
Here, whether or not the similarity information t ′ can be reliably restored from the solution candidate set _RA searched by the search subset B ^* from which L types of search numerical values b _{m−L + 1 to} b _m are removed. It becomes a problem. That is, whether the similar information t ′ corresponding to each candidate polynomial point subset Q _{1 to} Q _M calculated from each candidate polynomial point set R _{A1 to} R _{AM of the} solution candidate set R _A can be calculated. It becomes.

ここで、前記ｑ−ｇｒａｍを用いた類似文字列検索手法において、長さ（文字数）ｎの文字列と、長さｍの文字列とは、少なくとも｛ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１｝個の前記ｑ−ｇｒａｍを共通に持つことが保証されている。このため、長さ（文字数）‖ｓ‖の文字列（登録情報ｓ）と、長さ‖ｔ‖の文字列（検索情報ｔ）とが、少なくとも｛ｍａｘ（‖ｓ‖，‖ｔ‖）−（ｄ−１）ｑ−１｝個の前記ｑ−ｇｒａｍを共通に持つことから、以下の式（６）が成立する。
ｍａｘ（‖ｓ‖，‖ｔ‖）−（ｄ−１）ｑ−１
＝ｍａｘ（ｎ＋ｑ−１，ｍ＋ｑ−１）−（ｄ−１）ｑ−１
≧ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１ …（６）
すなわち、前記ハッシュ関数Ｈによって前記登録情報のｑ−ｇｒａｍとしての前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎに対応付けられた前記登録数値ａ_１〜ａ_ｎと、前記検索情報のｑ−ｇｒａｍとしての前記検索部分情報ｔ_ｑ１〜ｔ_ｑｍに対応付けられた前記検索数値ｂ_１〜ｂ_ｍとの共有数が｛ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１｝個以上であることが明らかである。 Here, in the similar character string search method using the q-gram, a character string having a length (number of characters) n and a character string having a length m are at least {max (n, m) − (d−1). ) It is guaranteed to have q-1} q-grams in common. For this reason, a character string (registration information s) having a length (number of characters) ‖s‖ and a character string (search information t) having a length ‖t‖ are at least {max (‖s‖, ‖t‖) − (D-1) Since q-1} q-grams are commonly used, the following equation (6) is established.
max (‖s‖, ‖t‖)-(d-1) q-1
= Max (n + q-1, m + q-1)-(d-1) q-1.
≧ max (n, m) − (d−1) q−1 (6)
In other words, said registration partial information _s q1 the registration numerical values associated with _{~s qn} _a 1 ~a _n as q-gram of the registration information by the hash function H, wherein as q-gram of the retrieval information It is clear that the number of shares with the search numerical values b _{1 to} b _m associated with the search partial information t _{q1 to} t _qm is {max (n, m) − (d−1) q−1} or more. It is.

また、前記ＲＳ符号を用いたファジーボールトの開錠処理において、前記ボールトＲから前記（ｎ，ｋ）符号の｛（ｎ＋ｋ）／２｝個以上の点が判明すれば前記多項式が復元できることが保証されている。
よって、前記各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍから各類似情報ｔ′が復元できるか否かを判別するための多項式点の個数の下限値である前記自然数ｃについて、以下の式（７）が常に成立すると仮定すれば、前記（ｎ，ｋ）符号の｛（ｎ＋ｋ）／２｝個以上の点から前記多項式ｆ（ｘ）の係数ｓ_０〜ｓ_ｋ−１を常に演算できる（図８のＳＴ４１０参照）。
ｃ≧（ｎ＋ｋ）／２ …（７） Also, in the fuzzy vault unlocking process using the RS code, it is guaranteed that the polynomial can be restored if the (n, k) code {(n + k) / 2} or more points are found from the vault R. Has been.
Therefore, with respect to the natural number c which is the lower limit value of the number of polynomial points for determining whether or not each similar information t ′ can be restored from each of the candidate polynomial point subsets Q _{1 to} Q _M , the following equation (7 ) Always holds, the coefficients s _{0 to} s _k-1 of the polynomial f (x) can always be calculated from {(n + k) / 2} or more points of the (n, k) code (see FIG. 8 ST410).
c ≧ (n + k) / 2 (7)

ここで、前記自然数ｃ，ｋについて、前記式（２）′，（５）が成立し、且つ、ｎ≧ｍが成立すると仮定した場合、以下の式（８）が成立する。
‖Ｑ_ｊ‖≧ｃ（ｊ＝１，２，…，Ｍ）
≧（ｎ＋ｋ）／２
＝ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１
＝ｎ−（ｄ−１）ｑ−１
≧ｎ−（ｄ_ｍａｘ−１）ｑ−１
＝（ｎ＋ｋ）／２ …（８） Here, for the natural numbers c and k, when it is assumed that the expressions (2) ′ and (5) are satisfied and n ≧ m is satisfied, the following expression (8) is satisfied.
‖Q _j ‖ ≧ c (j = 1, 2,..., M)
≧ (n + k) / 2
= Max (n, m)-(d-1) q-1
= N- (d-1) q-1
≧ n− (d _max −1) q−1
= (N + k) / 2 (8)

また、前記自然数ｃ，ｋについて、前記式（２）′，（５）が成立し、且つ、ｎ＜ｍが成立すると仮定した場合には、以下の式（８）′が成立する。
‖Ｑ_ｊ‖≧ｃ
＝ｍａｘ（ｎ，ｍ）−（ｄ−１）ｑ−１
＝ｍ−（ｄ−１）ｑ−１
＞ｎ−（ｄ−１）ｑ−１
≧ｎ−（ｄ_ｍａｘ−１）ｑ−１
＝（ｎ＋ｋ）／２ …（８）′ Further, when it is assumed that the expressions (2) ′ and (5) are satisfied for the natural numbers c and k and n <m is satisfied, the following expression (8) ′ is satisfied.
‖Q _j ‖ ≧ c
= Max (n, m)-(d-1) q-1
= M- (d-1) q-1
> N- (d-1) q-1
≧ n− (d _max −1) q−1
= (N + k) / 2 (8) '

したがって、実施例１の前記類似情報検索システムＳでは、予め設定された前記自然数ｑ，ｎ，ｍ，ｄの値と、前記式（２）′，（５）とに基づいて、前記自然数ｃ，ｋの値が設定された場合、前記式（７）が常に成立する。このため、ｃ個以上の多項式点を有する前記各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍに、前記（ｎ，ｋ）符号の｛（ｎ＋ｋ）／２｝個以上の点が含まれていれば、前記開錠処理により、前記係数ｓ_０〜ｓ_ｋ−１を常に演算できる。すなわち、実施例１の前記類似情報検索システムＳは、前記自然数ｃ，ｋを、前記式（２）′，（５）に基づいて設定すれば、前記類似情報ｔ′を復元可能か否かを判別するフィルタリング処理（図７のＳＴ３０８、図８のＳＴ４０６参照）を適切に実行することができる。よって、前記類似情報ｔ′が前記検索情報ｔと完全一致する場合には必ず復元でき、前記類似情報ｔ′が前記検索情報ｔとの類似度が高いほど（編集距離ｄが小さいほど）復元できる可能性が高まり、前記類似情報ｔ′が前記検索情報ｔとの類似度が低いほど（編集距離ｄが大きいほど）復元できる可能性が低くなると共に、前記フィルタリング処理で除外される可能性が高くなる。 Therefore, in the similar information search system S of the first embodiment, the natural numbers c, n, m, d are set based on the preset values of the natural numbers q, n, m, d and the equations (2) ′, (5). When the value of k is set, the equation (7) is always established. Therefore, if each of the candidate polynomial point subsets Q _{1 to} Q _M having c or more polynomial points includes {(n + k) / 2} or more points of the (n, k) code. The coefficients s _{0 to} s _k-1 can always be calculated by the unlocking process. That is, the similar information search system S according to the first embodiment determines whether or not the similar information t ′ can be restored if the natural numbers c and k are set based on the equations (2) ′ and (5). The filtering process to discriminate (see ST308 in FIG. 7 and ST406 in FIG. 8) can be appropriately executed. Therefore, it can be restored whenever the similarity information t ′ completely matches the search information t, and the similarity information t ′ can be restored as the similarity with the search information t is higher (the edit distance d is smaller). The possibility increases and the possibility that the similar information t ′ can be restored as the similarity with the search information t is lower (the edit distance d is larger) and is more likely to be excluded by the filtering process. Become.

この結果、実施例１の前記類似情報検索システムＳでは、前記ｑ−ｇｒａｍを用いた類似文字列検索手法の技術のみでは実現できない情報の秘匿化の問題が解決されていると共に、前記ＲＳ符号を用いたファジーボールトの技術のみでは実現できない前記開錠処理のフィルタリング条件の設定についての問題が解決されている。 As a result, in the similar information search system S of the first embodiment, the problem of information concealment that cannot be realized only by the technique of the similar character string search method using the q-gram is solved, and the RS code is changed. The problem of setting the filtering conditions for the unlocking process, which cannot be realized only by the fuzzy vault technique used, has been solved.

（秘匿類似情報検索処理の効率について）
また、前記構成を備えた実施例１の前記類似情報検索システムＳでは、前記秘匿類似情報（解候補集合Ｒ_Ａ）を検索する際に、前記秘匿検索情報（検索部分集合Ｂ^＊）に含まれる検索数値ｂ_１〜ｂ_ｍ−Ｌと、前記各秘匿登録情報Ｒ_１〜Ｒ_Ｎに含まれる数値ａ_１〜ａ_ｒとを照合する。このため、暗号化された登録情報や検索情報を復号化してから照合を行う場合に比べ、効率的に前記秘匿類似情報（解候補集合Ｒ_Ａ）を検索できる。
また、実施例１の前記類似情報検索システムＳでは、前記検索数値ｂ_１〜ｂ_ｍ−Ｌと、前記数値ａ_１〜ａ_ｒとが昇順に並べられている。このため、前記照合により各候補多項式点部分集合Ｑ_１〜Ｑ_Ｍを演算する前記計算量が、前記ランダウの記号を用いてＯ（ｎ＋ｍ）時間となる。したがって、総当りで照合して、計算量がＯ（ｎｍ）になる場合に比べ、効率的に前記秘匿類似情報検索処理を実行できる。 (About the efficiency of confidential similar information search processing)
Further, in the similar information search system S of the first embodiment having the above-described configuration, when searching for the secret similar information (solution candidate set _RA ), it is included in the secret search information (search subset B ^* ). a search numerical _b 1 _{~b m-L,} collates the numerical _a 1 ~a _r included the each confidential registration information _R 1 to R _N. For this reason, compared with the case where collation is performed after decrypting the encrypted registration information and search information, the secret similar information (solution candidate set _RA ) can be searched efficiently.
In the similar information search system S according to the first embodiment, the search numerical values b _{1 to} b _m-L and the numerical values a _{1 to} a _r are arranged in ascending order. For this reason, the calculation amount for calculating each candidate polynomial point subset Q _{1 to} Q _M by the collation is O (n + m) time using the Landau symbol. Therefore, compared with the case where the calculation amount is O (nm) by collating with the brute force, the secret similar information search process can be executed efficiently.

（秘匿類似情報復元処理の効率について）
また、実施例１の前記類似情報検索システムＳでは、前記秘匿類似情報（解候補集合Ｒ_Ａ）の復元について、前記ＢＭアルゴリズムを用いた前記開錠処理が実行される（図８のＳＴ４１０参照）。前記ＢＭアルゴリズムを用いた前記開錠処理は、誤り訂正可能な点の数をｚ（ｚ＝（ｎ−ｋ）／２）とした場合に、前記計算量が前記ランダウの記号を用いてＯ（ｚ^２）時間となることが知られている（例えば、特開２００３−１６８９８３号公報等参照）。このため、前記Ｏ（ｚ^２）時間以上の開錠処理（例えば、計算量がＯ（ｚ^３）時間となるピーターソン（Perterson）法を用いた開錠処理）に比べ、効率的に前記秘匿類似情報復元処理を実行できる。 (About the efficiency of confidential similar information restoration processing)
Further, in the similar information search system S of the first embodiment, the unlocking process using the BM algorithm is executed for restoring the secret similar information (solution candidate set _RA ) (see ST410 in FIG. 8). . In the unlocking process using the BM algorithm, when the number of error-correctable points is z (z = (n−k) / 2), the calculation amount is O ( z ² ) is known to be time (see, for example, JP-A-2003-168983). For this reason, as compared with the unlocking process of O (z ² ) time or longer (for example, unlocking process using Peterson method in which the calculation amount is O (z ³ ) time), the concealment is efficiently performed. Similar information restoration processing can be executed.

（総当り攻撃に対する秘匿登録情報の安全性について）
また、前記構成を備えた実施例１の前記類似情報検索システムＳでは、例えば、前記第三者や前記管理人が、前記秘匿登録情報（多項式点集合Ｒ）から前記多項式ｆ（ｘ）を総当りで復元して、前記登録情報ｓの解読を試みる可能性がある。ここで、十分に小さい正の値の実数をμとし（μ＞０，μ≒０）、前記多項式ｆ（ｘ）の候補となる多項式の総数をＮ_ｆとした場合、前記多項式の総数Ｎ_ｆについて、少なくとも（１−μ）の確率で、以下の式（９）が成立することが知られている（例えば、非特許文献７参照）。
Ｎ_ｆ＝（μ／３）×ｐ^ｋ−ｎ×（ｒ／ｎ）^ｎ …（９）
よって、例えば、前記自然数ｒ，ｐ，ｎ，ｋについて、ｒ＝ｐ＝１０^４，ｎ＝２２，ｋ＝１４とし、前記実数μについて、μ≒２^−１８８とした場合、Ｎ_ｆ≒２^８６が成立する。すなわち、２^８６通りの多項式のうち、前記登録情報ｓに対応する１つの多項式ｆ（ｘ）を特定しなければならない。 (About the security of confidential registration information against brute force attacks)
In the similar information search system S according to the first embodiment having the above-described configuration, for example, the third party or the administrator totals the polynomial f (x) from the secret registration information (polynomial point set R). There is a possibility that the registration information s is tried to be decrypted by being restored. Here, when a sufficiently small real number of positive values is μ (μ> 0, μ≈0), and the total number of polynomials that are candidates for the polynomial f (x) is N _f , the total number N _{f of the} polynomials It is known that the following formula (9) is established with a probability of at least (1−μ) (for example, see Non-Patent Document 7).
N _f = (μ / 3) × p ^k−n × (r / n) ⁿ (9)
Therefore, for example, when r = p = 10 ⁴ , n = 22, k = 14 for the natural numbers r, p, n, and k and ^{μ≈2 −188} for the real number μ, N _f ≈2 ⁸⁶ Is established. That is, 2 ⁸⁶ kinds of polynomial, must identify the single polynomial f that corresponds to the registration information s (x).

また、実施例１の前記類似情報検索システムＳでは、前記自然数ｒの値が大きくなるに連れて、前記多項式点集合Ｒに含まれる前記擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））の総数が増えるため、前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））を特定される可能性が低くなる。また、前記自然数ｒの値が大きければ、前記多項式の総数Ｎ_ｆ自体が大きく、前記多項式ｆ（ｘ）として選択できる多項式の総数も大きくすることができる。この場合、前記秘匿登録情報（多項式点集合Ｒ）が解読される可能性がさらに低減され、安全性を高くすることができる。 In the similar information search system S of the first embodiment, as the value of the natural number r increases, the pseudo-polynomial points (a _{n + 1} , f ′ (a _{n + 1} )) to be included in the polynomial point set R are increased. Since the total number of (a _r , f ′ (a _r )) increases, the possibility of specifying the registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) is low. Become. Further, if the value of the natural number r is large, the total number N _{f of the} polynomial itself is large, and the total number of polynomials that can be selected as the polynomial f (x) can be increased. In this case, the possibility that the secret registration information (polynomial point set R) is decoded is further reduced, and the safety can be increased.

また、前記編集距離ｄについて、前記最大値ｄ_ｍａｘが大きくなるに連れて、検索される前記解候補集合Ｒ_Ａの候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭの個数が多くなる可能性が高くなる。すなわち、自然数Ｍの値が大きくなる可能性が高くなる。このため、類似情報検索処理としての利便性が高くなる。しかしながら、この場合、前記自然数ｋ（ｋ＝ｎ−２（ｄ_ｍａｘ−１）ｑ−２）の値が小さくなる（式（２）′参照）。このため、前記多項式ｆ（ｘ）の次数（ｋ−１）の値が小さくなり、前記多項式ｆ（ｘ）として選択できる多項式の総数も小さくなり、前記秘匿登録情報（多項式点集合Ｒ）が解読される可能性が高くなるため、安全性が低くなってしまう。
すなわち、前記最大値ｄ_ｍａｘが大きくなるに連れて、類似情報検索処理としての利便性が向上するが、前記秘匿登録情報（多項式点集合Ｒ）の安全性は低くなる。 Further, as the editing distance d increases, as the maximum value d _max increases, the number of candidate polynomial point sets R _{A1 to} R _AM of the solution candidate set _RA to be searched increases. That is, the possibility that the value of the natural number M becomes large increases. For this reason, the convenience as a similar information search process becomes high. However, in this case, the value of the natural number k (k = n−2 (d _max −1) q−2) is small (see Expression (2) ′). For this reason, the value of the degree (k−1) of the polynomial f (x) is reduced, the total number of polynomials that can be selected as the polynomial f (x) is also reduced, and the secret registration information (polynomial point set R) is decoded. Since the possibility of being increased, the safety is lowered.
That is, as the maximum value d _max increases, the convenience as the similar information search process is improved, but the security of the secret registration information (polynomial point set R) decreases.

逆に、前記最大値ｄ_ｍａｘが小さくなるに連れて、検索される前記解候補集合Ｒ_Ａの候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭの個数が少なくなる可能性が高くなり（自然数Ｍの値が小さくなる可能性が低くなり）、類似情報検索処理としての利便性が低くなる。また、この場合、前記自然数ｋの値が大きくなり、前記多項式ｆ（ｘ）の次数（ｋ−１）の値が大きくなり、前記多項式ｆ（ｘ）として選択できる多項式の総数も大きくなり、前記秘匿登録情報が解読される可能性が低くなるため、安全性が高くなる。
すなわち、前記最大値ｄ_ｍａｘが大きくなるに連れて、類似情報検索処理としての利便性が低下するが、前記秘匿登録情報（多項式点集合Ｒ）の安全性は高くなる。
この結果、実施例１の前記類似情報検索システムＳでは、前記最大値ｄ_ｍａｘに基づいて、前記利便性と前記安全性とを調節することができる。 Conversely, as the maximum value d _max decreases, the number of candidate polynomial point sets R _{A1 to} R _AM of the solution candidate set _RA to be searched increases (the value of the natural number M is reduced). The possibility of becoming smaller is reduced), and the convenience as a similar information search process is lowered. In this case, the value of the natural number k increases, the value of the degree (k-1) of the polynomial f (x) increases, and the total number of polynomials that can be selected as the polynomial f (x) also increases. Since the possibility that the secret registration information is decrypted is reduced, the safety is increased.
That is, as the maximum value d _max increases, the convenience as the similar information search process decreases, but the security of the secret registration information (polynomial point set R) increases.
As a result, in the similar information search system S of the first embodiment, the convenience and the safety can be adjusted based on the maximum value _dmax .

（総当り攻撃に対する秘匿検索情報の安全性について）
また、前記構成を備えた実施例１の前記類似情報検索システムＳでは、例えば、前記第三者や前記管理人が、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍを総当りで推測して、前記秘匿検索情報（検索部分集合Ｂ^＊）から前記検索集合Ｂを復元して、前記検索情報ｔの解読を試みる可能性がある。 (About the security of confidential search information against brute force attacks)
In the similar information search system S according to the first embodiment having the above-described configuration, for example, the third party or the administrator estimates L types of search numerical values b _{m−L + 1 to} b _m by brute force. There is a possibility that the search set B is restored from the secret search information (search subset B ^* ) and the search information t is deciphered.

ここで、前記自然数Ｌの値が大きくなるに連れて、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍを特定できる可能性が低くなる。よって、前記検索集合Ｂを復元して前記検索情報ｔの解読される可能性が低くなるため、安全性が高くなる。しかしながら、この場合、前記データベースサーバＤＳに記憶された前記秘匿登録情報Ｒ_１〜Ｒ_Ｎが前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭであるか否かを判別するための前記要素数‖Ｑ_ｊ ^＊‖（ｊ＝１，２，…，Ｎ）の判別値（ｃ−Ｌ）の値が小さくなる。このため、前記類似情報ｔ′が復元できる可能性が低いにも関わらず、前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭとして検出される個数が多くなる可能性が高くなる（自然数Ｍの値が大きくなる可能性が高くなる）。例えば、Ｌ＝ｃとした場合、（ｃ−Ｌ）＝０が成立し、前記データベースサーバＤＳに記憶された全ての前記秘匿登録情報Ｒ_１〜Ｒ_Ｎが前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＮとして検出されてしまう。
すなわち、前記自然数Ｌの値が大きくなるに連れて、前記秘匿検索情報（検索部分集合Ｂ^＊の安全性は高くなるが、誤検出が多くなり類似情報検索処理としての処理効率が低下する。 Here, as the value of the natural number L increases, the possibility that the L types of search numerical values b _{m−L + 1 to} b _m can be specified decreases. Therefore, since the possibility that the search set B is restored and the search information t is decoded is reduced, the safety is increased. However, in this case, the number of elements ‖Q _j ^* for determining whether or not the secret registration information R _{1 to} R _N stored in the database server DS is the candidate polynomial point set R _{A1 to} R _AM ^. The discriminant value (c−L) of ‖ (j = 1, 2,..., N) becomes smaller. For this reason, although the possibility that the similar information t ′ can be restored is low, there is a high possibility that the number detected as the candidate polynomial point sets R _{A1 to} R _AM is large (the value of the natural number M is large). Is likely to be). For example, when L = c, (c−L) = 0 is established, and all the secret registration information R _{1 to} R _N stored in the database server DS are the candidate polynomial point sets R _{A1 to} R _AN. Will be detected.
That is, as the value of the natural number L increases, the security of the secret search information (search subset B ^* increases), but false detection increases and the processing efficiency as the similar information search process decreases.

逆に、前記自然数Ｌの値が小さくなるに連れて、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍを特定できる可能性が高くなり、前記検索情報ｔの解読される可能性が高くなるため、安全性が低くなる。しかしながら、この場合、前記判別値（ｃ−Ｌ）の値が小さくなる。このため、前記類似情報ｔ′が復元できる可能性が低い前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭが検出される可能性が低くなる。例えば、Ｌ＝０とした場合、（ｃ−Ｌ）＝ｃが成立し、Ｑ_ｊ ^＊＝Ｑ_ｊ（ｊ＝１，２，…，Ｎ）が常に成立する前記候補多項式点集合Ｒ_Ａ１〜Ｒ_ＡＭが検出される。
すなわち、前記自然数Ｌの値が小さくなるに連れて、前記秘匿検索情報（検索部分集合Ｂ^＊の安全性は低くなるが、誤検出が低減され類似情報検索処理としての処理効率が向上する。
この結果、実施例１の前記類似情報検索システムＳでは、前記自然数Ｌの値に基づいて、前記安全性と前記処理効率とを調節することができる。 Conversely, as the value of the natural number L decreases, the possibility that the L types of search numerical values b _{m−L + 1 to} b _m can be specified increases, and the possibility that the search information t is decoded increases. , Safety is reduced. However, in this case, the discriminant value (c−L) is small. For this reason, the possibility that the candidate polynomial point sets R _{A1 to} R _AM that are unlikely to restore the similarity information t ′ is low is reduced. For example, when L = 0, (c−L) = c holds, and the candidate polynomial point set R _{A1 to} R A where Q _j ^* = Q _j (j = 1, 2,..., N) always holds. _AM is detected.
That is, as the value of the natural number L becomes smaller, the security of the secret search information (search subset B ^* becomes lower, but false detection is reduced and the processing efficiency as the similar information search process is improved.
As a result, in the similar information search system S of the first embodiment, the safety and the processing efficiency can be adjusted based on the value of the natural number L.

（登録情報の偏りに基づく攻撃に対する類似情報検索システムＳの安全性について）
また、前記構成を備えた実施例１の前記類似情報検索システムＳでは、例えば、前記登録情報ｓに文字列としての偏り、すなわち、文字列中の文字やｑ−ｇｒａｍの出現頻度に偏りが存在する場合がある。この場合、前記第三者や前記管理人が、前記偏りに基づいて、前記係数ｓ_０〜ｓ_ｋ−１を推測したり、前記登録数値ａ_１〜ａ_ｎや擬似数値ａ_ｎ＋１〜ａ_ｒを推測したり、これらの推測を組み合わせたりすることにより、総当りで前記多項式ｆ（ｘ）を復元する場合に比べ、前記多項式ｆ（ｘ）となる候補の多項式を絞り込むことができる可能性がある。 (Safety of similar information retrieval system S against attacks based on bias of registered information)
In the similar information search system S according to the first embodiment having the above-described configuration, for example, the registration information s is biased as a character string, that is, there is a bias in the appearance frequency of characters or q-grams in the character string. There is a case. In this case, the third party or the caretaker, on the basis of the deviation, or infer the coefficient _s 0 _{~s k-1,} the registration numerical _a 1 ~a _n and pseudo numerical _{a n} + 1 ~a _r There is a possibility that candidate polynomials that become the polynomial f (x) can be narrowed down by making a guess or by combining these guesses, compared to the case of restoring the polynomial f (x) in a brute force manner. .

ここで、例えば、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}に偏りが存在しても、前記係数ｓ_０〜ｓ_ｋ−１（ｓ_０，ｓ_１，…，ｓ_ｋ−１∈Ｆ^ｋ）の分布が前記ｋ次拡大体Ｆ^ｋ内で一様となるように前記可逆関数Ｔを設定することにより、前記秘匿登録情報（多項式点集合Ｒ）が解読される可能性が低くなる。すなわち、前記部分文字列ｓ_ｋ（１）〜ｓ_{ｋ（ｋ−１）}と、前記係数ｓ_０〜ｓ_ｋ−１との相関性が低減されることにより、前記係数ｓ_０〜ｓ_ｋ−１が推測され難くなり、前記秘匿登録情報（多項式点集合Ｒ）の安全性が高くなる。また、例えば、前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎに偏りが存在しても、前記登録数値ａ_１〜ａ_ｎ（ａ_１，ａ_２，…，ａ_ｎ∈Ｆ^ｎ）の分布が前記ｎ次拡大体Ｆ^ｎ内で一様となるように前記ハッシュ関数Ｈを設定することにより、前記秘匿登録情報（多項式点集合Ｒ）が解読される可能性が低くなる。すなわち、前記登録部分情報ｓ_ｑ１〜ｓ_ｑｎと、前記登録数値ａ_１〜ａ_ｎとの相関性が低減されることにより、前記登録数値ａ_１〜ａ_ｎが推測され難くなり、前記秘匿登録情報（多項式点集合Ｒ）の安全性が高くなる。
この結果、実施例１の前記類似情報検索システムＳでは、前記各関数Ｔ，Ｈが各出力値ｓ_０〜ｓ_ｋ−１，ａ_１〜ａ_ｎを一様に分布させる能力、いわゆる、前記各関数Ｔ，Ｈの一様分布性に基づいて、前記秘匿登録情報（多項式点集合Ｒ）の安全性を高くすることができる。 Here, for example, the substring _{_{s k (1) ~s k (}} k-1) be present is a bias in the coefficient _{_{_{s 0 ~s k-1 (s}}} 0, s 1, ..., s k _The secret registration information (polynomial point set R) may be deciphered by setting the reversible function T so that the distribution of ₋₁ ∈ F ^k ) is uniform in the k-th order extension field F ^k . Becomes lower. That is, by reducing the correlation between the partial character strings s _{k (1) to} s _{k (} _k−1) and the coefficients s _{0 to} s _k−1 , the coefficients s _{0 to} s _k−1. Is difficult to guess, and the security of the secret registration information (polynomial point set R) is increased. Further, for example, even if there is a bias to the registration component information _s q1 _{~s qn,} the registration numerical _{_{_{_{a 1 ~a n (a 1,}}}} a 2, ..., a n ∈F n) distribution wherein n next by in extension field F ⁿ sets the hash function H so as to be uniform, possibly the secret registration information (polynomial point set R) are decrypted is lowered. That is, the registration partial information _s q1 _{~s qn,} by correlation with the registration numerical value _a 1 ~a _n is reduced, the registration numerical _a 1 ~a _n becomes is hardly guess the secret registration information The safety of (polynomial point set R) is increased.
As a result, the in similar information retrieval system S of Example 1, the respective function T, H is the output value _{_{_{s 0 ~s k-1, a}}} 1 ~a n ability to uniformly distribute the so-called, each Based on the uniform distribution of the functions T and H, the security of the secret registration information (polynomial point set R) can be increased.

また、例えば、前記第三者や前記管理人が、前記偏りに基づいて、前記登録情報ｓの一部のｑ−ｇｒａｍ等の情報を解読して、前記情報に前記検索情報ｔと同程度の類似度を有する情報、すなわち、前記編集距離ｄが最大値ｄ_ｍａｘ以下となる情報を作成して、前記登録情報ｓを解読しようとする可能性がある。 In addition, for example, the third party or the administrator decodes a part of the information such as q-gram of the registration information s based on the bias, and the information is similar to the search information t. There is a possibility that information having a similarity, that is, information in which the edit distance d is equal to or less than the maximum value d _max is created and the registered information s is to be decoded.

ここで、例えば、前記データベースサーバＤＳを、糖転移酵素や糖鎖修飾酵素等の糖鎖合成関連遺伝子、いわゆる、糖鎖遺伝子が前記登録情報ｓとして格納される糖鎖遺伝子データベース（ＧＧＤＢ：GlycoGene DataBase）とした場合、前記糖鎖遺伝子の種類が約３００種類となり、前記糖鎖遺伝子の平均配列長が既知のものに限れば約１２００塩基対（bp：base pair）となることが知られている。すなわち、前記データベースサーバＤＳを前記糖鎖遺伝子データベースとした場合には、Ｎ＝３００，ｎ＝１２００となることが知られている。なお、前記糖鎖遺伝子データベースについては、例えば、非特許文献９等に記載されており、公知である。
よって、前記偏りが存在しないものとした場合に、一遺伝子あたりの平均文字列空間の大きさ、すなわち、前記一遺伝子の文字列表現の数が、以下の数１の式（１０）により示された値以上となる。 Here, for example, the database server DS is stored in a glycan gene database (GGDB: GlycoGene DataBase) in which glycan synthesis-related genes such as glycosyltransferases and glycan modification enzymes, so-called glycan genes are stored as the registration information s. ), The number of types of sugar chain genes is about 300, and if the average sequence length of the sugar chain genes is limited to about 1,200 base pairs (bp: base pair), it is known. . That is, when the database server DS is the sugar chain gene database, it is known that N = 300 and n = 1200. The sugar chain gene database is known, for example, as described in Non-Patent Document 9 and the like.
Therefore, when it is assumed that there is no bias, the size of the average character string space per gene, that is, the number of character string representations of the one gene is expressed by the following equation (10). It becomes more than the value.

…（１０） (10)

なお、前記式（１０）における分子の底の値（４）は、塩基の種類数（「Ａ」、「Ｔ」、「Ｇ」、「Ｃ」）であり、分母の第２項（Σ演算子付きの項）は、前記編集距離ｄが最大値ｄ_ｍａｘ以下になる文字列の数（類似文字列の上限値）である。
よって、例えば、ｄ_ｍａｘ＝１００とした場合に、前記式（１０）により示された値は、約２^１７００（２^１７００≒２^２４００／２^７００）となり、前記秘匿登録情報（多項式点集合Ｒ）の安全性は、約１７００ビット以上の鍵の安全性に相当することがわかる。この結果、前記偏りが存在しても、最大で約１７００ビット以上の鍵の安全性を確保することができ、前記秘匿登録情報（多項式点集合Ｒ）の安全性を十分に確保できる。
したがって、実施例１の前記類似情報検索システムＳでは、前記自然数ｎ，Ｎの値および前記最大値ｄ_ｍａｘ，に基づいて、前記秘匿登録情報（多項式点集合Ｒ）の安全性を調節することができる。 The value (4) at the bottom of the numerator in the formula (10) is the number of base types (“A”, “T”, “G”, “C”), and the second term (Σ operation) of the denominator. The term with a child) is the number of character strings (upper limit value of similar character strings) at which the edit distance d is less than or equal to the maximum value _dmax .
Therefore, for example, when d _max = 100, the value represented by the equation (10) is about 2 ¹⁷⁰⁰ (2 ¹⁷⁰⁰ ≈2 ^2400/2 ⁷⁰⁰ ), and the secret registration information (polynomial point set R) It can be seen that the security of the key corresponds to the security of a key of about 1700 bits or more. As a result, even if the bias exists, it is possible to ensure the security of a key having a maximum of about 1700 bits or more and sufficiently secure the security registration information (polynomial point set R).
Therefore, in the similar information search system S of the first embodiment, the security of the secret registration information (polynomial point set R) can be adjusted based on the values of the natural numbers n and N and the maximum value d _max . it can.

（変更例）
以上、本発明の実施例を詳述したが、本発明は、前記実施例に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内で、種々の変更を行うことが可能である。本発明の変更例（Ｈ01）〜（Ｈ011）を下記に例示する。
（Ｈ01）前記実施例の前記類似情報検索システムＳでは、前記各クライアントパソコンＰＣａ，ＰＣｂと、前記データベースサーバＤＳとを別体で構成したが、これに限定されず、例えば、前記各クライアントパソコンＰＣａ，ＰＣｂを一体的に構成したり、前記各クライアントパソコンＰＣａ，ＰＣｂおよび前記データベースサーバＤＳを一体的に構成することも可能である。 (Example of change)
As mentioned above, although the Example of this invention was explained in full detail, this invention is not limited to the said Example, A various change is performed within the range of the summary of this invention described in the claim. It is possible. Modification examples (H01) to (H011) of the present invention are exemplified below.
(H01) In the similar information search system S of the embodiment, the client personal computers PCa and PCb and the database server DS are separately configured. However, the present invention is not limited to this. , PCb can be integrated, or the client personal computers PCa, PCb and the database server DS can be integrated.

（Ｈ02）前記実施例では、前記インターネット１を介して、前記各クライアントパソコンＰＣａ，ＰＣｂと、前記データベースサーバＤＳとが情報の送受信が可能に接続されているが、これに限定されず、例えば、専用線や、有線・無線のＬＡＮ（Local Area Network、構内通信網）環境や、暗号通信網等のその他の通信回線を介して接続されることも可能である。なお、前記通信回線として、前記専用線や、前記暗号通信網等を使用した場合には、前記第三者や前記管理者が前記各情報ｓ，ｔ，ｔ′を解読し難くなる可能性がある。すなわち、内部解析（リバース・エンジニアリング）や改変に対する防護力、いわゆる、耐タンパ性が高くなる可能性がある。
（Ｈ03）前記実施例では、前記秘匿類似情報（解候補集合Ｒ_Ａ）の復元する前記秘匿類似情報復元処理（図５のＳＴ２１１、図８のＳＴ４０１〜ＳＴ４１１参照）を、前記検索用クライアントパソコンＰＣｂで実行して、前記類似情報ｔ′を前記第三者や前記管理者に取得されないようにしているが、これに限定されず、例えば、通信回線として前記専用線や前記暗号通信網等を使用して、前記耐タンパ性が高くなった場合には、前記秘匿類似情報復元処理を、前記データベースサーバＤＳで実行して、復元された前記類似情報ｔ′を前記検索用クライアントパソコンＰＣｂに送信することも可能である。 (H02) In the embodiment, the client personal computers PCa and PCb and the database server DS are connected via the Internet 1 so as to be able to transmit and receive information. However, the present invention is not limited to this. For example, It is also possible to connect via a dedicated line, a wired / wireless LAN (Local Area Network) environment, and other communication lines such as an encryption communication network. When the dedicated line, the encryption communication network, or the like is used as the communication line, there is a possibility that the third party or the administrator has difficulty in deciphering the information s, t, t ′. is there. That is, there is a possibility that the protection against internal analysis (reverse engineering) and modification, so-called tamper resistance, is increased.
(H03) In the above embodiment, the confidential similarity information (solution candidate set _{R A)} the confidential similarity information restoration process of restoring the (see ST401～ST411 ST 211, in Figure 8 of FIG. 5), the search for a client PC PCb The similar information t ′ is not obtained by the third party or the administrator, but is not limited to this. For example, the dedicated line or the encryption communication network is used as a communication line. When the tamper resistance becomes high, the secret similar information restoration process is executed by the database server DS, and the restored similar information t ′ is transmitted to the search client personal computer PCb. It is also possible.

（Ｈ04）前記実施例において、設定された各数値については、前記安全性（厳格性、機密性、秘匿性）、前記処理効率、前記利便性等の条件・仕様・設計等に応じて、任意の数値に変更可能である。例えば、実施例１のように、ｒ＞ｎとして、前記チャフ（擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ）））を含む前記秘匿登録情報（多項式点集合Ｒ）を演算して、前記秘匿登録情報（多項式点集合Ｒ）の安全性を確保することが好ましいが、ｒ＝ｎとして、前記多項式点集合Ｒを、前記チャフを含まない前記（ｎ，ｋ）符号の符号語（登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））のみの集合）とすることも可能である。また、Ｌ＞０として、Ｌ種類の検索数値ｂ_{ｍ−Ｌ＋１}〜ｂ_ｍが除かれた前記秘匿検索情報（検索部分集合Ｂ^＊）を演算して、前記秘匿検索情報（検索部分集合Ｂ^＊）の安全性を確保することが好ましいが、Ｌ＝０として、前記検索部分集合Ｂ^＊を、前記検索集合Ｂそのものとすることも可能である（Ｂ^＊＝Ｂ）。なお、この場合、通信回線として前記専用線や前記暗号通信網等を使用し、前記耐タンパ性を高くしておくことが好ましい。
（Ｈ05）本発明において、一方向関数の一例としての前記ハッシュ関数Ｈにより、前記数値ａ_１〜ａ_ｎ，ｂ_１〜ｂ_ｍから前記各部分情報ｓ_ｑ１〜ｓ_ｑｎ，ｔ_ｑ１〜ｔ_ｑｍを復元して、前記各情報ｓ，ｔをできないようにすることが好ましいが、これに限定されず、例えば、前記秘匿登録情報（多項式点集合Ｒ）の安全性等に応じて、前記ハッシュ関数Ｈを双方向関数の一例としての可逆関数等に変更することも可能である。 (H04) In the embodiment, each numerical value set is arbitrary according to conditions (specifications, designs, etc.) such as safety (strictness, confidentiality, confidentiality), processing efficiency, convenience, etc. Can be changed to For example, as in the _first embodiment, the secret registration information including the chaff (pseudo polynomial points (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r ))) where r> n. (Polynomial point set R) is preferably calculated to ensure the security of the secret registration information (polynomial point set R). However, when r = n, the polynomial point set R does not include the chaff. It is also possible to use a code word of (n, k) code (a set of registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) only). Further, L> 0, and calculating the secure search information L types of search numerical _{b _m-L} + 1 ~b _m is removed (Search subset ^{B *),} the confidential search information (search subset ^B *) However, it is also possible to set the search subset B ^* to be the search set B itself (B ^* = B) with L = 0. In this case, it is preferable to use the dedicated line, the encryption communication network, or the like as a communication line to increase the tamper resistance.
(H05) In the present invention, by the hash function H as an example of a one-way function, the numerical _a 1 _~a _n, b 1 ~b from said _m each partial information _s q1 _~s _qn, a t q1 _{~t qm} It is preferable to restore the information s and t so that the information s and t cannot be used. However, the present invention is not limited to this. For example, the hash function H is determined according to the security of the secret registration information (polynomial point set R). Can be changed to a reversible function as an example of a bidirectional function.

（Ｈ06）前記実施例では、前記各情報ｓ，ｔ，ｔ′が、文字列情報により構成されているが、これに限定されず、例えば、順序付き木や、順序無し木、グラフ等の情報により構成することも可能である。ここで、順序付き木の情報とは、前記各情報ｓ，ｔ，ｔ′を、分岐の始点となる節点である根節点と、前記分岐の終点となる節点である葉節点と、前記根節点と前記葉節点との間に設けられた節点である内部節点と、前記根節点と前記内部節点と前記葉節点とを連結する枝となる前記分岐の情報により構成され、且つ、前記各分岐にはそれぞれ順序が付けられた情報のことである。また、順序無し木の情報とは、前記順序付き木の情報に比べ、前記各分岐には順序が付けられていない情報のことである。
これらの情報については、前記情報の構造に基づくヒストグラム（histogram、度数分布図、柱状グラフ）を演算した場合には、前記ヒストグラム間の距離（L₁ distance、Ｌ_１距離）に基づいて、解候補のフィルタリング処理を行うことができる。 (H06) In the above embodiment, each of the information s, t, t ′ is composed of character string information, but is not limited to this. For example, information such as an ordered tree, an unordered tree, a graph, etc. It is also possible to configure by. Here, the information of the ordered tree means that the information s, t, and t ′ includes the root node that is the node that is the start point of the branch, the leaf node that is the node that is the end point of the branch, and the root node. And the information on the branch that is a branch connecting the root node, the internal node, and the leaf node, and an internal node that is a node provided between the leaf node and the root node. Are each ordered information. Further, unordered tree information is information in which each branch is not ordered compared to the ordered tree information.
For these pieces of information, when a histogram (histogram, frequency distribution chart, columnar graph) based on the structure of the information is calculated, a solution candidate is calculated based on the distance between the histograms (L ₁ distance, L ₁ distance). The filtering process can be performed.

例えば、順序付き木の情報は、前記根節点および前記内部節点から分岐する枝が２本に限定され、且つ、前記根節点から前記葉節点まで到達するまでの前記分岐の回数である高さが全て同一となる順序付きの完全二分木の構造の情報である順序付完全二分木構造情報に変換することにより、前記順序付完全二分木構造情報を、ｑ−ｌｅｖｅｌ二分岐の情報（操作単位情報）のヒストグラムのベクトル表現により示すことができる。なお、前記ｑ−ｌｅｖｅｌ二分岐とは、前記完全二分木の一部の前記節点と前記節点どうしを連結する枝となる前記分岐の情報とにより構成された部分木の構造の情報である部分木構造情報であって、前記部分木における前記高さが（ｑ−１）となる前記部分木構造情報をいう。また、前記ｑ−ｌｅｖｅｌ二分岐の情報のヒストグラムとは、前記完全二分木における同じ前記ｑ−ｌｅｖｅｌ二分岐の情報の出現頻度のことをいう。 For example, the ordered tree information is limited to two branches that branch from the root node and the internal node, and the height that is the number of branches until the leaf node is reached from the root node. The ordered complete binary tree structure information is converted into q-level bifurcated information (operation unit information) by converting the ordered complete binary tree structure information, which is the information of the ordered complete binary tree that is all the same. ) Of the histogram. The q-level bifurcation is a subtree that is information on the structure of a subtree composed of the nodes of a part of the complete binary tree and the information of the branch that is a branch connecting the nodes. Structural information, which is the partial tree structure information in which the height of the partial tree is (q-1). In addition, the q-level bifurcation information histogram refers to the frequency of appearance of the same q-level bifurcation information in the complete binary tree.

ここで、２つの前記順序付完全二分木構造情報をＵ，Ｖとし、前記ｑ−ｌｅｖｅｌ二分岐の情報のヒストグラムの種類数（ベクトル空間の大きさ）を｜ΓＵ｜，｜ΓＶ｜とし、前記ｑ−ｌｅｖｅｌ二分岐の情報のヒストグラムをｕ_１〜ｕ_｜ΓＵ｜，ｖ_１〜ｖ_｜ΓＶ｜として、Ｕ＝（ｕ_１，ｕ_２，…，ｕ_｜ΓＵ｜），Ｖ＝（ｖ_１，ｖ_２，…，ｖ_｜ΓＶ｜）が成立するものとした場合に、前記Ｌ_１距離は、前記順序付完全二分木構造情報Ｕ，Ｖ間のベクトル距離になる。よって、前記Ｌ_１距離に基づいて、順序付完全二分木構造情報Ｕ，Ｖどうしが類似であるか否かを判別することができる。すなわち、解候補のフィルタリング処理を行うことができる。 Here, the two ordered complete binary tree structure information are U and V, the number of types of histograms (the size of the vector space) of the q-level bifurcation information is | ΓU |, | ΓV | A histogram of q-level bifurcation information is represented by u _{1 to} u _{| ΓU |} and v _{1 to} v _{| ΓV |} , and U = (u ₁ , u ₂ ,..., u _{| ΓU |} ), V = (v ₁ , v ₂ ,..., v _{| ΓV |} ), the L ₁ distance is the vector distance between the ordered complete binary tree structure information U and V. Therefore, on the basis of the L ₁ distance, it is possible to complete binary tree structure information U with order, what V it is determined whether or not similar. That is, solution candidate filtering processing can be performed.

なお、これらの情報のヒストグラム間のＬ_１距離に基づいて、解候補のフィルタリング処理を行う秘匿化なしの類似情報検索処理については、例えば、非特許文献１０〜１２に記載されており、公知である。また、前記順序付き木の情報を変換した前記順序付完全二分木構造情報Ｕ，Ｖについて、前記順序付完全二分木構造情報Ｕ，Ｖ間の編集距離をｄとした場合に、前記順序付完全二分木構造情報Ｕ，Ｖ間のＬ_１距離が｛４×（ｑ−１）＋１｝×ｄとなることが知られている（例えば、非特許文献１０等参照）。ここで、前記編集距離ｄとは、前記順序付完全二分木構造情報Ｖが、前記順序付完全二分木構造情報Ｕになるまでの前記ｑ−ｌｅｖｅｌ二分岐の情報の挿入・削除・置換の各編集操作の回数のことである。 Incidentally, based on the L ₁ distance between histograms of these information, the similarity information retrieval process without concealing the filtering process of the candidate solutions are, for example, are described in Non-Patent Document 10 to 12, in a known is there. For the ordered complete binary tree structure information U and V obtained by converting the ordered tree information, when the editing distance between the ordered complete binary tree structure information U and V is d, the ordered complete binary tree structure information U and V is obtained. binary tree structure information U, is _{L 1} distance between V is known to be a {4 × (q-1) +1} × d ( e.g., see non-Patent Document 10 or the like). Here, the edit distance d means each of insertion / deletion / replacement information of the q-level bifurcation information until the ordered complete binary tree structure information V becomes the ordered complete binary tree structure information U. This is the number of editing operations.

よって、前記フィルタリング処理の条件となる前記Ｌ_１距離を、前記ファジーボールトの開錠処理のフィルタリング条件とすることにより、秘匿化された類似情報検索処理を実行することが可能である。例えば、順序付き木の情報は、前記順序付完全二分木構造情報Ｕ，Ｖに基づいて、前記各集合Ａ，Ｂを演算し、前記秘匿登録情報（多項式点集合Ｒ）や前記秘匿検索情報（検索部分集合Ｂ^＊）を演算することができる。なお、前記完全二分木構造情報Ｕ，Ｖが有する前記節点の個数、例えば、｜Ｕ｜，｜Ｖ｜とし、｜ΓＵ｜＝ｎ，｜ΓＶ｜＝ｍ，ｎ≧ｍが成立するものとし、前記自然数ｃ，ｋについて以下の式（１１），（１２）が成立する場合には、以下の式（１３）が成立する。 Therefore, the L ₁ distance as the condition of the filtering process, by a filtering condition of unlocking processing of the fuzzy vault, it is possible to perform similar information retrieval processing which has been concealed. For example, the ordered tree information is obtained by calculating the sets A and B based on the ordered complete binary tree structure information U and V to obtain the secret registration information (polynomial point set R) or the secret search information ( The search subset B ^* ) can be computed. It is assumed that the number of nodes included in the complete binary tree structure information U and V, for example, | U |, | V |, and | ΓU | = n, | ΓV | = m, n ≧ m holds. When the following equations (11) and (12) are established for the natural numbers c and k, the following equation (13) is established.

ｃ＝ｎ−｛４×（ｑ−１）＋１｝×ｄ …（１１）
（ｎ＋ｋ）／２＝ｎ−｛４×（ｑ−１）＋１｝×ｄ_ｍａｘ …（１２）
‖Ｑ_ｊ‖≧ｃ
＝ｎ−｛４×（ｑ−１）＋１｝×ｄ
≧ｎ−｛４×（ｑ−１）＋１｝×ｄ_ｍａｘ
＝（ｎ＋ｋ）／２ …（１３） c = n− {4 × (q−1) +1} × d (11)
(N + k) / 2 = n− {4 × (q−1) +1} × d _max (12)
‖Q _j ‖ ≧ c
= N- {4 * (q-1) +1} * d
≧ n− {4 × (q−1) +1} × d _max
= (N + k) / 2 (13)

すなわち、ｃ≧（ｎ＋ｋ）／２が常に成立するため（式（７）参照）、前記実施例と同様に、前記次数（ｋ−１）を設定して前記多項式ｆ（ｘ）を演算できると共に、前記判別値ｃに基づいて、秘匿化されたフィルタリング処理を実行できる（図７のＳＴ３０８、図８のＳＴ４０６参照）。この結果、実施例１の前記類似情報検索システムＳと同様に、前記自然数ｃ，ｄ，ｋ，Ｌ，ｎ，ｍ，ｑ，ｒの各値に基づいて、前記秘匿類似情報検索処理（図７のＳＴ３０３〜ＳＴ３１２参照）を実行することができる。 That is, since c ≧ (n + k) / 2 is always established (see Expression (7)), the polynomial f (x) can be calculated by setting the degree (k−1) as in the above embodiment. Based on the discriminant value c, a concealed filtering process can be executed (see ST308 in FIG. 7 and ST406 in FIG. 8). As a result, similar to the similar information search system S of the first embodiment, the secret similar information search process (FIG. 7) is performed based on the values of the natural numbers c, d, k, L, n, m, q, and r. ST303 to ST312) can be executed.

（Ｈ07）本発明において、前記各情報ｓ，ｔを入力するための前記各画像２，３を表示することが好ましいが、これに限定されず、例えば、コマンド入力等により前記各情報ｓ，ｔを入力することにより、前記各画像２，３を省略することも可能である。
（Ｈ08）前記実施例では、前記誤り訂正符号の復号化アルゴリズムの一例としてのＢＭアルゴリズムを用いた前記開錠処理により、前記多項式ｆ（ｘ）の係数ｓ_０〜ｓ_ｋ−１を演算して前記類似情報ｔ′を復元するが（図８のＳＴ４１０参照）、これに限定されず、例えば、前記誤り訂正符号の復号化アルゴリズムの一例としてのピーターソン（Perterson）法やユークリッド法を用いた開錠処理により、前記類似情報ｔ′を復元することも可能である（例えば、特開２００３−１６８９８３号公報や非特許文献６等参照）。 (H07) In the present invention, it is preferable to display the images 2 and 3 for inputting the information s and t. However, the present invention is not limited to this. For example, the information s and t is input by command input or the like. It is also possible to omit the images 2 and 3 by inputting.
(H08) In the above embodiment, the coefficients s _{0 to} s _k-1 of the polynomial f (x) are calculated by the unlocking process using the BM algorithm as an example of the decoding algorithm of the error correction code. Although the similar information t ′ is restored (see ST410 in FIG. 8), the present invention is not limited to this. For example, the similarity information t ′ is not limited to this, and may be developed using the Peterson method or the Euclidean method as an example of the decoding algorithm of the error correction code. It is also possible to restore the similar information t ′ by a lock process (see, for example, Japanese Patent Laid-Open No. 2003-168983 and Non-Patent Document 6).

（Ｈ09）本発明において、前記自然数ｃ，ｄ，ｋ，Ｌ，ｎ，ｍ，ｑ，ｒの各値について、予め設定されているか、前記各情報ｓ，ｔ，ｔ′に基づいて自動的に設定されているが、これに限定されず、ユーザが各画像３，４等により設定できるようにすることも可能である。例えば、前記編集距離ｄの最大値ｄ_ｍａｘについて、前記検索画像４に最大値入力部を設けて、前記検索者が設定できるようにすることも可能である。
（Ｈ010）本発明において、前記登録情報ｓに基づいて、前記登録情報ｓを復元するために必要な点が最小で｛（ｎ＋ｋ）／２｝個となることが保証された（ｋ−１）次元で１変数の多項式ｆ（ｘ）を演算されることが好ましいが（図５のＳＴ１０７参照）、これに限定されず、例えば、｛（ｎ＋ｋ）／２｝個存在しても前記登録情報ｓを復元できない多項式を演算することも可能である。この場合、前記多項式ｆ（ｘ）として選択できる多項式の総数も大きくすることができ、前記利便性や前記安全性が高くなるが、前記類似情報ｔ′を復元可能か否かを判別するフィルタリング処理（図７のＳＴ３０８、図８のＳＴ４０６参照）の精度が低減されるため、誤検出が多くなり類似情報検索処理としての処理効率が低下する可能性がある。このため、前記多項式ｆ（ｘ）における復元可能な点の個数を、前記利便性と前記安全性と、前記フィルタリング処理の精度に基づく前記処理効率とを、前記類似情報検索システムＳの許容範囲に応じて調節することができる。 (H09) In the present invention, each value of the natural numbers c, d, k, L, n, m, q, r is preset or automatically based on the information s, t, t ′. Although it is set, the present invention is not limited to this, and it is also possible to allow the user to set the image using the images 3 and 4. For example, for the maximum value d _max of the edit distance d, a maximum value input unit may be provided in the search image 4 so that the searcher can set it.
(H010) In the present invention, based on the registration information s, it is guaranteed that the number of points necessary for restoring the registration information s is {(n + k) / 2} at a minimum (k−1). Although it is preferable to calculate a single variable polynomial f (x) in dimension (see ST107 in FIG. 5), the present invention is not limited to this. For example, even if {(n + k) / 2} exist, the registration information s It is also possible to calculate a polynomial that cannot be restored. In this case, the total number of polynomials that can be selected as the polynomial f (x) can be increased, and the convenience and the safety are enhanced, but the filtering process for determining whether or not the similar information t ′ can be restored. Since the accuracy of ST308 in FIG. 7 (see ST406 in FIG. 8) is reduced, there is a possibility that false detections increase and the processing efficiency as the similar information search processing is lowered. For this reason, the number of points that can be restored in the polynomial f (x) is set within the allowable range of the similarity information search system S, with the convenience, the safety, and the processing efficiency based on the accuracy of the filtering process. Can be adjusted accordingly.

（Ｈ011）本発明において、前記登録情報ｓに基づいて、前記登録情報ｓを復元するために必要な点が｛（ｎ＋ｋ）／２｝個以上となる（ｋ−１）次元で１変数の多項式ｆ（ｘ）を演算したが（図５のＳＴ１０７参照）、これに限定されず、例えば、｛（ｎ＋ｋ）／２｝個以下で前記登録情報ｓを復元可能であることが保証された多項式が存在すれば、そのような前記多項式を演算することも可能である。すなわち、前記誤り訂正符号として、｛（ｎ＋ｋ）／２｝個以上の点が判明すれば前記ＢＭアルゴリズム等により多項式の復元可能な前記（ｎ，ｋ）符号（（ｎ，ｋ）ＲＳ符号）を利用することに限定されず、｛（ｎ＋ｋ）／２｝個以下の点により多項式の復元可能なその他の前記誤り訂正符号を利用することも可能である。この場合、解読するために知得する必要がある前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））の数が少なくなるため、前記多項式点集合Ｒの安全性が低下するが、前記多項式点集合Ｒに含まれる前記登録多項式点（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））の総数ｒの値を大きくして、前記擬似多項式点（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））の総数を増加させることにより、前記安全性を確保することができる。 (H011) In the present invention, based on the registration information s, the number of points necessary for restoring the registration information s is {(n + k) / 2} or more (k−1) -dimensional polynomial of one variable Although f (x) has been calculated (see ST107 in FIG. 5), the present invention is not limited to this. For example, there is a polynomial that is guaranteed to be able to restore the registration information s with {(n + k) / 2} or less. If present, it is also possible to compute such a polynomial. That is, as the error correction code, the (n, k) code ((n, k) RS code) that can be restored to the polynomial by the BM algorithm or the like when the {(n + k) / 2} or more points are found. The present invention is not limited to use, and it is also possible to use other error correction codes that can restore a polynomial by {(n + k) / 2} or less points. In this case, since the number of the registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) that need to be known for decoding is reduced, the polynomial point set R Although the safety is reduced, the value of the total number r of the registered polynomial points (a ₁ , f (a ₁ )) to (a _n , f (a _n )) included in the polynomial point set R is increased, The safety can be ensured by increasing the total number of the pseudo-polynomial points (a _{n + 1} , f ′ (a _{n + 1} )) to (a _r , f ′ (a _r )).

本発明は、例えば、登録する情報がＤＮＡの塩基配列情報（核酸配列情報）や、蛋白質のアミノ酸配列情報等の遺伝子情報が格納された遺伝子情報データベースの管理業務を、遺伝子解析の分野における研究機関等が外部委託業者に委託するＤＡＳモデルを構築する場合に有用である。 The present invention relates to, for example, management of a gene information database in which genetic information such as DNA base sequence information (nucleic acid sequence information) and protein amino acid sequence information is stored as information to be registered. This is useful when a DAS model is outsourced to an outside contractor.

図１は本発明の実施例１の類似情報検索システムの全体説明図である。FIG. 1 is an overall explanatory diagram of a similar information search system according to a first embodiment of the present invention. 図２は本発明の実施例１の類似情報検索システムを構成する各装置の機能をブロック図（機能ブロック図）で示した説明図である。FIG. 2 is an explanatory diagram showing the function of each device constituting the similar information retrieval system of the first embodiment of the present invention in a block diagram (functional block diagram). 図３は実施例１の登録画像の説明図である。FIG. 3 is an explanatory diagram of a registered image according to the first embodiment. 図４は実施例１の登録画像の説明図である。FIG. 4 is an explanatory diagram of a registered image according to the first embodiment. 図５は実施例１の秘匿登録情報送信プログラムの秘匿登録情報送信処理のフローチャートである。FIG. 5 is a flowchart of the secret registration information transmission process of the secret registration information transmission program according to the first embodiment. 図６は実施例１の秘匿検索情報送信プログラムの秘匿検索情報送信処理のフローチャートである。FIG. 6 is a flowchart of the confidential search information transmission process of the confidential search information transmission program according to the first embodiment. 図７は実施例１の秘匿類似情報検索プログラムの秘匿類似情報検索処理のフローチャートである。FIG. 7 is a flowchart of the secret similar information search process of the secret similar information search program according to the first embodiment. 図８は実施例１の秘匿類似情報復元プログラムの秘匿類似情報復元処理のフローチャートであり、図６のＳＴ２１１のサブルーチンの説明図である。FIG. 8 is a flowchart of the secret similar information restoration process of the secret similar information restoration program according to the first embodiment, and is an explanatory diagram of the subroutine of ST211 in FIG. 図９はＤＡＳモデルにおける遺伝子情報データベースに従来公知の暗号化データベースの技術を適用した場合の説明図である。FIG. 9 is an explanatory view when a conventionally known encryption database technique is applied to the gene information database in the DAS model.

Explanation of symbols

Ａ…登録集合、ＡＰ４…類似情報検索プログラム、ａ_１〜ａ_ｎ…登録数値、ａ_ｎ＋１〜ａ_ｒ…擬似数値、（ａ_１，ｆ（ａ_１））〜（ａ_ｎ，ｆ（ａ_ｎ））…多項式上の点、登録多項式点、（ａ_ｎ＋１，ｆ′（ａ_ｎ＋１））〜（ａ_ｒ，ｆ′（ａ_ｒ））…擬似多項式点、Ｂ…検索集合、Ｂ^＊…検索部分集合、秘匿検索情報、ｂ_１〜ｂ_ｍ…検索数値、ｃ，ｄ，ｋ，Ｌ，ｎ，ｍ，ｑ，ｒ…自然数、ＣＡ３…多項式演算手段、ＣＡ４…登録部分情報抽出手段、ＣＡ５…登録集合演算手段、ＣＡ６…登録代入値演算手段、ＣＡ７…擬似数値演算手段、ＣＡ８…擬似代入値演算手段、ＣＡ９…秘匿登録情報演算手段、ＣＡ10…秘匿登録情報送信手段、ＣＢ３…検索部分情報抽出手段、ＣＢ４…検索集合演算手段、ＣＢ５…検索集合記憶手段、ＣＢ６…秘匿検索情報演算手段、ＣＢ７…秘匿検索情報送信手段、ＣＢ９…秘匿類似情報受信手段、ＣＢ10A…検索数値判別手段、ＣＢ10B…数値抽出手段、ＣＢ11A…類似情報復元判別手段、ＣＢ12…類似情報復元手段、ＣＤ１…秘匿登録情報受信手段、ＣＤ２…秘匿登録情報記憶手段、ＣＤ３…秘匿検索情報受信手段、ＣＤ４…検索数値判別手段、ＣＤ５…多項式点部分集合演算手段、ＣＤ６…多項式点部分集合要素数判別手段、ＣＤ７…秘匿類似情報演算手段、ＣＤ８…秘匿類似情報送信手段、ＤＳ…記憶装置、類似情報検索装置、ｆ（ｘ）…多項式、ｆ（ａ_１），ｆ（ａ_２），…，ｆ（ａ_ｎ）…登録代入値、ｆ′（ａ_ｎ＋１）〜ｆ′（ａ_ｒ）…擬似代入値、ＰＣａ…登録装置、ＰＣｂ…検索装置、Ｑ_１ ^＊〜Ｑ_Ｎ ^＊…多項式点部分集合、Ｒ…多項式点集合、秘匿登録情報、Ｒ_Ａ…秘匿類似情報、Ｈ…一方向関数、Ｓ…類似情報検索システム、ｓ…登録情報、文字列情報、ｓ_ｑ１〜ｓ_ｑｎ…登録部分情報、部分情報、ｔ…検索情報、文字列情報、ｔ_ｑ１〜ｔ_ｑｍ…検索部分情報、部分情報、ｔ′…類似情報、文字列情報。 A ... registration set, AP4 ... similar information retrieval _{program, a} 1 ~a n _... registration numerical _{value, a n +} 1 ~a r _... pseudo numerical _{_{value, (a 1, f (a}} 1)) ~ (a n, f (a n) ) ... polynomial points, registered polynomial points, (a _{n + 1} , f ′ (a _{n + 1} )) to ( _ar , f ′ (a _r )) ... pseudo-polynomial points, B ... search set, B ^* ... search subset , Secret search information, b _{1 to} b _m ... Search numerical value, c, d, k, L, n, m, q, r ... natural number, CA3 ... polynomial operation means, CA4 ... registered partial information extraction means, CA5 ... registered set Calculation means, CA6 ... registered substitution value calculation means, CA7 ... pseudo numerical value calculation means, CA8 ... pseudo substitution value calculation means, CA9 ... secret registration information calculation means, CA10 ... secret registration information transmission means, CB3 ... search partial information extraction means, CB4: Search set calculation means, CB5: Search set storage CB6 ... Secret search information calculation means, CB7 ... Secret search information transmission means, CB9 ... Secret similarity information reception means, CB10A ... Search numerical value determination means, CB10B ... Numerical value extraction means, CB11A ... Similar information restoration determination means, CB12 ... Similar information Restoring means, CD1 ... Secret registration information receiving means, CD2 ... Secret registration information storage means, CD3 ... Secret search information receiving means, CD4 ... Search numerical value discrimination means, CD5 ... Polynomial point subset calculating means, CD6 ... Polynomial point subset element number determining means, CD7 ... confidential similarity information calculating means, CD8 ... confidential similarity information transmitting means, DS ... storage device, similar information retrieval system, f (x) ... _{polynomial, f (a 1), f} (a 2), ... , F (a _n ) ... registered substitution value, f ′ (a _{n + 1} ) to f ′ (a _r ) ... pseudo substitution value, PCa ... registration device, PCb ... search device, Q ₁ ^{* to} Q _N ^* ... polynomial point part Minute set, R ... polynomial point set, confidential registration information, R A _... confidential similar information, H ... one-way function, S ... similar information retrieval system, s ... registration information, character string _{information, s} q1 ~s _{qn ...} registration part information, partial information, t ... search information, the character string _{information, t} q1 ~t _{qm ...} search part information, part information, t '... similar information, character string information.

Claims

A storage device for storing registration information which is information to be registered;
A registration device that is connected to the storage device so as to be able to transmit and receive information, and that registers the registration information to the storage device;
Information similar to the registration information that is the same as or similar to the search information that is the search target information among the stored registration information that is connected to the storage device so as to be able to transmit and receive information. A search device for searching;
A similar information retrieval system having
The natural numbers are c, d, k, L, n, m, q, and r, respectively, and d operations of insertion / deletion / replacement are performed on the operation unit information that is the unit information to be operated in the search information. The search information is converted into the similar information, and n pieces of partial information having q pieces of the operation unit information can be calculated for the registration information. m or more pieces of partial information having q pieces of the operation unit information can be calculated, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L, r ≧ n Are assumed to hold,
The registration device
Based on the registration information, a polynomial computing means for computing the polynomial that is a univariate (k-1) dimension and that can restore the registration information;
Registered partial information extracting means for extracting registered partial information, which is n pieces of partial information capable of restoring the registered information, based on the registered information;
A registered set calculation means for calculating a registered set, which is a set having n registered numeric values as elements, based on the extracted n pieces of registered partial information;
A registered substitution value calculating means for calculating a registered substitution value that is a numerical value of the polynomial into which the registered numeric value is substituted;
Pseudo numerical value calculating means for calculating (rn) types of pseudo numerical values which are numerical values other than the registered numerical values;
Pseudo-substitution value calculating means for calculating a pseudo-substitution value that is a numerical value other than the numerical value of the polynomial into which the pseudo-numeric value is substituted;
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point, and the pseudo numerical value and the pseudo substitution value corresponding to the pseudo numerical value are a set. When a point other than the polynomial is set as a pseudo-polynomial point, a polynomial point set that is a set of r points having n registered polynomial points and (r−n) pseudo-polynomial points is calculated. A secret registration information calculation means for calculating secret registration information in which the registration information is concealed;
Secret registration information transmitting means for transmitting the calculated secret registration information to the storage device;
Have
The storage device
Confidential registration information receiving means for receiving the confidential registration information transmitted by the confidential registration information transmitting means;
Secret registration information storage means for storing the received secret registration information;
Have
The search device includes:
Search partial information extraction means for extracting search partial information which is m pieces of partial information capable of restoring the search information based on the search information;
A search set calculation means for calculating a search set that is a set having m search numerical values as elements based on the extracted m pieces of search partial information;
Search set storage means for storing the calculated search set;
By calculating a search subset that is a subset of the search set having (m−L) types of the search numerical values excluding L types of the search numerical values among the m types of search numerical values, the search Secret search information calculation means for calculating secret search information in which information is concealed;
A secret search information transmitting means for transmitting the calculated secret search information to the storage device;
Have
The storage device
Secret search information receiving means for receiving the secret search information transmitted by the secret search information transmitting means;
Search numerical value determining means for determining whether the search numerical value included in the received confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
Polynomial point subset element number determining means for determining whether or not the calculated points of the polynomial point subset are (c−L) or more;
(CL) Concealed similarity information computing means for computing concealed similarity information in which the similar information is concealed by computing each concealment registration information corresponding to the polynomial point subset having at least (c−L) points ,
A secret similarity information transmitting means for transmitting the calculated secret similarity information to the search device;
Have
The search device includes:
A concealment similarity information receiving means for receiving the concealment similarity information transmitted by the concealment similarity information transmission means;
Search numerical value determining means for determining whether or not the search numerical value included in the stored search set is the same as the registered numerical value or the pseudo numerical value included in each received secret similar information;
Numerical value extraction means for extracting the registered numerical value and the pseudo numerical value that are the same as the search numerical value;
It is determined whether or not each of the secret similar information can be restored as the similar information with respect to the search information by determining whether or not the extracted registered numerical value and the pseudo numerical value are c or more. Similar information restoration discrimination means,
Similar information restoration means for computing the polynomial and restoring the similar information based on the registered numeric value and the pseudo numeric value when the extracted registered numeric value and the pseudo numeric value are c or more,
A similar information retrieval system characterized by comprising:

When c ≧ (n + k) / 2 holds for the natural numbers c, k, n,
The registration device
Based on the registration information, the polynomial is a one-variable polynomial in (k−1) dimensions, and the number of points on the polynomial necessary for restoring the registration information is {(n + k) / 2} or more. A polynomial calculation means for calculating a polynomial;
The similar information retrieval system according to claim 1, wherein:

Information to be registered is registered information,
Information that is the same as or similar to the registration information is similar information,
The natural numbers are d, k, n, q, r, respectively.
With respect to operation unit information that is information of a unit to be operated in the registration information, the registration information is converted into the similar information by performing d insertion / deletion / replacement operations, and the registration For information, it is assumed that n or more pieces of partial information having q pieces of operation unit information can be calculated, and r ≧ n holds.
The n pieces of partial information that can restore the registration information extracted based on the registration information are used as registration partial information.
A set having a registered numerical value, which is an n-type numerical value calculated based on the extracted n pieces of registered partial information, as a registered set,
A (k−1) -dimensional univariate polynomial calculated based on the registration information, and a numerical value calculated by substituting the registered numerical value for the polynomial that can restore the registration information is used as a registered substitution value. ,
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point,
A pseudo numerical value that is a numerical value other than the registered numerical value and a pseudo substituted value corresponding to the pseudo numerical value, and the pseudo substituted value that is a numerical value other than the numerical value of the polynomial to which the pseudo numerical value is substituted is a set. When a point other than a polynomial is a pseudo-polynomial point,
A polynomial point set, which is a set of r points each having n registration polynomial points and (r−n) pseudo-polynomial points, is stored as secret registration information in which the registration information is concealed. A secret registration information storage means;
The search target information is the search information,
Let natural numbers be c, L, m respectively.
For the search information, m or more pieces of partial information having q pieces of operation unit information can be calculated, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L is established,
With respect to the operation unit information in the search information, the search information is converted into the similar information by performing d insertion / deletion / replacement operations, and the search information is extracted based on the search information. The m pieces of partial information capable of restoring the search information are set as search partial information,
A set having as elements search numerical values that are m types of numerical values calculated based on the extracted m pieces of search partial information is defined as a search set.
Of the m types of search numerical values, the search information conceals a search subset that is a subset of the search set whose elements are the (m−L) types of search numerical values excluding the L types of search numerical values. If the search information is hidden,
Search numerical value determination means for determining whether the search numerical value included in the confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
Polynomial point subset element number determining means for determining whether or not the calculated points of the polynomial point subset are (c−L) or more;
(CL) Concealed similarity information computing means for computing concealed similarity information in which the similar information is concealed by computing each concealment registration information corresponding to the polynomial point subset having at least (c−L) points ,
A similar information retrieval system characterized by comprising:

Computer
Information to be registered is registered information,
Information that is the same as or similar to the registration information is similar information,
The natural numbers are d, k, n, q, r, respectively.
With respect to operation unit information that is information of a unit to be operated in the registration information, the registration information is converted into the similar information by performing d insertion / deletion / replacement operations, and the registration For information, it is assumed that n or more pieces of partial information having q pieces of operation unit information can be calculated, and r ≧ n holds.
The n pieces of partial information that can restore the registration information extracted based on the registration information are used as registration partial information.
A set having a registered numerical value, which is an n-type numerical value calculated based on the extracted n pieces of registered partial information, as a registered set,
A (k−1) -dimensional univariate polynomial calculated based on the registration information, and a numerical value calculated by substituting the registered numerical value for the polynomial that can restore the registration information is used as a registered substitution value. ,
A point on the polynomial having a set of the registered numerical value and the registered substitution value corresponding to the registered numerical value is a registered polynomial point,
A pseudo numerical value that is a numerical value other than the registered numerical value and a pseudo substituted value corresponding to the pseudo numerical value, and the pseudo substituted value that is a numerical value other than the numerical value of the polynomial to which the pseudo numerical value is substituted is a set. When a point other than a polynomial is a pseudo-polynomial point,
A polynomial point set, which is a set of r points each having n registration polynomial points and (r−n) pseudo-polynomial points, is stored as secret registration information in which the registration information is concealed. Secret registration information storage means,
The search target information is the search information,
Let natural numbers be c, L, m respectively.
If m or more pieces of partial information having q pieces of operation unit information can be calculated for the search information, c is calculated based on d, q, n, m, and c ≧ L, m ≧ L Each holds,
With respect to the operation unit information in the search information, the search information is converted into the similar information by performing d insertion / deletion / replacement operations, and the search information is extracted based on the search information. The m pieces of partial information capable of restoring the search information are set as search partial information,
A set having as elements search numerical values that are m types of numerical values calculated based on the extracted m pieces of search partial information is defined as a search set.
Of the m types of search numerical values, the search information conceals a search subset that is a subset of the search set whose elements are the (m−L) types of search numerical values excluding the L types of search numerical values. If the search information is hidden,
Search numerical value determining means for determining whether the search numerical value included in the confidential search information is the same as the registered numerical value or the pseudo numerical value included in each stored confidential registration information;
By extracting the registered polynomial point of the registered numerical value and the pseudo-polynomial point of the pseudo-numerical value that are the same as the search numerical value, a projection set of the search numerical value in the polynomial point set, A polynomial point subset computing means for computing a polynomial point subset which is a subset;
A polynomial point subset element number discriminating means for discriminating whether or not there are (c−L) or more calculated points of the polynomial point subset;
(CL) Secret similar information calculation means for calculating secret similar information in which the similar information is concealed by calculating each secret registration information corresponding to the polynomial point subset having more than (c−L) points;
A similar information retrieval program characterized by functioning as