CN112732776B

CN112732776B - Secure approximate pattern matching method and system and electronic equipment

Info

Publication number: CN112732776B
Application number: CN202011561764.XA
Authority: CN
Inventors: 魏晓超; 徐琳; 王皓
Original assignee: Shandong Normal University
Current assignee: Shandong Normal University
Priority date: 2020-12-25
Filing date: 2020-12-25
Publication date: 2022-08-26
Anticipated expiration: 2040-12-25
Also published as: CN112732776A

Abstract

The present disclosure provides a secure approximate pattern matching method, system and electronic device. A first terminal holds a pattern string, the length of a text string and a threshold; a second terminal holds the text string, the length of the pattern string and the threshold. The first terminal and the second terminal execute a secure approximate pattern matching algorithm, and if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string. With this method, a user who holds pattern information can obtain the positions at which the pattern appears in a database. By means of an oblivious transfer algorithm and a Boolean-type threshold private set intersection algorithm, the database cannot learn the user's pattern information and the user cannot learn any other data in the database, so pattern matching is performed while the security of both parties' data is preserved.

Description

A secure approximate pattern matching method, system and electronic device

Technical Field

The present disclosure relates to the technical field of pattern matching, and in particular to a secure approximate pattern matching method, system and electronic device.

Background

The statements in this section merely provide background related to the present disclosure and do not necessarily constitute prior art.

Approximate pattern matching is widely used. In a face recognition system, for example, the feature data extracted from a user's facial image differs when the lighting, position or expression differs. Therefore, when the extracted feature data is matched against the feature templates stored in a database, the identity corresponding to the facial image must be determined from the similarity between the two rather than from whether they are identical.

However, the inventors found that both the user's facial data and the feature templates in the database are private data, and the two parties usually do not want to disclose the private data they hold, so as to avoid leaking their private information.

Summary of the Invention

To address the deficiencies of the prior art, the present disclosure provides a secure approximate pattern matching method, system and electronic device. A user who holds pattern information can obtain the positions at which the pattern appears in a database. By means of an oblivious transfer algorithm and a Boolean-type threshold private set intersection algorithm, the database party cannot learn the user's pattern information and the user cannot learn any other data in the database, so the security of each party's data is preserved while pattern matching is performed.

To achieve the above object, the present disclosure adopts the following technical solutions:

A first aspect of the present disclosure provides a secure approximate pattern matching method.

A secure approximate pattern matching method, applied to a first terminal holding a pattern string, the length of a text string and a threshold, comprising the following steps:

The first terminal executes a secure approximate pattern matching algorithm with a second terminal holding the text string, the length of the pattern string and the threshold; if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string.

A second aspect of the present disclosure provides an electronic device.

An electronic device, comprising a first terminal holding a pattern string, the length of a text string and a threshold, the first terminal communicating with a second terminal holding the text string, the length of the pattern string and the threshold;

The first terminal and the second terminal execute a secure approximate pattern matching algorithm; if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string.

A third aspect of the present disclosure provides a secure approximate pattern matching method.

A secure approximate pattern matching method, applied to a second terminal holding a text string, the length of a pattern string and a threshold, comprising the following steps:

The second terminal executes a secure approximate pattern matching algorithm with a first terminal holding the pattern string, the length of the text string and the threshold, such that, if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string.

A fourth aspect of the present disclosure provides an electronic device.

An electronic device, comprising a second terminal holding a text string, the length of a pattern string and a threshold, the second terminal communicating with a first terminal holding the pattern string, the length of the text string and the threshold;

The first terminal and the second terminal execute a secure approximate pattern matching algorithm such that, if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string.

A fifth aspect of the present disclosure provides a secure approximate pattern matching method.

A secure approximate pattern matching method, involving a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, comprising the following steps:

A sixth aspect of the present disclosure provides a secure approximate pattern matching system.

A secure approximate pattern matching system, comprising a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, the first terminal communicating with the second terminal:

Compared with the prior art, the beneficial effects of the present disclosure are:

1. With the method, electronic device or system described in the present disclosure, a user who holds pattern information can obtain the positions at which the pattern appears in a database. By means of the oblivious transfer algorithm and the Boolean-type threshold private set intersection algorithm, the database party cannot learn the user's pattern information and the user cannot learn any other data in the database, so the security of each party's data is preserved while pattern matching is performed.

2. The method, electronic device or system described in the present disclosure enables the participant who holds the pattern to obtain the positions at which the pattern appears in the text, while the participant who holds the text cannot obtain any information about the pattern and the participant who holds the pattern cannot obtain any other information about the text.

Brief Description of the Drawings

The accompanying drawings, which form a part of the present disclosure, are provided for further understanding of the present disclosure. The exemplary embodiments of the present disclosure and their descriptions are used to explain the present disclosure and do not constitute an improper limitation of it.

FIG. 1 is a schematic flowchart of the secure approximate pattern matching method provided in Embodiment 1 of the present disclosure.

Detailed Description

The present disclosure is further described below with reference to the accompanying drawings and embodiments.

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the present disclosure. Unless otherwise specified, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the present disclosure belongs.

It should be noted that the terminology used herein is for the purpose of describing specific embodiments only and is not intended to limit the exemplary embodiments according to the present disclosure. As used herein, unless the context clearly indicates otherwise, the singular forms are intended to include the plural forms as well. It should also be understood that when the terms "comprising" and/or "including" are used in this specification, they indicate the presence of features, steps, operations, devices, components and/or combinations thereof.

The embodiments of the present disclosure and the features of the embodiments may be combined with each other where no conflict arises.

Embodiment 1:

As shown in FIG. 1, Embodiment 1 of the present disclosure provides a secure approximate pattern matching method, applied to a first terminal (that is, P₁);

The input of participant P₀ (that is, the second terminal) is the text string t∈{0,1}ⁿ, the length m of the pattern string and the threshold τ; the input of participant P₁ is the pattern string p∈{0,1}^m, the length n of the text string and the threshold τ;

After the secure approximate pattern matching algorithm finishes, if the Hamming distance between the i-th substring of t and p is less than τ, P₁ outputs the position i. The formal description is as follows:

Input: The input of P₀ is (t,m,τ), and the input of P₁ is (p,n,τ).

Output: If the Hamming distance between the i-th substring of t and p is less than τ, P₁ outputs i.
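The input/output specification above can be read as an ideal functionality computed in the clear. The following is a minimal sketch of that plaintext functionality only; it provides no privacy, and the function names `hamming_distance` and `approximate_matches` are illustrative and do not come from the disclosure. Positions are reported 1-based, matching the index range i=1,…,n-m+1 used in the text.

```python
from typing import List


def hamming_distance(a: str, b: str) -> int:
    """Count the positions at which two equal-length bit strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def approximate_matches(t: str, p: str, tau: int) -> List[int]:
    """Return every 1-based position i such that the i-th length-m
    substring of t has Hamming distance less than tau from p."""
    n, m = len(t), len(p)
    return [
        i + 1
        for i in range(n - m + 1)
        if hamming_distance(t[i:i + m], p) < tau
    ]
```

For example, `approximate_matches("0110101", "101", 1)` reports only the exact occurrences of the pattern, because a distance strictly below 1 means a distance of 0.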

Initialization: Let the security parameter be λ and set a global parameter p, where p is a prime whose binary length satisfies |p|>λ.

The secure approximate pattern matching algorithm is as follows:

(1) P₀ randomly selects m pairs of random numbers, namely

[formula omitted in source]

where

[formula omitted in source]

P₀ selects the corresponding random numbers according to each substring of length m of the text string t, and obtains the text set

[formula omitted in source]

where i=1,…,n-m+1;

(2) For each j=1,…,m, P₀ and P₁ execute a 1-out-of-2 oblivious transfer algorithm, in which the input of P₀ is

[formula omitted in source]

and the input of P₁ is p_j. After the algorithm finishes, P₁ obtains the pattern set

[formula omitted in source]

(3) For each i=1,…,n-m+1, P₀ and P₁ execute a Boolean-type threshold private set intersection algorithm, in which the input of P₀ is (C_i,m,m-τ) and the input of P₁ is (S,m,m-τ). After the algorithm finishes, P₁ obtains the output set b_i∈{0,1}^{n-m+1}.

(4) If b_i=1, P₁ outputs i.
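Steps (1) to (4) can be simulated in the clear to see why a set intersection size measures string similarity. The sketch below rests on an assumption, because the formulas for the random pairs and the two sets are not reproduced in this text: it takes the j-th pair to be two distinct random values (one for bit 0, one for bit 1), takes the text set C_i to contain, for each j, the value selected by the j-th bit of the i-th substring, and takes the pattern set S to contain the value selected by p_j. The oblivious transfers and the threshold private set intersection are replaced by direct plaintext operations, so this illustrates correctness only, not privacy. All function names and the modulus `PRIME` are hypothetical.

```python
import secrets
from typing import List, Set, Tuple

PRIME = 2**127 - 1  # illustrative prime modulus, not taken from the disclosure


def sample_pairs(m: int) -> List[Tuple[int, int]]:
    """Draw m pairs of pairwise-distinct random values below PRIME."""
    seen: Set[int] = set()
    pairs: List[Tuple[int, int]] = []
    for _ in range(m):
        pair: List[int] = []
        while len(pair) < 2:
            r = secrets.randbelow(PRIME)
            if r not in seen:  # keep all 2m values distinct
                seen.add(r)
                pair.append(r)
        pairs.append((pair[0], pair[1]))
    return pairs


def text_sets(t: str, pairs: List[Tuple[int, int]]) -> List[Set[int]]:
    """Assumed encoding: C_i holds, for each j, the value picked by bit j of substring i."""
    m = len(pairs)
    return [
        {pairs[j][int(t[i + j])] for j in range(m)}
        for i in range(len(t) - m + 1)
    ]


def pattern_set(p: str, pairs: List[Tuple[int, int]]) -> Set[int]:
    """What P1 would hold after m oblivious transfers with choice bits p_j (assumed)."""
    return {pairs[j][int(p[j])] for j in range(len(p))}


def match_positions(t: str, p: str, tau: int) -> List[int]:
    """Plaintext stand-in for steps (3) and (4): output i when |C_i ∩ S| >= m - tau."""
    m = len(p)
    pairs = sample_pairs(m)
    s = pattern_set(p, pairs)
    return [
        i + 1
        for i, c_i in enumerate(text_sets(t, pairs))
        if len(c_i & s) >= m - tau
    ]
```

Under this assumed encoding, |C_i∩S| equals m minus the Hamming distance between the i-th substring and p, so the test |C_i∩S|≥m-τ used in step (3) accepts exactly the substrings whose distance is at most τ.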

The Boolean-type threshold private set intersection algorithm is as follows:

In the Boolean-type threshold private set intersection algorithm, C and S are sets, |C| and |S| are the sizes of C and S respectively, and t is the threshold. The input of participant P₀ is (C,|S|,t) and the input of participant P₁ is (S,|C|,t). After the algorithm finishes, P₁ outputs 1 if |C∩S|≥t and 0 otherwise. The specific description is as follows:

Input: The input of P₀ is (C,|S|,t), and the input of P₁ is (S,|C|,t).

Output: If |C∩S|≥t, P₁ outputs 1; otherwise P₁ outputs 0.
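As a point of reference, the functionality just specified can be written as a single function that a trusted party would compute. This is a sketch of the ideal functionality only, with a hypothetical name `threshold_psi_bit`; a real protocol has to reach the same output without either party seeing the other's set.

```python
from typing import Hashable, Set


def threshold_psi_bit(c: Set[Hashable], s: Set[Hashable], t: int) -> int:
    """Return 1 if the two sets share at least t elements, otherwise 0.

    Only this single bit is the intended output; the intersection itself
    and its exact size are not meant to be revealed.
    """
    return 1 if len(c & s) >= t else 0
```

For instance, `threshold_psi_bit({1, 2, 3}, {2, 3, 4}, 2)` evaluates to 1, while raising the threshold to 3 gives 0.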

Initialization: Let the security parameter be λ and set a global parameter p, where p is a prime whose binary length satisfies |p|>λ.

1) P₀ publishes the additive homomorphic encryption public key pk₁; P₀ and P₁ then execute the private set intersection cardinality algorithm, and P₁ obtains Enc(pk₁,|C∩S|).

2) P₁ selects a random number r∈{0,…,p-1} and performs an additive homomorphic computation to obtain Enc(pk₁,|C∩S|+r)=Enc(pk₁,|C∩S|)·Enc(pk₁,r);

P₁ selects random numbers r′∈{0,…,p-1} and R∈{0,…,p-1}, prepares a polynomial p(·) whose roots are r+t,r+t+1,…,r+min(|C|,|S|), and then computes the polynomial p′(·)=r′·p(·)+R, whose coefficients are a₀,a₁,…,a_{min(|C|,|S|)+1};

P₁ encrypts the coefficients of p′(·) with the additive homomorphic encryption public key pk₂, obtaining the encrypted coefficients

Enc(pk₂,a₀),Enc(pk₂,a₁),…,Enc(pk₂,a_{min(|C|,|S|)+1});

P₁ sends the encrypted coefficients of p′(·) together with Enc(pk₁,|C∩S|+r) to P₀.

3) P₀ decrypts the received ciphertext to obtain |C∩S|+r, then obliviously evaluates the polynomial p′(·) at the point |C∩S|+r and denotes the result Enc(pk₂,R′), which gives:

[formula omitted in source]

P₀ selects a random number r"∈{0,…,p-1}, performs an additive homomorphic computation to obtain Enc(pk₁,R′+r")=Enc(pk₁,R′)·Enc(pk₁,r"), and sends it to P₁ with a request to decrypt it.

4) P₁ decrypts Enc(pk₁,R′+r") and sends the result R′+r" to P₀, and P₀ computes R′ from it.

5) P₀ and P₁ execute the private equality test algorithm, in which P₀ inputs R′ and P₁ inputs R. After the algorithm finishes, P₁ outputs 0 or 1.
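The role of the masked polynomial in steps 2) to 5) can be checked with plain modular arithmetic. The sketch below drops all encryption and both interactive sub-protocols, and only verifies the algebra: evaluating p′(·)=r′·p(·)+R at the point |C∩S|+r returns R exactly when |C∩S| lies between t and min(|C|,|S|). The modulus `Q` stands in for the global prime (renamed to avoid clashing with the polynomial p), the helper names are hypothetical, and r′ is drawn from the nonzero values, a small deviation from the stated range {0,…,p-1} made so that the test is deterministic.

```python
import secrets
from typing import List, Tuple

Q = 2**127 - 1  # illustrative prime standing in for the global parameter


def poly_from_roots(roots: List[int], q: int) -> List[int]:
    """Coefficients (lowest degree first) of the monic polynomial with these roots mod q."""
    coeffs = [1]
    for root in roots:
        nxt = [0] * (len(coeffs) + 1)
        for k, c in enumerate(coeffs):
            nxt[k] = (nxt[k] - root * c) % q   # contribution of -root * c * x^k
            nxt[k + 1] = (nxt[k + 1] + c) % q  # contribution of c * x^(k+1)
        coeffs = nxt
    return coeffs


def evaluate(coeffs: List[int], x: int, q: int) -> int:
    """Horner evaluation of the polynomial at x mod q."""
    acc = 0
    for c in reversed(coeffs):
        acc = (acc * x + c) % q
    return acc


def masked_threshold_poly(r: int, t: int, size_min: int, q: int = Q) -> Tuple[List[int], int]:
    """Build p'(x) = r' * p(x) + R, where p has roots r+t, ..., r+size_min."""
    roots = [(r + k) % q for k in range(t, size_min + 1)]
    p = poly_from_roots(roots, q)
    r_prime = 1 + secrets.randbelow(q - 1)  # nonzero mask (assumption, see lead-in)
    big_r = secrets.randbelow(q)
    masked = [(r_prime * c) % q for c in p]
    masked[0] = (masked[0] + big_r) % q
    return masked, big_r


def threshold_bit(cardinality: int, t: int, size_min: int, q: int = Q) -> int:
    """Plaintext stand-in for steps 2) to 5): 1 iff t <= cardinality <= size_min."""
    r = secrets.randbelow(q)
    coeffs, big_r = masked_threshold_poly(r, t, size_min, q)
    r_eval = evaluate(coeffs, (cardinality + r) % q, q)  # the value called R'
    return 1 if r_eval == big_r else 0  # stands in for the private equality test
```

When the blinded cardinality is a root, p vanishes and the evaluation collapses to R; otherwise the nonzero factor r′·p(·) shifts the result away from R, so the equality test in step 5) returns 0.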

In this embodiment, the oblivious transfer algorithm is as follows:

The oblivious transfer (OT) algorithm is a two-party algorithm whose participants are a sender S and a receiver R. In a 1-out-of-2 oblivious transfer algorithm, the sender S inputs two messages (x₀,x₁) and the receiver R inputs a choice σ∈{0,1}. After both parties execute the algorithm, R outputs x_σ; apart from this, no additional information is leaked.

In this embodiment, the private equality test is as follows:

The private equality test (PEQT) algorithm is a two-party algorithm whose participants are a sender S and a receiver R. The input of S is x₀ and the input of R is x₁. After both parties execute the algorithm, R outputs 1 if x₀=x₁ and 0 otherwise. Apart from this, no additional information is leaked.

In this embodiment, the encrypted private set intersection cardinality is as follows:

The encrypted private set intersection cardinality (ePSI-CA) algorithm is a two-party algorithm whose participants are P₀ and P₁. C and S are sets, |C| and |S| are the sizes of C and S respectively, and (pk₁,sk₁) and (pk₂,sk₂) are the additive homomorphic encryption key pairs of P₀ and P₁ respectively. The input of P₀ is (C,|S|,pk₁,sk₁) and the input of P₁ is (S,|C|,pk₁,pk₂,sk₂). After both parties execute the protocol, P₁ outputs the encrypted number of elements in the intersection, Enc(pk₁,|C∩S|).
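The disclosure relies on additive homomorphic encryption, where multiplying two ciphertexts yields an encryption of the sum of the plaintexts, but it does not name a concrete scheme. As one possible instantiation, the sketch below uses a toy version of the Paillier cryptosystem; this choice is an assumption, and the tiny primes are for illustration only and offer no security. The function names are hypothetical, and `pow(x, -1, n)` requires Python 3.8 or later.

```python
import math
import secrets
from typing import Tuple

PublicKey = Tuple[int, int]   # (n, g)
SecretKey = Tuple[int, int]   # (lam, mu)


def keygen(p: int, q: int) -> Tuple[PublicKey, SecretKey]:
    """Toy Paillier key generation from two distinct primes of similar size."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    g = n + 1
    mu = pow(lam, -1, n)  # valid because g = n + 1 gives L(g^lam mod n^2) = lam mod n
    return (n, g), (lam, mu)


def encrypt(pk: PublicKey, m: int) -> int:
    """Enc(pk, m) = g^m * r^n mod n^2 for a random r coprime to n."""
    n, g = pk
    while True:
        r = secrets.randbelow(n - 1) + 1
        if math.gcd(r, n) == 1:
            break
    n2 = n * n
    return (pow(g, m, n2) * pow(r, n, n2)) % n2


def decrypt(pk: PublicKey, sk: SecretKey, c: int) -> int:
    """Recover m = L(c^lam mod n^2) * mu mod n, where L(u) = (u - 1) // n."""
    n, _ = pk
    lam, mu = sk
    u = pow(c, lam, n * n)
    return ((u - 1) // n * mu) % n


def add(pk: PublicKey, c1: int, c2: int) -> int:
    """Homomorphic addition: Enc(a) * Enc(b) mod n^2 decrypts to a + b mod n."""
    n, _ = pk
    return (c1 * c2) % (n * n)
```

With such a scheme, the blinding step Enc(pk₁,|C∩S|+r)=Enc(pk₁,|C∩S|)·Enc(pk₁,r) is a single call to `add`: the party holding only the public key can shift the encrypted cardinality by r without learning it.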

Embodiment 2:

Embodiment 2 of the present disclosure provides an electronic device, comprising a first terminal holding a pattern string, the length of a text string and a threshold, the first terminal communicating with a second terminal holding the text string, the length of the pattern string and the threshold;

The specific working method of the device is the same as the method provided in Embodiment 1 and is not repeated here.

Embodiment 3:

Embodiment 3 of the present disclosure provides a secure approximate pattern matching method, applied to a second terminal holding a text string, the length of a pattern string and a threshold, comprising the following steps:

The detailed method is the same as the method provided in Embodiment 1 and is not repeated here.

Embodiment 4:

Embodiment 4 of the present disclosure provides an electronic device, comprising a second terminal holding a text string, the length of a pattern string and a threshold, the second terminal communicating with a first terminal holding the pattern string, the length of the text string and the threshold;

Embodiment 5:

Embodiment 5 of the present disclosure provides a secure approximate pattern matching method, involving a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, comprising the following steps:

Embodiment 6:

Embodiment 6 of the present disclosure provides a secure approximate pattern matching system, comprising a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, the first terminal communicating with the second terminal:

The working method of the system is the same as the method provided in Embodiment 1 and is not repeated here.

Those skilled in the art will appreciate that embodiments of the present disclosure may be provided as a method, a system, or a computer program product. Accordingly, the present disclosure may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present disclosure may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage and optical storage) containing computer-usable program code.

The present disclosure is described with reference to flowcharts and/or block diagrams of methods, devices (systems) and computer program products according to embodiments of the present disclosure. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce a computer-implemented process, such that the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be carried out by a computer program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, may include the processes of the embodiments of the above methods. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like.

The above descriptions are only preferred embodiments of the present disclosure and are not intended to limit it. For those skilled in the art, the present disclosure may have various modifications and changes. Any modification, equivalent replacement or improvement made within the spirit and principles of the present disclosure shall fall within its scope of protection.

Claims

1. A secure approximate pattern matching method, characterized in that it is applied to a first terminal holding a pattern string, the length of a text string and a threshold, and comprises the following steps:

The first terminal executes a secure approximate pattern matching algorithm with a second terminal holding the text string, the length of the pattern string and the threshold; if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string;

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

The second terminal selects the corresponding random numbers according to each substring of length m of the text string t, and obtains the text set

[formula omitted in source]

where i=1,…,n-m+1;

For each j=1,…,m, the first terminal and the second terminal execute a 1-out-of-2 oblivious transfer algorithm, in which the input of the second terminal is

[formula omitted in source]

The input of the first terminal is p_j. After the algorithm finishes, the first terminal obtains the pattern set

[formula omitted in source]

For each i=1,…,n-m+1, the first terminal and the second terminal execute a Boolean-type threshold private set intersection algorithm, in which the input of the second terminal is C_i,m,m-τ and the input of the first terminal is S,m,m-τ; after the algorithm finishes, the first terminal obtains the output set b_i∈{0,1}^{n-m+1};

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

The second terminal publishes the additive homomorphic encryption public key pk₁; the first terminal and the second terminal then execute the private set intersection cardinality algorithm, and the first terminal obtains Enc(pk₁,|C∩S|);

The first terminal selects a random number r∈{0,…,p-1} and performs an additive homomorphic computation to obtain Enc(pk₁,|C∩S|+r)=Enc(pk₁,|C∩S|)·Enc(pk₁,r);

The first terminal selects random numbers r′∈{0,…,p-1} and R∈{0,…,p-1}, prepares a polynomial p(·) whose roots are r+t,r+t+1,…,r+min(|C|,|S|), and then computes the polynomial p′(·)=r′·p(·)+R, whose coefficients are a₀,a₁,…,a_{min(|C|,|S|)+1};

The first terminal encrypts the coefficients of the polynomial p′(·) with the additive homomorphic encryption public key pk₂, obtaining the encrypted coefficients Enc(pk₂,a₀),Enc(pk₂,a₁),…,Enc(pk₂,a_{min(|C|,|S|)+1});

The first terminal sends the encrypted coefficients of the polynomial p′(·) together with Enc(pk₁,|C∩S|+r) to the second terminal;

The second terminal decrypts the received ciphertext to obtain |C∩S|+r, then obliviously evaluates the polynomial p′(·) at the point |C∩S|+r and denotes the result Enc(pk₂,R′), which gives:

[formula omitted in source]

The second terminal selects a random number r"∈{0,…,p-1}, performs an additive homomorphic computation to obtain Enc(pk₁,R′+r")=Enc(pk₁,R′)·Enc(pk₁,r"), and sends it to the first terminal with a request to decrypt it;

The first terminal decrypts Enc(pk₁,R′+r") and sends the result R′+r" to the second terminal, and the second terminal computes R′ from it;

The first terminal and the second terminal execute the private equality test algorithm, in which the second terminal inputs R′ and the first terminal inputs R; after the algorithm finishes, the first terminal outputs 0 or 1;

where m is the length of the pattern string; n is the length of the text string; τ is the threshold given in the secure approximate pattern matching algorithm; C is the set held by the first terminal; S is the set held by the second terminal; the security parameter is λ, and the global parameter p is a prime whose binary length satisfies |p|>λ; |C| is the number of elements in the set C; |S| is the number of elements in the set S; t is the threshold given in the Boolean-type threshold private set intersection algorithm.

2. An electronic device, characterized by comprising a first terminal holding a pattern string, the length of a text string and a threshold, the first terminal communicating with a second terminal holding the text string, the length of the pattern string and the threshold;

The first terminal and the second terminal execute a secure approximate pattern matching algorithm; if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string;

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

where i=1,…,n-m+1;

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

The first terminal selects random numbers r′∈{0,…,p-1} and R∈{0,…,p-1}, prepares a polynomial p(·) whose roots are r+t,r+t+1,…,r+min(|C|,|S|), and then computes the polynomial p′(·)=r′·p(·)+R, whose coefficients are a₀,a₁,…,a_{min(|C|,|S|)+1};

3. A secure approximate pattern matching method, characterized in that it is applied to a second terminal holding a text string, the length of a pattern string and a threshold, and comprises the following steps:

The second terminal executes a secure approximate pattern matching algorithm with a first terminal holding the pattern string, the length of the text string and the threshold, such that, if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string;

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

where i=1,…,n-m+1;

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

4. An electronic device, characterized by comprising a second terminal holding a text string, the length of a pattern string and a threshold, the second terminal communicating with a first terminal holding the pattern string, the length of the text string and the threshold;

The first terminal and the second terminal execute a secure approximate pattern matching algorithm; if the Hamming distance between a substring of the text string and the pattern string is less than the threshold, the first terminal outputs the position of that substring in the text string;

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

where i=1,…,n-m+1;

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

The first terminal selects random numbers r′∈{0,…,p-1} and R∈{0,…,p-1}, prepares a polynomial p(·) whose roots are r+t,r+t+1,…,r+min(|C|,|S|), and then computes the polynomial p′(·)=r′·p(·)+R, whose coefficients are a₀,a₁,…,a_{min(|C|,|S|)+1};

5. A secure approximate pattern matching method, characterized in that it involves a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, and comprises the following steps:

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

where i=1,…,n-m+1;

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

6. A secure approximate pattern matching system, characterized by comprising a first terminal holding a pattern string, the length of a text string and a threshold, and a second terminal holding the text string, the length of the pattern string and the threshold, the first terminal communicating with the second terminal:

The secure approximate pattern matching algorithm comprises:

The second terminal randomly selects m pairs of random numbers, namely

[formula omitted in source]

where i=1,…,n-m+1;

If b_i=1, the first terminal outputs i;

The Boolean-type threshold private set intersection algorithm comprises:

The first terminal selects a random number r∈{0,…,p-1} and performs an additive homomorphic computation to obtain Enc(pk₁,|C∩S|+r)=Enc(pk₁,|C∩S|)·Enc(pk₁,r);