WO2008030717A1 - Encrypted data search - Google Patents
Encrypted data search Download PDFInfo
- Publication number
- WO2008030717A1 WO2008030717A1 PCT/US2007/076758 US2007076758W WO2008030717A1 WO 2008030717 A1 WO2008030717 A1 WO 2008030717A1 US 2007076758 W US2007076758 W US 2007076758W WO 2008030717 A1 WO2008030717 A1 WO 2008030717A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- item
- data
- plaintext
- indexing
- items
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/606—Protecting data by securing the transmission between two devices or processes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L9/00—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
- H04L9/32—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials
- H04L9/3236—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols including means for verifying the identity or authority of a user of the system or for message authentication, e.g. authorization, entity authentication, data integrity or data verification, non-repudiation, key authentication or verification of credentials using cryptographic hash functions
Definitions
- deterministic encryption In such database systems, an item of plaintext will always be encrypted to the same ciphertext when using the same encryption key.
- Examples of deterministic encryption include use of block ciphers in electronic codebook (ECB) mode or use of a constant initialization vector (IV). Because deterministic encryption always encrypts the same plaintext to the same ciphertext when using a given cryptographic key, data patterns may be recognizable, resulting in information leakage. This is especially a problem when data to be encrypted is too large to fit into a single block, which may be 8 or 16 bytes in length, depending on which block cipher algorithm is used.
- a search for a data item corresponding to a non- deterministically encrypted ciphertext item of an encrypted column of a database may be performed by using an indexing structure corresponding to the encrypted column of the database.
- a code may be calculated, transparently with respect to a requester, based on the data item and a cryptographic key.
- the code may be used as an index to the indexing structure, which may have entries organized according to respective codes based on corresponding data items and the cryptographic key.
- each of the entries of the indexing structure may include the respective code and data for accessing a row of a database that includes a corresponding non-deterministically encrypted ciphertext item of the encrypted column of the database.
- a search for a desired data item corresponding to a non-deterministically encrypted ciphertext item of an encrypted column of a database may be performed by accessing an indexing structure corresponding to the encrypted column of the database. Entries of the indexing structure may be organized according to plaintext data items corresponding to non-deterministically encrypted ciphertext items of the encrypted column of the database. In the indexing structure, references related to the corresponding plaintext data items may be encrypted and other information in the indexing structure may be unencrypted.
- the search may be performed by loading at least a portion of the indexing structure into a memory, accessing an entry of the indexing structure, and decrypting at least one of the references of the entry of the indexing structure. The at least one decrypted reference may be used to access a row of the database including a corresponding non- deterministically encrypted ciphertext item of the encrypted column of the database.
- FIG. 1 illustrates an exemplary operating environment consistent with the subject matter of this disclosure.
- Fig. 2 is a functional block diagram of an exemplary processing device that may be used to implement processing device 102 of Fig. 1, processing device 104 of Fig. 1, or both processing devices.
- Figs. 3A-3C illustrate exemplary indexing structures that may be employed in embodiments consistent with the subject matter of this disclosure.
- Fig. 4 is a flowchart that illustrates a method that may be performed consistent with the exemplary indexing structures of Figs. 3A-3C.
- Fig. 5 illustrates an exemplary indexing structure that may be employed in another embodiment consistent with the subject matter of this disclosure.
- Fig. 6 is a flowchart that illustrates a method that may be performed consistent with the exemplary indexing structure of Fig. 5.
- Fig. 7 is a flowchart that illustrates a method that may be performed in a third embodiment consistent with the subject matter of this disclosure.
- FIG. 1 illustrates an exemplary operating environment 100 for an embodiment consistent with subject matter of this disclosure.
- Operating environment 100 may include processing device 102, processing device 104 and network 106.
- Processing device 102 may be, for example, a server or other processing device capable of executing a database system.
- Processing device 104 may be a personal computer (PC) or other processing device capable of executing applications and communicating with processing device 102 via network 106.
- Network 106 may be a wired or wireless network and may include a number of devices connected via wired or wireless means.
- Network 104 may include only one network or a number of different networks, some of which may be networks of different types.
- processing device 104 may execute an application, which accesses information in a database of processing device 102 via - A -
- the application may create, delete, read or modify data in the database of processing device 102.
- FIG. 1 illustrates an exemplary operating environment.
- Other operating environments or variations of operating environment 100 may be used with other embodiments consistent with the subject matter of this disclosure.
- Fig. 1 illustrates processing device 102 and processing device 104 as being separate devices.
- processing devices 102 and 104 may be combined in a single processing device in one embodiment.
- the operating environment may not include network 106.
- functions or services performed by processing device 102 may be distributed across multiple processing devices which may be connected via a network, such as, for example, network 106.
- FIG. 2 is a functional block diagram which illustrates an exemplary processing device 200, which may be used to implement processing device 102, processing device 104, or both devices.
- Processing device 200 may include a bus 210, a processor 220, a memory 230, a read only memory (ROM) 240, a storage device 250, an input device 260, an output device 270, and a communication interface 280.
- Bus 210 may permit communication among components of processing device 200.
- communication interface 280 may not be included as one of the components of processing device 200.
- Processor 220 may include at least one conventional processor or microprocessor that interprets and executes instructions.
- Memory 230 may be a random access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by processor 220. Memory 230 may also store temporary variables or other intermediate information used during execution of instructions by processor 220.
- ROM 240 may include a conventional ROM device or another type of static storage device that stores static information and instructions for processor 220.
- Storage device 250 may include any type of media for storing data and/or instructions. When processing device 200 is used to implement processing device 102, storage device 250 may include one or more databases of a database system.
- Input device 260 may include one or more conventional mechanisms that permit a user to input information to processing device 200, such as, for example, a keyboard, a mouse, or other input device.
- Output device 270 may include one or more conventional mechanisms that output information to the user, including a display, a printer, or other output device.
- Communication interface 280 may include any transceiver- like mechanism that enables processing device 200 to communicate with other devices or networks. In one embodiment, communication interface 280 may include an interface to network 106.
- Processing device 200 may perform such functions in response to processor 220 executing sequences of instructions contained in a computer-readable medium, such as, for example, memory 230, or other medium. Such instructions may be read into memory 230 from another computer-readable medium, such as storage device 250, or from a separate device via communication interface 280.
- a computer-readable medium such as, for example, memory 230, or other medium.
- Such instructions may be read into memory 230 from another computer-readable medium, such as storage device 250, or from a separate device via communication interface 280.
- data may be viewed as being stored in tables.
- a row of the table may correspond to a record in a file.
- Some database systems may permit data stored in a column of a table to be encrypted.
- Such database systems may permit a search on data in the encrypted column, provided the data is deterministically encrypted. That is, a search for rows in a table having a particular plaintext value corresponding to deterministically encrypted ciphertext in an encrypted column of the database may be performed.
- deterministic encryption always encrypts plaintext items to the same corresponding ciphertext items. Thus, data patterns may be recognizable resulting in information leakage.
- Non-deterministic encryption methods such as, for example, use of block ciphers in cipher-block chaining (CBC) mode with a random initialization vector, or other non-deterministic encryption methods, may encrypt the same plaintext data items to different ciphertext data items.
- non-deterministic encryption according to use of block ciphers in CBC mode with a random initialization vector may encrypt each block of plaintext by XORing a current block of plaintext with a previous ciphertext block before encrypting the current block.
- a value of a ciphertext data item may be based not only on a corresponding plaintext data item and a cryptographic key, but may also be based on other data, such as, for example, previously encrypted blocks of data or a random initialization vector.
- Embodiments consistent with the subject matter of this disclosure relate to database systems in which searching may be performed on non-deterministically encrypted data of an encrypted column of a database.
- a code may be calculated based on a desired plaintext data item and a cryptographic key.
- the code may be a message authentication code (MAC), a Hashed Message Authentication Code (HMAC), or other code.
- the code may be used as an index to an indexing structure, which may have entries organized according to respective codes based on corresponding plaintext data items and a cryptographic key.
- the indexing structure may be a B-tree or other indexing structure, which may be used to search for one or more rows in the database having a particular plaintext data item corresponding to encrypted data of an encrypted column of the database.
- Each of the entries of the indexing structure may include an indexing value, corresponding to a code calculated based on the corresponding plaintext data item and the cryptographic key, and data for accessing a row of a database that includes a corresponding non-deterministically encrypted ciphertext item of the encrypted column of the database.
- the indexing structure may include hash buckets allocated for respective items according to a corresponding hash value.
- a hashed message authentication code may be calculated based on a respective plaintext data item and a cryptographic key.
- the hash value may be produced by hashing the calculated hashed message authentication code.
- Each item of a hash bucket may include information for obtaining a database entry including a non-deterministically encrypted data item corresponding to a respective plaintext data item.
- an indexing structure for a non-deterministically encrypted column of a database may be accessed. Each entry of the indexing structure may be organized according to plaintext data items corresponding to non- deterministically encrypted ciphertext items of the encrypted column of the database.
- Each of the entries of the indexing structure may include one or more references related to the corresponding plaintext data item.
- the one or more references related to the corresponding plaintext data item may be encrypted and other information in the indexing structure may be unencrypted.
- a search is performed, at least a portion of the indexing structure may be loaded into a memory and one of the entries of the indexing structure corresponding may be accessed.
- the one or more encrypted references of the one of the entries of the indexing structure may be decrypted and used to access a row including a corresponding non-deterministically encrypted ciphertext item of the encrypted column of the database.
- non-deterministic encryption and decryption may be performed using symmetric keys. That is, a cryptographic key may be used to non-deterministically encrypt a data item and the same cryptographic key may be used to decrypt the encrypted data item.
- non-deterministic encryption and decryption may be performed using asymmetric keys. That is, a public cryptographic key may be used to non-deterministically encrypt a data item and a private cryptographic key may be used to decrypt the data.
- Database systems typically use some type of indexing scheme for quickly searching data stored in column of a database in order to access particular records or rows.
- One well-known indexing scheme includes use of a B-tree, although other indexing schemes may also be used in other embodiments.
- a new data type which we call a duplet, may be used with the indexing scheme of the database system.
- the duplet may include paired data items.
- the duplet may include a code based on a plaintext item corresponding to a non-deterministically encrypted ciphertext item stored in an encrypted column of the database, and non- deterministically encrypted ciphertext, which may be equal to the non- deterministically encrypted ciphertext item stored in the encrypted column of the database.
- the non-deterministically encrypted ciphertext as an E-value.
- the code based on the plaintext item may be a Message Authentication Code (MAC), or other code.
- the MAC may be a Hashed Message Authentication Code (HMAC), which is a one-way hash computed using a plaintext item and a cryptographic key.
- the cryptographic key may be equivalent to a cryptographic key used to form the E-value, a second key that may be protected by the key used to form the E-value, or a completely independent key.
- Fig. 3A illustrates an exemplary B-tree which may be used as an indexing structure in embodiments consistent with the subject matter of this disclosure.
- the exemplary B-tree may include index nodes 302, 312, 320, 326, 328, 30, 332, 334, 336, 338, 340, and 342.
- Each of the index nodes may include one or more entries.
- the index nodes, which are not leaf nodes, may include one or more links to other index nodes.
- index node 302 may include a number of entries and may further include links to other index nodes, such as index nodes 312, 320, 326 and 328.
- Index node 312 may include a number of entries and may further include links to other index nodes, such as index nodes 330, 332 and 334, which in this example, may be leaf nodes.
- Index node 320 may include at least one entry and a link to index nodes 336 and 338, which in this example, may be leaf nodes.
- Index node 326 may include at least one entry and a link to index node 340, which in this example may be a leaf node.
- Index node 328 may include at least one entry and a link to index node 342, which in this example may be a leaf node.
- Fig. 3B illustrates a more detailed view of exemplary index nodes 302, 312 and 320 of Fig. 3A consistent with the subject matter of this disclosure.
- each entry in the index nodes may include a duplet.
- duplets may be used with other indexing structures in other embodiments.
- each index node may include one or more items and each of the one or more items may include a duplet.
- index node 302 may include a first item having a duplet including an index value, which may be a code such as, for example, 33567, which may be a MAC or an HMAC based on a first plaintext item, and an E-value, hdfyjd, corresponding to the first plaintext item encrypted by key kl, a second item having a duplet including an index value, which may be a code, such as, for example, 58957, which may be a MAC or an HMAC based on a second plaintext item, and E-value, olhdrs, corresponding to the second plaintext item encrypted by key kl, and a third item having a duplet including an index value, which may be a code, such as, for example, 97460, which may be a MAC or an HMAC, based on
- index node 312 may include two entries.
- a first entry of index node 312 may include a duplet having an index value, 16485, based on a fourth plaintext item and an E-value, ifjtrslkm, corresponding to the fourth plaintext item encrypted by key kl .
- a second entry of index node 312 may include a duplet having an index value, 20945, based on a fifth plaintext item and an E-value, eswgh, corresponding to the fifth plaintext item encrypted by key kl .
- Index node 320 may include one entry including a duplet.
- Index node 302 may include a link 304, which may be a link to index node 312 having entries with corresponding index values less than index value 33567 of index node 302, a link 306, which is a link to index node 320 having an entry with a corresponding index value greater than index value 33567 and less than index value 58957 of index node 302, a link 308, which may link index node 302 to index node 326 having one or more entries with respective index values greater than index value 58957 and less than index value 97460 of index node 302, and a link 310, which may link index node 302 to an index node 328 having one or more entries with respective index values greater than index value 97460 of index node 302.
- index node 312 may include a link 314 to index node 330, which may include one or more entries having index values less than index value 16485 of index node 312, a link 316 to index node 332, which may include one or more entries including index values greater than index value 16485 and less than index value to 20945 of index node 312, and a link 318 to index node 334, which may include one or more entries including index values greater than index value 20945 of index node 312.
- Index node 320 may include a link 322 to index node 336, which may include one or more entries including index values less than index value 46789 of index node 320, and a link 324 to index node 338, which may include one or more entries including index values greater than index value 46789 of index node 320.
- Each of the index node entries may include information indicating a data type of the corresponding plaintext data item (not shown) and may include a reference or pointer to corresponding non-deterministically encrypted ciphertext of an encrypted column of the database (not shown). Further, each of the index nodes may include a different number of items than as shown in the exemplary indexing structure of Fig. 3B.
- index nodes 302, 312, or 320 may have a different number of items included within the respective index nodes than as shown in Fig. 3B.
- the indexing structure of Figs. 3 A and 3B is an exemplary indexing structure.
- Fig. 3B illustrates each item of the exemplary indexing structure including an index value and an E-value
- each item of an indexing structure may include an index value, with a corresponding E-value residing in a separate data structure.
- exemplary index node 302' of Fig. 3C is similar to index node 302 of Fig. 3B.
- each of the items of index node 302' may include a first entry of a duplet, which in this example is an index value, and a reference or pointer to a corresponding E-value included in a data structure 360, which may be a table, an array, or other data structure.
- data structure 360 illustrates the E-values, corresponding to index node 302', being in consecutive locations within data structure 360, the E-values may be arranged in locations within data structure 360, which are not consecutive or contiguous.
- an indexing structure such as, for example, the indexing structure of Figs.
- each new item added to a node in the indexing structure may have a link pointing to an index node including one or more items having a respective indexing value that is less than the indexing value of the added item and a second link pointing to an index node including one or more items having a respective indexing value that is greater than the indexing value of the added item.
- processing device 102 may update at least one of the existing links of the indexing structure to point to the new index node.
- Each new item that processing device 102 may add to the indexing structure may include a respective index value and either a corresponding E-value or a reference to a corresponding E-value.
- the corresponding E-value may be stored in a separate data structure, such as, for example, a table, an array, or other data structure.
- FIG. 4 is a flowchart that illustrates an exemplary process for using an indexing structure, such as, for example, the exemplary indexing structures of Figs. 3A-3C, to search for non-deterministically encrypted data in a database in embodiments consistent with the subject matter of this disclosure.
- processing device 102 may receive a request for a desired data item that may be included in a database of processing device 102 (act 402).
- the request may be from a requester such as, for example, a user or an application of processing device 102 or from a requester such as, for example, a user or an application of another processing device, such as, for example, processing device 104, which may communicate with processing device 102 via a network, such as, for example, network 106.
- the request may be a search request or other request that includes finding a desired data item and may include a plaintext form of the desired data item. Given the desired plaintext data item, processing device 102 may calculate, transparently with respect to the requester, an indexing value, which may be a code, such as, for example, a MAC or a HMAC based on the desired plaintext data item and a cryptographic key (act 404).
- Processing device 102 may then access and search an indexing structure of the database in an attempt to locate data corresponding to the desired plaintext data item (act 406). If the indexing structure is, for example, a B-tree, processing device 102 may examine index values of duplets within index nodes of the B-tree to traverse the B-tree in the attempt to locate the desired data.
- the indexing structure is, for example, a B-tree
- processing device 102 may examine index values of duplets within index nodes of the B-tree to traverse the B-tree in the attempt to locate the desired data.
- processing device 102 may determine whether the desired item was found (act 408). If the desired item was not found, then processing device 102 may return an indication that the desired data was not found in the database (act 422).
- processing device 102 may obtain an E-value of a duplet corresponding to the indexing value calculated during act 404 (act 410). Processing device 102 may then decrypt the E-value to provide corresponding plaintext (act 412). The corresponding plaintext may then be compared with the plaintext form of the desired data item provided during act 402 (act 414). If processing device 102 determines that the compared plaintexts are equal, then the data corresponding to the found item within the indexing structure may be obtained from the database and may be returned to the requester (act 416). That is, the found item of the indexing structure may include a reference to the corresponding data stored in the database. Processing device 102 may then determine whether the found data item is unique (act 418).
- processing device 102 may determine whether the found data item is unique based on whether the found data item is a primary key in a database, based on a uniqueness indicator that may be included in the database or in an entry of an indexing structure, or based on other criteria. If processing device 102 determines that the found data item is unique in the database, then the process is completed.
- processing device 102 may search the indexing structure for a next item corresponding to the indexing value (act 420). Processing device 102 may then repeat acts 408-424. [0046] If the comparison performed during act 414 indicates that the plaintexts are not equal, then processing device 102 may determine that a hash collision occurred. That is, two different plaintext items generate the same index value when using the same cryptographic key. The possibility of such an occurrence is rare, but possible. When processing device 102 determines that a hash collision occurred, processing device 102 may employ any one of a number of well-known methods for resolving a hash collision (act 424).
- items having identical codes or indexing values may be stored in contiguous locations of a node of the indexing structure.
- processing device 102 may search the contiguous items within the node to determine whether any of the contiguous items within the node are associated with an E-value which, when decrypted, matches the plaintext of the desired item. Once the hash collision is resolved, processing device 102 may repeat acts 408-414.
- Fig. 5 illustrates another exemplary indexing structure which may be used in another embodiment consistent with the subject matter of this disclosure.
- Fig. 5 illustrates an exemplary B-tree indexing structure, although other indexing structures may be used in other embodiments.
- indexing structure 502 on the right side of Fig. 5 illustrates an index node of indexing structure 502 as it may be when it resides in memory.
- Indexing structure 502 in memory may include nodes built using plaintext items as index values. Each node may include an index value, or plaintext item, as well as other data pertaining to the plaintext item, along with other unencrypted data.
- node 502, in memory may include two items, a first item may include a respective plaintext item, plaintext- 1, as an index value and other data related to the plaintext item, and unencrypted data-1, which may be other unencrypted information of the first item.
- a second item of node 502 may include another respective plaintext item, plaintext-2, as an index value and other data related to the plaintext item, and unencrypted data-2, which may be other unencrypted information of the second item.
- the index values may be the employee names. Searching on such an indexing structure may be performed by traversing the indexing structure until the desired name is found in a node of the indexing structure or until a determination can be made that the desired name is not included in the database when the desired name is not found.
- the left side of Fig. 5 illustrates indexing structure 502 as it may be when saved in storage within the database system.
- the saved version of indexing structure 502 may include encrypted versions of all plaintext references, for example, enc-text- 1 of the first item of node 502 and enc-text-2 of the second item of node 502. That is, all plaintext references, including the index values, may be saved in encrypted form while the organization of the indexing structure remains unchanged. In other words, an order of items in index nodes and the linkages between nodes may be arranged according to the plaintext index values although all plaintext references, including the index values, may be saved in encrypted form. Further, any other information related to a plaintext item that may be used by the index, such as, for example, plaintext statistics, may also be encrypted.
- Fig. 5 illustrates an exemplary node of an indexing structure having two items. In other embodiments, more or fewer items may be stored within a node of the indexing structure.
- Fig. 6 is a flowchart that illustrates an exemplary process for using an indexing structure, such as, for example, the exemplary indexing structure of Fig. 5, to search for non-deterministically encrypted data in a database in embodiments consistent with the subject matter of this disclosure.
- processing device 102 may receive a request for a desired data item that may be included in a database of processing device 102 (act 602).
- the request may be made directly by a requester such as, for example, a user or an application, via processing device 102 or via another processing device, such as processing device 104 via a network, such as network 106.
- the request may be a search request and may include a plaintext form of the desired data item.
- processing device 102 may access an indexing structure of the database in order to perform a search for data in the database that corresponds to the desired data item (act 604). Processing device 102 may then load at least a portion of the indexing structure into dynamic storage, such as memory 230 (act 606). Processing device 102 may then decrypt encrypted references in the loaded portion of the indexing structure (act 608) and may use the loaded portion of the indexing structure to find and access one or more non-deterministically encrypted data items in the database (act 610). [0051] In one embodiment, processing device 102 may decrypt the encrypted references of the indexing structure as an index page or portion of the indexing structure is loaded into memory 230.
- searching may then be performed using the corresponding plaintext references and other information from the indexing structure.
- the plaintext references from the indexing structure may be decrypted as the search is performed, such as, for example, when a plaintext reference from the index is needed.
- the exemplary method described above, with reference to Fig. 6, may be used to search for data pertaining to a particular data item, such as, for example, an equality search, may be used to search for data pertaining to a range of data values, such as, for example, a range search, or may be used to perform a search for information that is similar to a particular data item, such as, for example, a fuzzy search.
- processing device 102 may receive a search request or other request for finding database data related to a plaintext item (act 702).
- the request may be made directly by a requester such as, for example, a user or an application, via processing device 102 or via another processing device, such as processing device 104 via a network, such as network 106.
- Processing device 102 may calculate a HMAC over the plaintext item using a cryptographic key (act 704).
- Processing device 102 may then hash the calculated HMAC to produce a hash value (act 706).
- Processing device 102 may use the hash value as an index value to a hash bucket within a hashed-based indexing structure 710 to obtain information related to the requested plaintext data item based on an indexed entry of hash-based indexing structure 710 (act 708). Because the hash value is calculated from a HMAC based on a plaintext item, hash buckets of the hash-based index may be allocated for items corresponding to plaintext items of data according to a respective HMAC of the corresponding plaintext items of data.
- hash-based indexing structure 710 may include a link to an item in a database having encrypted information related to the requested plaintext data item. The encrypted information may include non-deterministically encrypted data.
- the exemplary hashed-based indexing structure method illustrated by Fig. 7 avoids possible leakage of information that may occur as a result of an arrangement of hash buckets according to an index hash function.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Physics & Mathematics (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Bioethics (AREA)
- Databases & Information Systems (AREA)
- Computer Hardware Design (AREA)
- Data Mining & Analysis (AREA)
- Medical Informatics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Storage Device Security (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2007800328508A CN101512525B (en) | 2006-09-06 | 2007-08-24 | Encrypted data search |
KR1020097004699A KR101403745B1 (en) | 2006-09-06 | 2007-08-24 | Encrypted data search |
EP07841329.1A EP2064638B1 (en) | 2006-09-06 | 2007-08-24 | Encrypted data search |
JP2009527488A JP4810611B2 (en) | 2006-09-06 | 2007-08-24 | Search for encrypted data |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/516,267 | 2006-09-06 | ||
US11/516,267 US7689547B2 (en) | 2006-09-06 | 2006-09-06 | Encrypted data search |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2008030717A1 true WO2008030717A1 (en) | 2008-03-13 |
Family
ID=39153188
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/076758 WO2008030717A1 (en) | 2006-09-06 | 2007-08-24 | Encrypted data search |
Country Status (7)
Country | Link |
---|---|
US (1) | US7689547B2 (en) |
EP (1) | EP2064638B1 (en) |
JP (1) | JP4810611B2 (en) |
KR (1) | KR101403745B1 (en) |
CN (1) | CN101512525B (en) |
TW (1) | TWI372345B (en) |
WO (1) | WO2008030717A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010186163A (en) * | 2009-01-23 | 2010-08-26 | Nec (China) Co Ltd | Method and apparatus for k-anonymity update on encrypted inverted index table |
JP2010211786A (en) * | 2008-12-30 | 2010-09-24 | Nec (China) Co Ltd | Method and apparatus for ciphertext indexing and searching |
US10783270B2 (en) | 2018-08-30 | 2020-09-22 | Netskope, Inc. | Methods and systems for securing and retrieving sensitive data using indexable databases |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8661263B2 (en) | 2006-09-29 | 2014-02-25 | Protegrity Corporation | Meta-complete data storage |
US7809142B2 (en) * | 2007-06-19 | 2010-10-05 | International Business Machines Corporation | Data scrambling and encryption of database tables |
US10262136B1 (en) * | 2008-08-04 | 2019-04-16 | Zscaler, Inc. | Cloud-based malware detection |
US8819451B2 (en) * | 2009-05-28 | 2014-08-26 | Microsoft Corporation | Techniques for representing keywords in an encrypted search index to prevent histogram-based attacks |
JP5411034B2 (en) * | 2010-03-19 | 2014-02-12 | 株式会社日立ソリューションズ | Database encryption system and method |
US8533489B2 (en) * | 2010-09-29 | 2013-09-10 | Microsoft Corporation | Searchable symmetric encryption with dynamic updating |
WO2012081450A1 (en) * | 2010-12-13 | 2012-06-21 | 日本電気株式会社 | Encoded database management system, client and server, natural joining method and program |
EP2738689A4 (en) | 2011-07-29 | 2015-04-29 | Nec Corp | System for generating index resistant against divulging of information, index generation device, and method therefor |
US8832427B2 (en) | 2012-03-30 | 2014-09-09 | Microsoft Corporation | Range-based queries for searchable symmetric encryption |
CN104704493B (en) * | 2012-08-15 | 2019-06-07 | 维萨国际服务协会 | Searchable encrypted data |
US8943331B2 (en) * | 2012-12-28 | 2015-01-27 | Alcatel Lucent | Privacy-preserving database system |
WO2014182419A1 (en) * | 2013-05-06 | 2014-11-13 | Thomson Reuters South Asia Private Limited | Offline searching of encrypted content |
US10122714B2 (en) | 2013-08-01 | 2018-11-06 | Bitglass, Inc. | Secure user credential access system |
US9552492B2 (en) * | 2013-08-01 | 2017-01-24 | Bitglass, Inc. | Secure application access system |
US9553867B2 (en) | 2013-08-01 | 2017-01-24 | Bitglass, Inc. | Secure application access system |
US9852306B2 (en) | 2013-08-05 | 2017-12-26 | International Business Machines Corporation | Conjunctive search in encrypted data |
US9646166B2 (en) | 2013-08-05 | 2017-05-09 | International Business Machines Corporation | Masking query data access pattern in encrypted data |
CN104462990B (en) * | 2013-09-13 | 2019-02-26 | 腾讯科技(深圳)有限公司 | Character string encipher-decipher method and device |
US20170262546A1 (en) * | 2014-07-30 | 2017-09-14 | Hewlett Packard Enterprise Development Lp | Key search token for encrypted data |
WO2016043700A1 (en) | 2014-09-15 | 2016-03-24 | Demandware, Inc. | Secure storage and access to sensitive data |
US10013440B1 (en) * | 2014-10-31 | 2018-07-03 | Amazon Technologies, Inc. | Incremental out-of-place updates for index structures |
CN104572827B (en) * | 2014-12-08 | 2017-12-15 | 北京工业大学 | It is a kind of based on across plaintext and the Hybrid Search system of ciphertext |
JP6441160B2 (en) | 2015-04-27 | 2018-12-19 | 株式会社東芝 | Concealment device, decryption device, concealment method and decryption method |
US9519798B2 (en) | 2015-05-07 | 2016-12-13 | ZeroDB, Inc. | Zero-knowledge databases |
KR101703828B1 (en) * | 2015-10-15 | 2017-02-08 | 한국전자통신연구원 | Method of generating index tag for encrypted data. method of searching encrypted data using index tag and database apparatus for the same |
WO2017193108A2 (en) | 2016-05-06 | 2017-11-09 | ZeroDB, Inc. | Encryption for distributed storage and processing |
SG11201811425TA (en) * | 2016-09-22 | 2019-01-30 | Visa Int Service Ass | Techniques for in-memory key range searches |
US10482279B2 (en) * | 2016-11-08 | 2019-11-19 | Microsoft Technology Licensing, Llc | Pattern-less private data detection on data sets |
US10360390B2 (en) * | 2016-12-14 | 2019-07-23 | Sap Se | Oblivious order-preserving encryption |
EP3388969B1 (en) | 2017-04-13 | 2019-10-16 | DSwiss AG | Search system |
EP3657475B1 (en) * | 2017-09-12 | 2021-08-25 | Mitsubishi Electric Corporation | Data processing apparatus, data processing method, and data processing program |
CN110858251B (en) * | 2018-08-22 | 2020-07-21 | 阿里巴巴集团控股有限公司 | Data query method and device |
US11003783B1 (en) * | 2018-09-21 | 2021-05-11 | Amazon Technologies, Inc. | Searchable encrypted data stores |
CA3126089C (en) | 2019-03-01 | 2023-06-20 | Cyborg Inc. | System and method for statistics-based pattern searching of compressed data and encrypted data |
EP4154147A1 (en) * | 2020-06-29 | 2023-03-29 | Huawei Technologies Co., Ltd. | Data storage server and client devices for securely storing data |
CN112182616B (en) * | 2020-09-29 | 2024-05-17 | 江苏大周基业智能科技有限公司 | Method and system for controlling security of cryptographic technique of core table data |
TWI835039B (en) * | 2021-06-16 | 2024-03-11 | 威聯通科技股份有限公司 | Index node allocation method, data processing device and computer-readable medium |
US20240195610A1 (en) * | 2022-12-09 | 2024-06-13 | Yuen Ping Lee | Systems and Methods for Programmable Corporate Policies and Management Intervention |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5692124A (en) * | 1996-08-30 | 1997-11-25 | Itt Industries, Inc. | Support of limited write downs through trustworthy predictions in multilevel security of computer network communications |
US6052686A (en) * | 1997-07-11 | 2000-04-18 | At&T Corporation | Database processing using schemas |
US6233685B1 (en) * | 1997-08-29 | 2001-05-15 | Sean William Smith | Establishing and employing the provable untampered state of a device |
US6601026B2 (en) * | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US7065579B2 (en) * | 2001-01-22 | 2006-06-20 | Sun Microsystems, Inc. | System using peer discovery and peer membership protocols for accessing peer-to-peer platform resources on a network |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4827508A (en) * | 1986-10-14 | 1989-05-02 | Personal Library Software, Inc. | Database usage metering and protection system and method |
CA2000006C (en) * | 1989-01-23 | 1994-07-12 | Walter W. Chang | Combinatorial signatures for data encoding and searching |
US5293576A (en) * | 1991-11-21 | 1994-03-08 | Motorola, Inc. | Command authentication process |
US5475826A (en) | 1993-11-19 | 1995-12-12 | Fischer; Addison M. | Method for protecting a volatile file using a single hash |
JP3339688B2 (en) * | 1993-12-01 | 2002-10-28 | アールピーケイ ニュージーランド リミテッド | Non-deterministic mixture generator stream encryption system |
US5495533A (en) | 1994-04-29 | 1996-02-27 | International Business Machines Corporation | Personal key archive |
US5990810A (en) * | 1995-02-17 | 1999-11-23 | Williams; Ross Neil | Method for partitioning a block of data into subblocks and for storing and communcating such subblocks |
US5742807A (en) * | 1995-05-31 | 1998-04-21 | Xerox Corporation | Indexing system using one-way hash for document service |
US5701469A (en) * | 1995-06-07 | 1997-12-23 | Microsoft Corporation | Method and system for generating accurate search results using a content-index |
JP3647940B2 (en) * | 1995-09-22 | 2005-05-18 | 富士通株式会社 | Data management device |
US5854916A (en) * | 1995-09-28 | 1998-12-29 | Symantec Corporation | State-based cache for antivirus software |
US5864852A (en) * | 1996-04-26 | 1999-01-26 | Netscape Communications Corporation | Proxy server caching mechanism that provides a file directory structure and a mapping mechanism within the file directory structure |
JP3022405B2 (en) * | 1997-06-03 | 2000-03-21 | 日本電気株式会社 | Image memory controller |
US6012057A (en) * | 1997-07-30 | 2000-01-04 | Quarterdeck Corporation | High speed data searching for information in a computer system |
JP3056704B2 (en) * | 1997-08-25 | 2000-06-26 | 三菱電機株式会社 | Data management device |
JPH11143780A (en) * | 1997-11-05 | 1999-05-28 | Hitachi Ltd | Method and device for managing secret information in database |
US6446052B1 (en) * | 1997-11-19 | 2002-09-03 | Rsa Security Inc. | Digital coin tracing using trustee tokens |
JP3849279B2 (en) * | 1998-01-23 | 2006-11-22 | 富士ゼロックス株式会社 | Index creation method and search method |
JP3457184B2 (en) * | 1998-06-25 | 2003-10-14 | シャープ株式会社 | Search device and medium storing control program therefor |
US7152165B1 (en) | 1999-07-16 | 2006-12-19 | Intertrust Technologies Corp. | Trusted storage systems and methods |
US6738766B2 (en) * | 2000-02-02 | 2004-05-18 | Doongo Technologies, Inc. | Apparatus and methods for providing personalized application search results for wireless devices based on user profiles |
US7412462B2 (en) * | 2000-02-18 | 2008-08-12 | Burnside Acquisition, Llc | Data repository and method for promoting network storage of data |
US7043641B1 (en) * | 2000-03-08 | 2006-05-09 | Igt | Encryption in a secure computerized gaming system |
US6968456B1 (en) | 2000-08-08 | 2005-11-22 | Novell, Inc. | Method and system for providing a tamper-proof storage of an audit trail in a database |
US7362868B2 (en) | 2000-10-20 | 2008-04-22 | Eruces, Inc. | Hidden link dynamic key manager for use in computer systems with database structure for storage of encrypted data and method for storage and retrieval of encrypted data |
US6928428B1 (en) * | 2000-11-27 | 2005-08-09 | Microsoft Corporation | Distributed confidential contextual querying |
TW561358B (en) * | 2001-01-11 | 2003-11-11 | Force Corp Z | File switch and switched file system |
US7360075B2 (en) * | 2001-02-12 | 2008-04-15 | Aventail Corporation, A Wholly Owned Subsidiary Of Sonicwall, Inc. | Method and apparatus for providing secure streaming data transmission facilities using unreliable protocols |
US7062490B2 (en) * | 2001-03-26 | 2006-06-13 | Microsoft Corporation | Serverless distributed file system |
GB2377514B (en) * | 2001-07-05 | 2005-04-27 | Hewlett Packard Co | Document encryption |
US7266699B2 (en) | 2001-08-30 | 2007-09-04 | Application Security, Inc. | Cryptographic infrastructure for encrypting a database |
US7269729B2 (en) | 2001-12-28 | 2007-09-11 | International Business Machines Corporation | Relational database management encryption system |
US20030159054A1 (en) * | 2002-02-19 | 2003-08-21 | Minebea Co. | Reconfigurable secure input device |
US7287033B2 (en) * | 2002-03-06 | 2007-10-23 | Ori Software Development, Ltd. | Efficient traversals over hierarchical data and indexing semistructured data |
JP4077329B2 (en) * | 2003-01-31 | 2008-04-16 | 株式会社東芝 | Transaction processing system, parallel control method, and program |
US20030177115A1 (en) * | 2003-02-21 | 2003-09-18 | Stern Yonatan P. | System and method for automatic preparation and searching of scanned documents |
US20050004924A1 (en) * | 2003-04-29 | 2005-01-06 | Adrian Baldwin | Control of access to databases |
US10339336B2 (en) | 2003-06-11 | 2019-07-02 | Oracle International Corporation | Method and apparatus for encrypting database columns |
US7743069B2 (en) | 2004-09-03 | 2010-06-22 | Sybase, Inc. | Database system providing SQL extensions for automated encryption and decryption of column data |
US7571490B2 (en) | 2004-11-01 | 2009-08-04 | Oracle International Corporation | Method and apparatus for protecting data from unauthorized modification |
-
2006
- 2006-09-06 US US11/516,267 patent/US7689547B2/en active Active
-
2007
- 2007-08-23 TW TW096131311A patent/TWI372345B/en active
- 2007-08-24 EP EP07841329.1A patent/EP2064638B1/en active Active
- 2007-08-24 WO PCT/US2007/076758 patent/WO2008030717A1/en active Application Filing
- 2007-08-24 KR KR1020097004699A patent/KR101403745B1/en active IP Right Grant
- 2007-08-24 JP JP2009527488A patent/JP4810611B2/en not_active Expired - Fee Related
- 2007-08-24 CN CN2007800328508A patent/CN101512525B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5692124A (en) * | 1996-08-30 | 1997-11-25 | Itt Industries, Inc. | Support of limited write downs through trustworthy predictions in multilevel security of computer network communications |
US6052686A (en) * | 1997-07-11 | 2000-04-18 | At&T Corporation | Database processing using schemas |
US6233685B1 (en) * | 1997-08-29 | 2001-05-15 | Sean William Smith | Establishing and employing the provable untampered state of a device |
US6601026B2 (en) * | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US7065579B2 (en) * | 2001-01-22 | 2006-06-20 | Sun Microsystems, Inc. | System using peer discovery and peer membership protocols for accessing peer-to-peer platform resources on a network |
Non-Patent Citations (3)
Title |
---|
A . SCHNEIER: "Cryptographic protection of databases", APPLIED CRYPTOGRAPHY, vol. XX, XX, 1 January 1996 (1996-01-01), pages 73 - 74, XP002958299 |
DAMANI ERNESTO ET AL.: "Balancing confidentiality and efficiency in untrusted relational DBMSs", PROCEEDINGS OF THE 10TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY; [ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY], WASHINGTON D.C., USA, 1 January 2003 (2003-01-01), pages 93 - 102 |
See also references of EP2064638A4 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010211786A (en) * | 2008-12-30 | 2010-09-24 | Nec (China) Co Ltd | Method and apparatus for ciphertext indexing and searching |
JP2010186163A (en) * | 2009-01-23 | 2010-08-26 | Nec (China) Co Ltd | Method and apparatus for k-anonymity update on encrypted inverted index table |
US10783270B2 (en) | 2018-08-30 | 2020-09-22 | Netskope, Inc. | Methods and systems for securing and retrieving sensitive data using indexable databases |
US11620402B2 (en) | 2018-08-30 | 2023-04-04 | Netskope, Inc. | Methods and systems for securing and retrieving sensitive data using indexable databases |
Also Published As
Publication number | Publication date |
---|---|
TWI372345B (en) | 2012-09-11 |
US20080059414A1 (en) | 2008-03-06 |
EP2064638A1 (en) | 2009-06-03 |
KR20090048623A (en) | 2009-05-14 |
US7689547B2 (en) | 2010-03-30 |
EP2064638A4 (en) | 2016-05-04 |
CN101512525A (en) | 2009-08-19 |
KR101403745B1 (en) | 2014-06-03 |
EP2064638B1 (en) | 2019-03-27 |
CN101512525B (en) | 2012-10-03 |
TW200817949A (en) | 2008-04-16 |
JP2010503118A (en) | 2010-01-28 |
JP4810611B2 (en) | 2011-11-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7689547B2 (en) | Encrypted data search | |
US20080097954A1 (en) | Ranged lookups | |
Iyer et al. | A framework for efficient storage security in RDBMS | |
US7519835B2 (en) | Encrypted table indexes and searching encrypted tables | |
US8639947B2 (en) | Structure preserving database encryption method and system | |
US20160247150A1 (en) | Format-preserving cryptographic systems | |
US9934388B2 (en) | Method and system for database encryption | |
JP5997851B2 (en) | Privacy protection database system | |
US20090022321A1 (en) | Personal information management system, personal information management program, and personal information protecting method | |
Liu | Securing outsourced databases in the cloud | |
JP2006189925A (en) | Private information management system, private information management program, and private information protection method | |
US20200210595A1 (en) | CryptoJSON Indexed Search Systems and Methods | |
KR100910303B1 (en) | Data encryption and decryption apparatus using variable code table and method thereof | |
Gabel et al. | Secure database outsourcing to the cloud: Side-channels, counter-measures and trusted execution | |
Dowsley et al. | A database adapter for secure outsourcing | |
EP4137978A1 (en) | Enhanced data security through combination of encryption and vertical fragmentation of tabular data | |
Wang et al. | Secure dynamic SSE via access indistinguishable storage | |
CN116383849A (en) | Method and device for indexing secret state data | |
Iyer et al. | A Framework for Efficient Storage Security in | |
CN114647866A (en) | Data encryption and encrypted data query method and system | |
Jang et al. | An effective queries execution algorithm on the encrypted database | |
Waisenberg | SPDE-A Structure Preserving Database Encryption Scheme | |
Al Hanjouri | Ayman M. Al Derawi |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200780032850.8 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07841329 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1170/CHENP/2009 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020097004699 Country of ref document: KR |
|
ENP | Entry into the national phase |
Ref document number: 2009527488 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007841329 Country of ref document: EP |