CN102591947B - Fast and low-RAM-footprint indexing for data deduplication - Google Patents
Fast and low-RAM-footprint indexing for data deduplication
- Publication number
- CN102591947B (application number CN201110445284.1A)
- Authority
- CN
- China
- Prior art keywords
- hashed value
- index
- compact
- hash
- entry
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0862—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with prefetch
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0893—Caches characterised by their organisation or structure
- G06F12/0897—Caches characterised by their organisation or structure with two or more cache hierarchy levels
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/13—File access structures, e.g. distributed indices
- G06F16/137—Hash-based
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1748—De-duplication implemented within the file system, e.g. based on file segments
- G06F16/1752—De-duplication implemented within the file system, e.g. based on file segments based on file chunks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0866—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches for peripheral storage systems, e.g. disk cache
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/10—Providing a specific technical effect
- G06F2212/1016—Performance improvement
- G06F2212/1024—Latency reduction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/46—Caching storage objects of specific type in disk cache
- G06F2212/463—File
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2212/00—Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
- G06F2212/46—Caching storage objects of specific type in disk cache
- G06F2212/466—Metadata, control data
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Claims (11)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/979,669 | 2010-12-28 | | |
US12/979,669 (US8935487B2) | 2010-05-05 | 2010-12-28 | Fast and low-RAM-footprint indexing for data deduplication |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102591947A (zh) | 2012-07-18 |
CN102591947B (zh) | 2016-06-01 |
Family
ID=46383826
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110445284.1A (CN102591947B, zh; Active) | Fast and low-RAM-footprint indexing for data deduplication | 2010-12-28 | 2011-12-27 |
Country Status (6)
Country | Link |
---|---|
US (1) | US8935487B2 (zh) |
EP (1) | EP2659378B1 (zh) |
CN (1) | CN102591947B (zh) |
ES (1) | ES2626026T3 (zh) |
HK (1) | HK1173520A1 (zh) |
WO (1) | WO2012092213A2 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110324381A (zh) * | 2018-03-30 | 2019-10-11 | 北京忆芯科技有限公司 | KV storage device in cloud computing and fog computing systems |
Families Citing this family (141)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9849372B2 (en) | 2012-09-28 | 2017-12-26 | Sony Interactive Entertainment Inc. | Method and apparatus for improving efficiency without increasing latency in emulation of a legacy application title |
US9483484B1 (en) * | 2011-05-05 | 2016-11-01 | Veritas Technologies Llc | Techniques for deduplicated data access statistics management |
US8732401B2 (en) | 2011-07-07 | 2014-05-20 | Atlantis Computing, Inc. | Method and apparatus for cache replacement using a catalog |
US8914381B2 (en) * | 2012-02-16 | 2014-12-16 | Apple Inc. | Correlation filter |
US20130219116A1 (en) * | 2012-02-16 | 2013-08-22 | Wenguang Wang | Data migration for composite non-volatile storage device |
US9032183B2 (en) * | 2012-02-24 | 2015-05-12 | Simplivity Corp. | Method and apparatus for content derived data placement in memory |
US8682869B2 (en) * | 2012-04-05 | 2014-03-25 | International Business Machines Corporation | Increased in-line deduplication efficiency |
US8688652B2 (en) * | 2012-04-05 | 2014-04-01 | International Business Machines Corporation | Increased in-line deduplication efficiency |
US8762399B2 (en) * | 2012-05-20 | 2014-06-24 | International Business Machines Corporation | Hash collision reduction system |
US10135462B1 (en) | 2012-06-13 | 2018-11-20 | EMC IP Holding Company LLC | Deduplication using sub-chunk fingerprints |
US8972672B1 (en) | 2012-06-13 | 2015-03-03 | Emc Corporation | Method for cleaning a delta storage system |
US9116902B1 (en) | 2012-06-13 | 2015-08-25 | Emc Corporation | Preferential selection of candidates for delta compression |
US9026740B1 (en) * | 2012-06-13 | 2015-05-05 | Emc Corporation | Prefetch data needed in the near future for delta compression |
US9141301B1 (en) | 2012-06-13 | 2015-09-22 | Emc Corporation | Method for cleaning a delta storage system |
US9400610B1 (en) | 2012-06-13 | 2016-07-26 | Emc Corporation | Method for cleaning a delta storage system |
US8712978B1 (en) | 2012-06-13 | 2014-04-29 | Emc Corporation | Preferential selection of candidates for delta compression |
US8918390B1 (en) | 2012-06-13 | 2014-12-23 | Emc Corporation | Preferential selection of candidates for delta compression |
US8880476B2 (en) | 2012-06-28 | 2014-11-04 | International Business Machines Corporation | Low-overhead enhancement of reliability of journaled file system using solid state storage and de-duplication |
US9694276B2 (en) | 2012-06-29 | 2017-07-04 | Sony Interactive Entertainment Inc. | Pre-loading translated code in cloud based emulated applications |
US9248374B2 (en) | 2012-06-29 | 2016-02-02 | Sony Computer Entertainment Inc. | Replay and resumption of suspended game |
US9717989B2 (en) | 2012-06-29 | 2017-08-01 | Sony Interactive Entertainment Inc. | Adding triggers to cloud-based emulated games |
US9656163B2 (en) | 2012-06-29 | 2017-05-23 | Sony Interactive Entertainment Inc. | Haptic enhancements for emulated video game not originally designed with haptic capabilities |
US9925468B2 (en) | 2012-06-29 | 2018-03-27 | Sony Interactive Entertainment Inc. | Suspending state of cloud-based legacy applications |
US9707476B2 (en) | 2012-09-28 | 2017-07-18 | Sony Interactive Entertainment Inc. | Method for creating a mini-game |
US11013993B2 (en) | 2012-09-28 | 2021-05-25 | Sony Interactive Entertainment Inc. | Pre-loading translated code in cloud based emulated applications |
US20140092087A1 (en) | 2012-09-28 | 2014-04-03 | Takayuki Kazama | Adaptive load balancing in software emulation of gpu hardware |
US9779027B2 (en) * | 2012-10-18 | 2017-10-03 | Oracle International Corporation | Apparatus, system and method for managing a level-two cache of a storage appliance |
US9772949B2 (en) * | 2012-10-18 | 2017-09-26 | Oracle International Corporation | Apparatus, system and method for providing a persistent level-two cache |
US20140115246A1 (en) * | 2012-10-19 | 2014-04-24 | Oracle International Corporation | Apparatus, system and method for managing empty blocks in a cache |
CN102982122A (zh) * | 2012-11-13 | 2013-03-20 | 浪潮电子信息产业股份有限公司 | Data deduplication method suitable for mass storage systems |
US9277010B2 (en) | 2012-12-21 | 2016-03-01 | Atlantis Computing, Inc. | Systems and apparatuses for aggregating nodes to form an aggregated virtual storage for a virtualized desktop environment |
US9069472B2 (en) | 2012-12-21 | 2015-06-30 | Atlantis Computing, Inc. | Method for dispersing and collating I/O's from virtual machines for parallelization of I/O access and redundancy of storing virtual machine data |
US9699231B2 (en) * | 2012-12-27 | 2017-07-04 | Akamai Technologies, Inc. | Stream-based data deduplication using directed cyclic graphs to facilitate on-the-wire compression |
US9420058B2 (en) | 2012-12-27 | 2016-08-16 | Akamai Technologies, Inc. | Stream-based data deduplication with peer node prediction |
US9223840B2 (en) | 2012-12-31 | 2015-12-29 | Futurewei Technologies, Inc. | Fast object fingerprints |
US9612955B2 (en) * | 2013-01-09 | 2017-04-04 | Wisconsin Alumni Research Foundation | High-performance indexing for data-intensive systems |
US9372865B2 (en) * | 2013-02-12 | 2016-06-21 | Atlantis Computing, Inc. | Deduplication metadata access in deduplication file system |
US9471590B2 (en) | 2013-02-12 | 2016-10-18 | Atlantis Computing, Inc. | Method and apparatus for replicating virtual machine images using deduplication metadata |
US9250946B2 (en) | 2013-02-12 | 2016-02-02 | Atlantis Computing, Inc. | Efficient provisioning of cloned virtual machine images using deduplication metadata |
US9189409B2 (en) * | 2013-02-19 | 2015-11-17 | Avago Technologies General Ip (Singapore) Pte. Ltd. | Reducing writes to solid state drive cache memories of storage controllers |
CN104052492B (zh) * | 2013-03-15 | 2019-05-14 | 索尼电脑娱乐公司 | Compression of state information for data transfer over cloud-based networks |
US9258012B2 (en) * | 2013-03-15 | 2016-02-09 | Sony Computer Entertainment Inc. | Compression of state information for data transfer over cloud-based networks |
US9471500B2 (en) * | 2013-04-12 | 2016-10-18 | Nec Corporation | Bucketized multi-index low-memory data structures |
EP2997496B1 (en) | 2013-05-16 | 2022-01-19 | Hewlett Packard Enterprise Development LP | Selecting a store for deduplicated data |
WO2014185918A1 (en) * | 2013-05-16 | 2014-11-20 | Hewlett-Packard Development Company, L.P. | Selecting a store for deduplicated data |
US9519591B2 (en) * | 2013-06-22 | 2016-12-13 | Microsoft Technology Licensing, Llc | Latch-free, log-structured storage for multiple access methods |
US20150019815A1 (en) * | 2013-07-15 | 2015-01-15 | International Business Machines Corporation | Utilizing global digests caching in data deduplication of workloads |
US10229132B2 (en) | 2013-07-15 | 2019-03-12 | International Business Machines Corporation | Optimizing digest based data matching in similarity based deduplication |
US10789213B2 (en) | 2013-07-15 | 2020-09-29 | International Business Machines Corporation | Calculation of digest segmentations for input data using similar data in a data deduplication system |
US9891857B2 (en) | 2013-07-15 | 2018-02-13 | International Business Machines Corporation | Utilizing global digests caching in similarity based data deduplication |
US10296597B2 (en) * | 2013-07-15 | 2019-05-21 | International Business Machines Corporation | Read ahead of digests in similarity based data deduplicaton |
US10296598B2 (en) * | 2013-07-15 | 2019-05-21 | International Business Machines Corporation | Digest based data matching in similarity based deduplication |
US9892048B2 (en) * | 2013-07-15 | 2018-02-13 | International Business Machines Corporation | Tuning global digests caching in a data deduplication system |
US9922042B2 (en) | 2013-07-15 | 2018-03-20 | International Business Machines Corporation | Producing alternative segmentations of data into blocks in a data deduplication system |
US9836474B2 (en) | 2013-07-15 | 2017-12-05 | International Business Machines Corporation | Data structures for digests matching in a data deduplication system |
US10339109B2 (en) * | 2013-07-15 | 2019-07-02 | International Business Machines Corporation | Optimizing hash table structure for digest matching in a data deduplication system |
US9892127B2 (en) * | 2013-07-15 | 2018-02-13 | International Business Machines Corporation | Global digests caching in a data deduplication system |
US9594766B2 (en) | 2013-07-15 | 2017-03-14 | International Business Machines Corporation | Reducing activation of similarity search in a data deduplication system |
CN104951244B (zh) * | 2014-03-31 | 2018-04-27 | 伊姆西公司 | Method and apparatus for accessing data |
US9361032B2 (en) | 2014-05-14 | 2016-06-07 | International Business Machines Corporation | Management of server cache storage space |
CN104156284A (zh) * | 2014-08-27 | 2014-11-19 | 小米科技有限责任公司 | File backup method and apparatus |
KR20160042224A (ko) | 2014-10-07 | 2016-04-19 | 에스케이하이닉스 주식회사 | Data storage device and operating method thereof |
US9678827B2 (en) * | 2014-10-07 | 2017-06-13 | SK Hynix Inc. | Access counts for performing data inspection operations in data storage device |
US11943142B2 (en) | 2014-11-10 | 2024-03-26 | Marvell Asia Pte, LTD | Hybrid wildcard match table |
US11218410B2 (en) | 2014-11-10 | 2022-01-04 | Marvell Asia Pte, Ltd. | Hybrid wildcard match table |
US10116564B2 (en) * | 2014-11-10 | 2018-10-30 | Cavium, Inc. | Hybrid wildcard match table |
CN104699815A (zh) * | 2015-03-24 | 2015-06-10 | 北京嘀嘀无限科技发展有限公司 | Data processing method and system |
US9772824B2 (en) | 2015-03-25 | 2017-09-26 | International Business Machines Corporation | Program structure-based blocking |
US9916458B2 (en) * | 2015-03-31 | 2018-03-13 | EMC IP Holding Company LLC | Secure cloud-based storage of data shared across file system objects and clients |
US10191914B2 (en) | 2015-03-31 | 2019-01-29 | EMC IP Holding Company LLC | De-duplicating distributed file system using cloud-based object store |
US9824092B2 (en) * | 2015-06-16 | 2017-11-21 | Microsoft Technology Licensing, Llc | File storage system including tiers |
US10255287B2 (en) * | 2015-07-31 | 2019-04-09 | Hiveio Inc. | Method and apparatus for on-disk deduplication metadata for a deduplication file system |
US9665287B2 (en) | 2015-09-18 | 2017-05-30 | Alibaba Group Holding Limited | Data deduplication using a solid state drive controller |
JP6067819B1 (ja) * | 2015-10-21 | 2017-01-25 | 株式会社東芝 | Tiered storage system, storage controller, and method for deduplication and storage tiering |
US9678977B1 (en) | 2015-11-25 | 2017-06-13 | International Business Machines Corporation | Similarity based deduplication of snapshots data |
US9646043B1 (en) | 2015-11-25 | 2017-05-09 | International Business Machines Corporation | Combining data matches from multiple sources in a deduplication storage system |
US10031937B2 (en) | 2015-11-25 | 2018-07-24 | International Business Machines Corporation | Similarity based data deduplication of initial snapshots of data sets |
US9703642B2 (en) | 2015-11-25 | 2017-07-11 | International Business Machines Corporation | Processing of tracked blocks in similarity based deduplication of snapshots data |
US9984123B2 (en) | 2015-11-25 | 2018-05-29 | International Business Machines Corporation | Reducing resource consumption of a similarity index in data deduplication |
US9703643B2 (en) | 2015-11-25 | 2017-07-11 | International Business Machines Corporation | Calculation of representative values for similarity units in deduplication of snapshots data |
US20170168944A1 (en) * | 2015-12-15 | 2017-06-15 | Facebook, Inc. | Block cache eviction |
US10185666B2 (en) | 2015-12-15 | 2019-01-22 | Facebook, Inc. | Item-wise simulation in a block cache where data eviction places data into comparable score in comparable section in the block cache |
CN105550345B (zh) * | 2015-12-25 | 2019-03-26 | 百度在线网络技术(北京)有限公司 | File operation method and apparatus |
US10839308B2 (en) * | 2015-12-28 | 2020-11-17 | International Business Machines Corporation | Categorizing log records at run-time |
US10222987B2 (en) * | 2016-02-11 | 2019-03-05 | Dell Products L.P. | Data deduplication with augmented cuckoo filters |
US10180992B2 (en) * | 2016-03-01 | 2019-01-15 | Microsoft Technology Licensing, Llc | Atomic updating of graph database index structures |
US10437521B2 (en) * | 2016-03-25 | 2019-10-08 | Netapp, Inc. | Consistent method of indexing file system information |
US9940060B1 (en) * | 2016-05-02 | 2018-04-10 | Pure Storage, Inc. | Memory use and eviction in a deduplication storage system |
US10628305B2 (en) * | 2016-05-13 | 2020-04-21 | International Business Machines Corporation | Determining a data layout in a log structured storage system |
US10788988B1 (en) * | 2016-05-24 | 2020-09-29 | Violin Systems Llc | Controlling block duplicates |
US10037164B1 (en) | 2016-06-29 | 2018-07-31 | EMC IP Holding Company LLC | Flash interface for processing datasets |
US10055351B1 (en) * | 2016-06-29 | 2018-08-21 | EMC IP Holding Company LLC | Low-overhead index for a flash cache |
US10146438B1 (en) | 2016-06-29 | 2018-12-04 | EMC IP Holding Company LLC | Additive library for data structures in a flash memory |
US10331561B1 (en) | 2016-06-29 | 2019-06-25 | Emc Corporation | Systems and methods for rebuilding a cache index |
US10261704B1 (en) | 2016-06-29 | 2019-04-16 | EMC IP Holding Company LLC | Linked lists in flash memory |
US10089025B1 (en) | 2016-06-29 | 2018-10-02 | EMC IP Holding Company LLC | Bloom filters in a flash memory |
SG11201811423SA (en) * | 2016-09-22 | 2019-01-30 | Visa Int Service Ass | Techniques for in-memory data searching |
US10754859B2 (en) | 2016-10-28 | 2020-08-25 | Microsoft Technology Licensing, Llc | Encoding edges in graph databases |
KR102306672B1 (ko) * | 2016-11-23 | 2021-09-29 | 삼성전자주식회사 | Storage system performing data deduplication, method of operating storage system, and method of operating data processing system |
US11644992B2 (en) * | 2016-11-23 | 2023-05-09 | Samsung Electronics Co., Ltd. | Storage system performing data deduplication, method of operating storage system, and method of operating data processing system |
US10209892B2 (en) | 2016-11-28 | 2019-02-19 | Hewlett Packard Enterprise Development Lp | Storage of format-aware filter format tracking states |
CN109716658B (zh) | 2016-12-15 | 2021-08-20 | 华为技术有限公司 | Similarity-based data deduplication method and system |
CN109937412A (zh) * | 2016-12-27 | 2019-06-25 | 日彩电子科技(深圳)有限公司 | Data routing method applied to data deduplication |
US10831370B1 (en) * | 2016-12-30 | 2020-11-10 | EMC IP Holding Company LLC | Deduplicated and compressed non-volatile memory cache |
US10372620B2 (en) * | 2016-12-30 | 2019-08-06 | Intel Corporation | Devices, systems, and methods having high data deduplication and low read latencies |
US10795859B1 (en) | 2017-04-13 | 2020-10-06 | EMC IP Holding Company LLC | Micro-service based deduplication |
US10795860B1 (en) | 2017-04-13 | 2020-10-06 | EMC IP Holding Company LLC | WAN optimized micro-service based deduplication |
US11010300B2 (en) * | 2017-05-04 | 2021-05-18 | Hewlett Packard Enterprise Development Lp | Optimized record lookups |
WO2019000355A1 (en) * | 2017-06-30 | 2019-01-03 | Intel Corporation | COMPRESSED KEY LOG STRUCTURE |
US10628492B2 (en) | 2017-07-20 | 2020-04-21 | Microsoft Technology Licensing, Llc | Distributed graph database writes |
US11461269B2 (en) | 2017-07-21 | 2022-10-04 | EMC IP Holding Company | Metadata separated container format |
US10860212B1 (en) | 2017-07-21 | 2020-12-08 | EMC IP Holding Company LLC | Method or an apparatus to move perfect de-duplicated unique data from a source to destination storage tier |
US10949088B1 (en) | 2017-07-21 | 2021-03-16 | EMC IP Holding Company LLC | Method or an apparatus for having perfect deduplication, adapted for saving space in a deduplication file system |
US10936543B1 (en) | 2017-07-21 | 2021-03-02 | EMC IP Holding Company LLC | Metadata protected sparse block set for SSD cache space management |
US11113153B2 (en) | 2017-07-27 | 2021-09-07 | EMC IP Holding Company LLC | Method and system for sharing pre-calculated fingerprints and data chunks amongst storage systems on a cloud local area network |
US20190034282A1 (en) * | 2017-07-28 | 2019-01-31 | EMC IP Holding Company LLC | Offline repopulation of cache |
US10929382B1 (en) | 2017-07-31 | 2021-02-23 | EMC IP Holding Company LLC | Method and system to verify integrity of a portion of replicated data |
US11093453B1 (en) | 2017-08-31 | 2021-08-17 | EMC IP Holding Company LLC | System and method for asynchronous cleaning of data objects on cloud partition in a file system with deduplication |
US10592149B1 (en) * | 2017-10-06 | 2020-03-17 | EMC IP Holding Company LLC | Dynamic de-duplication methodologies for efficient resource utilization on de-duplication system |
US11243703B2 (en) | 2018-04-27 | 2022-02-08 | Hewlett Packard Enterprise Development Lp | Expandable index with pages to store object records |
US10970254B2 (en) | 2018-05-02 | 2021-04-06 | International Business Machines Corporation | Utilization of tail portions of a fixed size block in a deduplication environment by deduplication chunk virtualization |
CN109189995B (zh) * | 2018-07-16 | 2021-09-21 | 哈尔滨理工大学 | MPI-based method for eliminating data redundancy in cloud storage |
CN111124939A (zh) * | 2018-10-31 | 2020-05-08 | 深信服科技股份有限公司 | Data compression method and system based on an all-flash array |
US11074225B2 (en) * | 2018-12-21 | 2021-07-27 | Vmware, Inc. | Synchronization of index copies in an LSM tree file system |
US11372823B2 (en) * | 2019-02-06 | 2022-06-28 | President And Fellows Of Harvard College | File management with log-structured merge bush |
US11093176B2 (en) * | 2019-04-26 | 2021-08-17 | EMC IP Holding Company LLC | FaaS-based global object compression |
US11567995B2 (en) | 2019-07-26 | 2023-01-31 | Microsoft Technology Licensing, Llc | Branch threading in graph databases |
US11126401B2 (en) | 2019-09-18 | 2021-09-21 | Bank Of America Corporation | Pluggable sorting for distributed databases |
US11016978B2 (en) | 2019-09-18 | 2021-05-25 | Bank Of America Corporation | Joiner for distributed databases |
US11429573B2 (en) * | 2019-10-16 | 2022-08-30 | Dell Products L.P. | Data deduplication system |
US11593327B2 (en) * | 2020-03-13 | 2023-02-28 | EMC IP Holding Company LLC | Segmented index for data deduplication |
CN111382162B (zh) * | 2020-04-02 | 2023-09-05 | 安徽睿极智能科技有限公司 | Structured storage medium based on AI data and method for reading and writing the medium |
CN112256650B (zh) * | 2020-10-20 | 2024-05-31 | 广州市百果园网络科技有限公司 | Storage space management method, apparatus, device, and storage medium |
CN113259166B (zh) * | 2021-05-27 | 2021-10-01 | 长扬科技(北京)有限公司 | Log alarm processing method and apparatus |
WO2022262990A1 (en) * | 2021-06-18 | 2022-12-22 | Huawei Technologies Co., Ltd. | Method and system for indexing data item in data storage system and data indexing module |
US12066902B2 (en) * | 2021-11-17 | 2024-08-20 | Coinbase Il Rd Ltd. | System and method for database recovery |
US20230409532A1 (en) * | 2022-06-21 | 2023-12-21 | Western Digital Technologies, Inc. | Deduplication for data transfers to portable storage devices |
US12039180B2 (en) | 2022-09-14 | 2024-07-16 | Hewlett Packard Enterprise Development Lp | Temporary sparse index for a deduplication storage system |
US11797508B1 (en) * | 2023-06-02 | 2023-10-24 | Black Cape Inc. | Systems and methods for geospatial correlation |
US12117984B1 (en) * | 2023-06-02 | 2024-10-15 | Black Cape Inc. | Systems and methods for event tracking |
CN116991479B (zh) * | 2023-09-28 | 2023-12-12 | 中国人民解放军国防科技大学 | Look-ahead execution and bypass error correction method and apparatus for a very long instruction word cache tag body |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080005141A1 (en) * | 2006-06-29 | 2008-01-03 | Ling Zheng | System and method for retrieving and using block fingerprints for data deduplication |
US20100250858A1 (en) * | 2009-03-31 | 2010-09-30 | Symantec Corporation | Systems and Methods for Controlling Initialization of a Fingerprint Cache for Data Deduplication |
Family Cites Families (72)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5634125A (en) | 1993-09-02 | 1997-05-27 | International Business Machines Corporation | Selecting buckets for redistributing data between nodes in a parallel database in the quiescent mode |
JP4303803B2 (ja) | 1998-04-22 | 2009-07-29 | 株式会社東芝 | Cache flush device |
US6412080B1 (en) | 1999-02-23 | 2002-06-25 | Microsoft Corporation | Lightweight persistent storage system for flash memory devices |
US6453404B1 (en) | 1999-05-27 | 2002-09-17 | Microsoft Corporation | Distributed data cache with memory allocation model |
GB2354104A (en) | 1999-09-08 | 2001-03-14 | Sony Uk Ltd | An editing method and system |
US6976229B1 (en) | 1999-12-16 | 2005-12-13 | Ricoh Co., Ltd. | Method and apparatus for storytelling with digital photographs |
US6687815B1 (en) | 2000-02-01 | 2004-02-03 | Sun Microsystems, Inc. | Method and apparatus for storing non-volatile configuration information |
US7269608B2 (en) | 2001-05-30 | 2007-09-11 | Sun Microsystems, Inc. | Apparatus and methods for caching objects using main memory and persistent memory |
US6803925B2 (en) | 2001-09-06 | 2004-10-12 | Microsoft Corporation | Assembling verbal narration for digital display images |
US7076602B2 (en) | 2001-11-05 | 2006-07-11 | Hywire Ltd. | Multi-dimensional associative search engine having an external memory |
US6754800B2 (en) | 2001-11-14 | 2004-06-22 | Sun Microsystems, Inc. | Methods and apparatus for implementing host-based object storage schemes |
CA2475319A1 (en) | 2002-02-04 | 2003-08-14 | Cataphora, Inc. | A method and apparatus to visually present discussions for data mining purposes |
US7096213B2 (en) | 2002-04-08 | 2006-08-22 | Oracle International Corporation | Persistent key-value repository with a pluggable architecture to abstract physical storage |
US7137145B2 (en) | 2002-04-09 | 2006-11-14 | Cisco Technology, Inc. | System and method for detecting an infective element in a network environment |
GB2388242A (en) | 2002-04-30 | 2003-11-05 | Hewlett Packard Co | Associating audio data and image data |
US20040034869A1 (en) | 2002-07-12 | 2004-02-19 | Wallace Michael W. | Method and system for display and manipulation of thematic segmentation in the analysis and presentation of film and video |
US6928526B1 (en) | 2002-12-20 | 2005-08-09 | Datadomain, Inc. | Efficient data storage system |
GB2424535A (en) | 2003-04-30 | 2006-09-27 | Hewlett Packard Co | Editing an image and associating sound with it |
US7827182B1 (en) | 2004-06-02 | 2010-11-02 | Cisco Technology, Inc | Searching for a path to identify where to move entries among hash tables with storage for multiple entries per bucket during insert operations |
US20050281541A1 (en) | 2004-06-17 | 2005-12-22 | Logan Beth T | Image organization method and system |
EP1797510A2 (en) | 2004-10-06 | 2007-06-20 | Permabit, Inc. | A storage system for randomly named blocks of data |
US7941401B2 (en) | 2005-05-09 | 2011-05-10 | Gemstone Systems, Inc. | Distributed data management system |
US20070005874A1 (en) | 2005-07-01 | 2007-01-04 | Dan Dodge | File system storing transaction records in flash-like media |
US7739599B2 (en) | 2005-09-23 | 2010-06-15 | Microsoft Corporation | Automatic capturing and editing of a video |
US7797283B2 (en) | 2005-10-21 | 2010-09-14 | Isilon Systems, Inc. | Systems and methods for maintaining distributed data |
US20080007567A1 (en) | 2005-12-18 | 2008-01-10 | Paul Clatworthy | System and Method for Generating Advertising in 2D or 3D Frames and Scenes |
US7457934B2 (en) | 2006-03-22 | 2008-11-25 | Hitachi, Ltd. | Method and apparatus for reducing the amount of data in a storage system |
JP4939851B2 (ja) * | 2006-06-21 | 2012-05-30 | パナソニック株式会社 | Information processing terminal, secure device, and state processing method |
US7640262B1 (en) | 2006-06-30 | 2009-12-29 | Emc Corporation | Positional allocation |
US20080010238A1 (en) | 2006-07-07 | 2008-01-10 | Microsoft Corporation | Index having short-term portion and long-term portion |
US8214517B2 (en) | 2006-12-01 | 2012-07-03 | Nec Laboratories America, Inc. | Methods and systems for quick and efficient data management and/or processing |
US7443321B1 (en) | 2007-02-13 | 2008-10-28 | Packeteer, Inc. | Compression of stream data using a hierarchically-indexed database |
US8234327B2 (en) | 2007-03-30 | 2012-07-31 | Netapp, Inc. | System and method for bandwidth optimization in a network storage environment |
US8315984B2 (en) | 2007-05-22 | 2012-11-20 | Netapp, Inc. | System and method for on-the-fly elimination of redundant data |
US7818329B2 (en) | 2007-06-07 | 2010-10-19 | International Business Machines Corporation | Method and apparatus for automatic multimedia narrative enrichment |
EP2015184A2 (en) * | 2007-07-06 | 2009-01-14 | Prostor Systems, Inc. | Commonality factoring for removable media |
US8046509B2 (en) | 2007-07-06 | 2011-10-25 | Prostor Systems, Inc. | Commonality factoring for removable media |
CN101350869B (zh) | 2007-07-19 | 2011-08-24 | 中国电信股份有限公司 | Method and device for deduplicating telecom billing records based on index and hash |
JP5026213B2 (ja) | 2007-09-28 | 2012-09-12 | 株式会社日立製作所 | Storage apparatus and data deduplication method |
US7962452B2 (en) | 2007-12-28 | 2011-06-14 | International Business Machines Corporation | Data deduplication by separating data from meta data |
US8447938B2 (en) | 2008-01-04 | 2013-05-21 | International Business Machines Corporation | Backing up a deduplicated filesystem to disjoint media |
US7962706B2 (en) | 2008-02-14 | 2011-06-14 | Quantum Corporation | Methods and systems for improving read performance in data de-duplication storage |
US7814074B2 (en) | 2008-03-14 | 2010-10-12 | International Business Machines Corporation | Method and system for assuring integrity of deduplicated data |
US20090238538A1 (en) | 2008-03-20 | 2009-09-24 | Fink Franklin E | System and method for automated compilation and editing of personalized videos including archived historical content and personal content |
JP2009251725A (ja) * | 2008-04-02 | 2009-10-29 | Hitachi Ltd | Storage control device and duplicate data detection method using the storage control device |
US7567188B1 (en) | 2008-04-10 | 2009-07-28 | International Business Machines Corporation | Policy based tiered data deduplication strategy |
US9395929B2 (en) | 2008-04-25 | 2016-07-19 | Netapp, Inc. | Network storage server with integrated encryption, compression and deduplication capability |
US8515909B2 (en) | 2008-04-29 | 2013-08-20 | International Business Machines Corporation | Enhanced method and system for assuring integrity of deduplicated data |
US8620877B2 (en) * | 2008-04-30 | 2013-12-31 | International Business Machines Corporation | Tunable data fingerprinting for optimizing data deduplication |
US8645333B2 (en) | 2008-05-29 | 2014-02-04 | International Business Machines Corporation | Method and apparatus to minimize metadata in de-duplication |
US20090319547A1 (en) | 2008-06-19 | 2009-12-24 | Microsoft Corporation | Compression Using Hashes |
US9043726B2 (en) | 2008-07-03 | 2015-05-26 | Ebay Inc. | Position editing tool of collage multi-media |
US9639505B2 (en) | 2008-07-03 | 2017-05-02 | Ebay, Inc. | System and methods for multimedia “hot spot” enablement |
US8271564B2 (en) * | 2008-07-14 | 2012-09-18 | Symbol Technologies, Inc. | Lookup table arrangement and related management method for accommodating concurrent processors |
US8086799B2 (en) | 2008-08-12 | 2011-12-27 | Netapp, Inc. | Scalable deduplication of stored data |
US8074049B2 (en) | 2008-08-26 | 2011-12-06 | Nine Technology, Llc | Online backup system with global two staged deduplication without using an indexing database |
US10642794B2 (en) | 2008-09-11 | 2020-05-05 | Vmware, Inc. | Computer storage deduplication |
US20100088296A1 (en) | 2008-10-03 | 2010-04-08 | Netapp, Inc. | System and method for organizing data to facilitate data deduplication |
WO2010045262A1 (en) | 2008-10-14 | 2010-04-22 | Wanova Technologies, Ltd. | Storage-network de-duplication |
US20100199065A1 (en) * | 2009-02-04 | 2010-08-05 | Hitachi, Ltd. | Methods and apparatus for performing efficient data deduplication by metadata grouping |
US8860865B2 (en) | 2009-03-02 | 2014-10-14 | Burning Moon, Llc | Assisted video creation utilizing a camera |
US8380738B2 (en) | 2009-03-17 | 2013-02-19 | Nec Laboratories America, Inc. | System and methods for database distribution and querying over key-based scalable storage |
US8205065B2 (en) | 2009-03-30 | 2012-06-19 | Exar Corporation | System and method for data deduplication |
US8255365B2 (en) * | 2009-06-08 | 2012-08-28 | Symantec Corporation | Source classification for performing deduplication in a backup operation |
CN101706825B (zh) | 2009-12-10 | 2011-04-20 | Huazhong University of Science and Technology | Data deduplication method based on file content type |
US9401967B2 (en) | 2010-06-09 | 2016-07-26 | Brocade Communications Systems, Inc. | Inline wire speed deduplication system |
US8397028B2 (en) | 2010-06-15 | 2013-03-12 | Stephen SPACKMAN | Index entry eviction |
CN101916171A (zh) | 2010-07-16 | 2010-12-15 | Institute of Computing Technology, Chinese Academy of Sciences | Concurrent hierarchical data deduplication method and system |
US8397080B2 (en) | 2010-07-29 | 2013-03-12 | Industrial Technology Research Institute | Scalable segment-based data de-duplication system and method for incremental backups |
US9104326B2 (en) | 2010-11-15 | 2015-08-11 | Emc Corporation | Scalable block data storage using content addressing |
US8572053B2 (en) | 2010-12-09 | 2013-10-29 | Jeffrey Vincent TOFANO | De-duplication indexing |
US9110936B2 (en) | 2010-12-28 | 2015-08-18 | Microsoft Technology Licensing, Llc | Using index partitioning and reconciliation for data deduplication |
2010
- 2010-12-28: US application US12/979,669, patent US8935487B2 (en), status Active
2011
- 2011-12-23: ES application ES11854263.8T, patent ES2626026T3 (es), status Active
- 2011-12-23: WO application PCT/US2011/067293, patent WO2012092213A2 (en), status Application Filing
- 2011-12-23: EP application EP11854263.8A, patent EP2659378B1 (en), status Active
- 2011-12-27: CN application CN201110445284.1A, patent CN102591947B (zh), status Active
2013
- 2013-01-09: HK application HK13100335.9A, patent HK1173520A1 (zh), status unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080005141A1 (en) * | 2006-06-29 | 2008-01-03 | Ling Zheng | System and method for retrieving and using block fingerprints for data deduplication |
US20100250858A1 (en) * | 2009-03-31 | 2010-09-30 | Symantec Corporation | Systems and Methods for Controlling Initialization of a Fingerprint Cache for Data Deduplication |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
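CN110324381A (zh) * | 2018-03-30 | 2019-10-11 | Beijing Starblaze Technology Co., Ltd. | KV storage device in cloud computing and fog computing systems |
CN110324381B (zh) * | 2018-03-30 | 2021-08-03 | Beijing Starblaze Technology Co., Ltd. | KV storage device in cloud computing and fog computing systems |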
Also Published As
Publication number | Publication date |
---|---|
US8935487B2 (en) | 2015-01-13 |
EP2659378B1 (en) | 2017-03-08 |
HK1173520A1 (zh) | 2013-05-16 |
EP2659378A4 (en) | 2015-01-21 |
EP2659378A2 (en) | 2013-11-06 |
US20110276781A1 (en) | 2011-11-10 |
CN102591947A (zh) | 2012-07-18 |
ES2626026T3 (es) | 2017-07-21 |
WO2012092213A2 (en) | 2012-07-05 |
WO2012092213A3 (en) | 2012-10-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102591947B (zh) | Fast and low-RAM-footprint indexing for data deduplication | |
US9053032B2 (en) | Fast and low-RAM-footprint indexing for data deduplication | |
US11093466B2 (en) | Incremental out-of-place updates for index structures | |
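CN102436420B (zh) | Low-RAM-space, high-throughput persistent key-value store using secondary memory | |
JP6198210B2 (ja) | Computer-implemented dynamic sharding method | |
CN102880663B (zh) | Optimization of partially deduplicated files | |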
US20170123676A1 (en) | Reference Block Aggregating into a Reference Set for Deduplication in Memory Management | |
US8412677B2 (en) | Systems and methods for byte-level or quasi byte-level single instancing | |
US8214388B2 (en) | System and method for adding a storage server in a distributed column chunk data store | |
US20070061542A1 (en) | System for a distributed column chunk data store | |
US7457935B2 (en) | Method for a distributed column chunk data store | |
US11151030B1 (en) | Method for prediction of the duration of garbage collection for backup storage systems | |
CN103098035A (zh) | Storage system | |
US20200334292A1 (en) | Key value append | |
WO2008157081A2 (en) | Distributed data storage using erasure resilient coding | |
CN102591946A (zh) | Using index partitioning and reconciliation for data deduplication | |
CN105117355A (zh) | Memory, memory system, and data processing method thereof | |
US20170123678A1 (en) | Garbage Collection for Reference Sets in Flash Storage Systems | |
US20170123689A1 (en) | Pipelined Reference Set Construction and Use in Memory Management | |
US11210281B2 (en) | Technique for log records management in database management system | |
US20170123677A1 (en) | Integration of Reference Sets with Segment Flash Management | |
KR20210113297A (ko) | Systems, methods and devices for eliminating duplicates and value redundancy in computer memories | |
US12124420B2 (en) | Systems, methods and devices for eliminating duplicates and value redundancy in computer memories | |
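CN118519964A (zh) | Data processing method, apparatus, computer program product, device, and storage medium | |
CN114691681A (zh) | Data processing method, apparatus, electronic device, and readable storage medium |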
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1173520; Country of ref document: HK |
ASS | Succession or assignment of patent right | Owner name: MICROSOFT TECHNOLOGY LICENSING LLC; Free format text: FORMER OWNER: MICROSOFT CORP.; Effective date: 20150727 |
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right | Effective date of registration: 20150727; Address after: Washington State; Applicant after: Microsoft Technology Licensing, LLC; Address before: Washington State; Applicant before: Microsoft Corp. |
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: GR; Ref document number: 1173520; Country of ref document: HK |