HK1165573A1 - Methods and apparatus for content-aware data partitioning and data de- duplication - Google Patents
Methods and apparatus for content-aware data partitioning and data de- duplicationInfo
- Publication number
- HK1165573A1 HK1165573A1 HK12106045.8A HK12106045A HK1165573A1 HK 1165573 A1 HK1165573 A1 HK 1165573A1 HK 12106045 A HK12106045 A HK 12106045A HK 1165573 A1 HK1165573 A1 HK 1165573A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- data
- duplication
- methods
- content
- aware
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0628—Interfaces specially adapted for storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
- G06F3/064—Management of blocks
- G06F3/0641—De-duplication techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/174—Redundancy elimination performed by the file system
- G06F16/1748—De-duplication implemented within the file system, e.g. based on file segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0602—Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
- G06F3/0608—Saving storage space on storage systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
- G06F3/0601—Interfaces specially adapted for storage systems
- G06F3/0668—Interfaces specially adapted for storage systems adopting a particular infrastructure
- G06F3/067—Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/967—Peer-to-peer
- Y10S707/968—Partitioning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/967—Peer-to-peer
- Y10S707/968—Partitioning
- Y10S707/969—Horizontal partitioning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/967—Peer-to-peer
- Y10S707/968—Partitioning
- Y10S707/97—Vertical partitioning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/971—Federated
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/971—Federated
- Y10S707/972—Partitioning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/971—Federated
- Y10S707/972—Partitioning
- Y10S707/973—Horizontal partitioning
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/964—Database arrangement
- Y10S707/966—Distributed
- Y10S707/971—Federated
- Y10S707/972—Partitioning
- Y10S707/974—Vertical partitioning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
- Storage Device Security (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13882708P | 2008-12-18 | 2008-12-18 | |
PCT/US2009/068687 WO2010080591A2 (fr) | 2008-12-18 | 2009-12-18 | Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1165573A1 true HK1165573A1 (en) | 2012-10-05 |
Family
ID=42267562
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK12106045.8A HK1165573A1 (en) | 2008-12-18 | 2012-06-20 | Methods and apparatus for content-aware data partitioning and data de- duplication |
Country Status (8)
Country | Link |
---|---|
US (2) | US8589455B2 (fr) |
EP (1) | EP2361417B1 (fr) |
JP (1) | JP5468620B2 (fr) |
CN (1) | CN102301377B (fr) |
AU (1) | AU2009335697A1 (fr) |
CA (1) | CA2747661A1 (fr) |
HK (1) | HK1165573A1 (fr) |
WO (1) | WO2010080591A2 (fr) |
Families Citing this family (66)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004070568A2 (fr) | 2003-01-31 | 2004-08-19 | Viair, Inc. | Recuperation asynchrone en temps reel de donnees |
US8140637B2 (en) | 2007-10-25 | 2012-03-20 | Hewlett-Packard Development Company, L.P. | Communicating chunks between devices |
DE112007003693B4 (de) | 2007-10-25 | 2014-05-15 | Hewlett-Packard Development Co., L.P. | Datenverarbeitungsvorrichtung und Verfahren zur Datenverarbeitung |
US9372941B2 (en) | 2007-10-25 | 2016-06-21 | Hewlett Packard Enterprise Development Lp | Data processing apparatus and method of processing data |
US8782368B2 (en) * | 2007-10-25 | 2014-07-15 | Hewlett-Packard Development Company, L.P. | Storing chunks in containers |
DE112008003826B4 (de) | 2008-04-25 | 2015-08-20 | Hewlett-Packard Development Company, L.P. | Datenverarbeitungsvorrichtung und Verfahren zur Datenverarbeitung |
WO2010080591A2 (fr) * | 2008-12-18 | 2010-07-15 | Sumooh Inc. | Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données |
GB2472072B (en) * | 2009-07-24 | 2013-10-16 | Hewlett Packard Development Co | Deduplication of encoded data |
WO2011013125A1 (fr) | 2009-07-27 | 2011-02-03 | Storwize Ltd. | Procede et systeme de transformation d'objets de donnees logiques a des fins de stockage |
WO2011033582A1 (fr) * | 2009-09-18 | 2011-03-24 | Hitachi, Ltd. | Système de stockage pour élimination de données dupliquées |
US8510275B2 (en) | 2009-09-21 | 2013-08-13 | Dell Products L.P. | File aware block level deduplication |
CN102934115B (zh) * | 2010-03-12 | 2016-07-06 | 科派恩股份有限公司 | 管理数据的方法、客户端设备和系统 |
WO2011116087A2 (fr) | 2010-03-16 | 2011-09-22 | Copiun, Inc. | Déduplication de données distribuée et hautement évolutive |
US20120036366A1 (en) * | 2010-08-09 | 2012-02-09 | Microsoft Corporation | Secure and verifiable data handling |
CN103229161B (zh) | 2010-08-24 | 2016-01-20 | 科派恩股份有限公司 | 连续接入网关和去重数据缓存服务器 |
US20130179413A1 (en) * | 2010-09-21 | 2013-07-11 | Georgia Tech Research Corporation | Compressed Distributed Storage Systems And Methods For Providing Same |
WO2012056491A1 (fr) | 2010-10-26 | 2012-05-03 | Hitachi, Ltd. | Appareil de stockage et procédé de contrôle des données |
US8862876B2 (en) | 2010-11-09 | 2014-10-14 | International Business Machines Corporation | Method and system for deleting data |
US8438139B2 (en) | 2010-12-01 | 2013-05-07 | International Business Machines Corporation | Dynamic rewrite of files within deduplication system |
US8380681B2 (en) * | 2010-12-16 | 2013-02-19 | Microsoft Corporation | Extensible pipeline for data deduplication |
US9280550B1 (en) * | 2010-12-31 | 2016-03-08 | Emc Corporation | Efficient storage tiering |
US8886901B1 (en) | 2010-12-31 | 2014-11-11 | Emc Corporation | Policy based storage tiering |
DE102011011283A1 (de) * | 2011-02-15 | 2012-08-16 | Christmann Informationstechnik + Medien Gmbh & Co. Kg | Verfahren zur Deduplizierung von auf einem Speichermedium gespeicherten Daten und Dateiserver dafür |
JP5660617B2 (ja) * | 2011-03-29 | 2015-01-28 | 日本電気株式会社 | ストレージ装置 |
US8904128B2 (en) | 2011-06-08 | 2014-12-02 | Hewlett-Packard Development Company, L.P. | Processing a request to restore deduplicated data |
US9069477B1 (en) * | 2011-06-16 | 2015-06-30 | Amazon Technologies, Inc. | Reuse of dynamically allocated memory |
US8918375B2 (en) | 2011-08-31 | 2014-12-23 | Microsoft Corporation | Content aware chunking for achieving an improved chunk size distribution |
US8990171B2 (en) | 2011-09-01 | 2015-03-24 | Microsoft Corporation | Optimization of a partially deduplicated file |
US8700634B2 (en) | 2011-12-29 | 2014-04-15 | Druva Inc. | Efficient deduplicated data storage with tiered indexing |
US8996467B2 (en) | 2011-12-29 | 2015-03-31 | Druva Inc. | Distributed scalable deduplicated data backup system |
US9990347B2 (en) | 2012-01-23 | 2018-06-05 | Microsoft Technology Licensing, Llc | Borderless table detection engine |
CN104067293B (zh) | 2012-01-23 | 2017-07-25 | 微软技术许可有限责任公司 | 矢量图分类引擎 |
EP2807602A1 (fr) * | 2012-01-23 | 2014-12-03 | Microsoft Corporation | Moteur de reconnaissance de motifs |
US9128616B2 (en) * | 2012-04-13 | 2015-09-08 | Hitachi, Ltd. | Storage device to backup content based on a deduplication system |
US10135462B1 (en) | 2012-06-13 | 2018-11-20 | EMC IP Holding Company LLC | Deduplication using sub-chunk fingerprints |
US8918390B1 (en) | 2012-06-13 | 2014-12-23 | Emc Corporation | Preferential selection of candidates for delta compression |
US9026740B1 (en) | 2012-06-13 | 2015-05-05 | Emc Corporation | Prefetch data needed in the near future for delta compression |
US9141301B1 (en) | 2012-06-13 | 2015-09-22 | Emc Corporation | Method for cleaning a delta storage system |
US8972672B1 (en) | 2012-06-13 | 2015-03-03 | Emc Corporation | Method for cleaning a delta storage system |
US8712978B1 (en) | 2012-06-13 | 2014-04-29 | Emc Corporation | Preferential selection of candidates for delta compression |
US9400610B1 (en) | 2012-06-13 | 2016-07-26 | Emc Corporation | Method for cleaning a delta storage system |
US9116902B1 (en) | 2012-06-13 | 2015-08-25 | Emc Corporation | Preferential selection of candidates for delta compression |
US9262429B2 (en) | 2012-08-13 | 2016-02-16 | Microsoft Technology Licensing, Llc | De-duplicating attachments on message delivery and automated repair of attachments |
CN102880671A (zh) * | 2012-09-07 | 2013-01-16 | 浪潮电子信息产业股份有限公司 | 一种面向分布式文件系统的主动重复数据删除方法 |
US9626373B2 (en) * | 2012-10-01 | 2017-04-18 | Western Digital Technologies, Inc. | Optimizing data block size for deduplication |
US9953008B2 (en) | 2013-01-18 | 2018-04-24 | Microsoft Technology Licensing, Llc | Grouping fixed format document elements to preserve graphical data semantics after reflow by manipulating a bounding box vertically and horizontally |
GB2513341A (en) | 2013-04-23 | 2014-10-29 | Ibm | Method and system for data de-duplication |
CN104123309B (zh) | 2013-04-28 | 2017-08-25 | 国际商业机器公司 | 用于数据管理的方法和系统 |
CN105637493A (zh) * | 2013-07-29 | 2016-06-01 | 慧与发展有限责任合伙企业 | 频繁使用的去重复对象的完整性 |
BR112015023973B1 (pt) | 2013-08-19 | 2021-12-14 | Huawei Technologies Co., Ltd | Método e aparelho de processamento de objeto de dados |
US10545918B2 (en) | 2013-11-22 | 2020-01-28 | Orbis Technologies, Inc. | Systems and computer implemented methods for semantic data compression |
WO2015183302A1 (fr) * | 2014-05-30 | 2015-12-03 | Hitachi, Ltd. | Procédé et appareil de système de stockage avec déduplication de données |
US10120875B1 (en) * | 2014-12-02 | 2018-11-06 | EMC IP Holding Company LLC | Method and system for detecting boundaries of data blocks for deduplication |
US10177907B2 (en) * | 2015-07-20 | 2019-01-08 | Sony Corporation | Distributed object routing |
CN105117235A (zh) * | 2015-09-18 | 2015-12-02 | 四川效率源信息安全技术股份有限公司 | 一种重组Office文件的方法 |
CN105306570B (zh) * | 2015-10-27 | 2018-07-20 | 创新科软件技术(深圳)有限公司 | 一种集群数据的存储方法 |
US10078451B1 (en) | 2016-01-22 | 2018-09-18 | Red Hat, Inc. | Deduplicating data based on boundary identification |
US9575681B1 (en) | 2016-04-29 | 2017-02-21 | International Business Machines Corporation | Data deduplication with reduced hash computations |
US10209892B2 (en) | 2016-11-28 | 2019-02-19 | Hewlett Packard Enterprise Development Lp | Storage of format-aware filter format tracking states |
US10528342B2 (en) * | 2017-10-16 | 2020-01-07 | Western Digital Technologies, Inc. | Function tracking for source code files |
JP6930506B2 (ja) * | 2018-08-08 | 2021-09-01 | 株式会社Jvcケンウッド | データ記録送信装置、データ記録送信方法、及びデータ記録送信プログラム |
US10922281B2 (en) | 2018-10-25 | 2021-02-16 | EMC IP Holding Company LLC | Application aware deduplication |
JP7295422B2 (ja) * | 2019-09-10 | 2023-06-21 | 富士通株式会社 | 情報処理装置および情報処理プログラム |
CN111222314B (zh) * | 2020-01-03 | 2021-12-21 | 北大方正集团有限公司 | 版式文档的比对方法、装置、设备及存储介质 |
US11797220B2 (en) | 2021-08-20 | 2023-10-24 | Cohesity, Inc. | Reducing memory usage in storing metadata |
US11947497B2 (en) | 2021-08-24 | 2024-04-02 | Cohesity, Inc. | Partial in-line deduplication and partial post-processing deduplication of data chunks |
Family Cites Families (74)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2166420C (fr) * | 1993-07-01 | 2006-03-28 | James R. Woodhill | Dispositif et methode de gestion de memoires reparties dans les systemes informatiques en reseau |
WO1996025801A1 (fr) * | 1995-02-17 | 1996-08-22 | Trustus Pty. Ltd. | Procede de decoupage d'un bloc de donnees en sous-blocs et de stockage et de communication de tels sous-blocs |
US5590810A (en) * | 1995-10-19 | 1997-01-07 | Wehbi; Ali D. | Order of participation control device |
US6278992B1 (en) * | 1997-03-19 | 2001-08-21 | John Andrew Curtis | Search engine using indexing method for storing and retrieving data |
US5873104A (en) * | 1997-06-26 | 1999-02-16 | Sun Microsystems, Inc. | Bounded-pause time garbage collection system and method including write barrier associated with source and target instances of a partially relocated object |
US6065046A (en) * | 1997-07-29 | 2000-05-16 | Catharon Productions, Inc. | Computerized system and associated method of optimally controlled storage and transfer of computer programs on a computer network |
US6487556B1 (en) * | 1998-12-18 | 2002-11-26 | International Business Machines Corporation | Method and system for providing an associative datastore within a data processing system |
US6377953B1 (en) * | 1998-12-30 | 2002-04-23 | Oracle Corporation | Database having an integrated transformation engine using pickling and unpickling of data |
US6526493B1 (en) * | 1999-03-30 | 2003-02-25 | Adaptec, Inc. | Method and apparatus for partitioning and formatting a storage media without rebooting by creating a logical device control block (DCB) on-the-fly |
JP2000293413A (ja) * | 1999-04-09 | 2000-10-20 | Canon Inc | データ保存装置および記憶媒体 |
JP2000315375A (ja) * | 1999-04-28 | 2000-11-14 | Matsushita Electric Ind Co Ltd | ファイル入出力システムおよびプログラム記録媒体 |
US6959291B1 (en) * | 1999-05-19 | 2005-10-25 | International Business Machines Corporation | Management of a concurrent use license in a logically-partitioned computer |
US7590644B2 (en) * | 1999-12-21 | 2009-09-15 | International Business Machine Corporation | Method and apparatus of streaming data transformation using code generator and translator |
US6704730B2 (en) | 2000-02-18 | 2004-03-09 | Avamar Technologies, Inc. | Hash file system and method for use in a commonality factoring system |
US6662193B1 (en) * | 2000-06-02 | 2003-12-09 | Cg4 Solutions, Inc. | Methods and systems for manipulating a database through portable data entry devices |
US6810398B2 (en) * | 2000-11-06 | 2004-10-26 | Avamar Technologies, Inc. | System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences |
JP2002319230A (ja) * | 2001-01-25 | 2002-10-31 | Sony Computer Entertainment Inc | 記録媒体、情報処理装置、コンテンツ配信サーバ、方法、プログラム、その記録媒体 |
US6742081B2 (en) * | 2001-04-30 | 2004-05-25 | Sun Microsystems, Inc. | Data storage array employing block checksums and dynamic striping |
US6934835B2 (en) * | 2002-01-09 | 2005-08-23 | International Business Machines Corporation | Building block removal from partitions |
US7051180B2 (en) * | 2002-01-09 | 2006-05-23 | International Business Machines Corporation | Masterless building block binding to partitions using identifiers and indicators |
US6941436B2 (en) * | 2002-05-09 | 2005-09-06 | International Business Machines Corporation | Method and apparatus for managing memory blocks in a logical partitioned data processing system |
US6976146B1 (en) * | 2002-05-21 | 2005-12-13 | Network Appliance, Inc. | System and method for emulating block appended checksums on storage devices by sector stealing |
US6871200B2 (en) * | 2002-07-11 | 2005-03-22 | Forensic Eye Ltd. | Registration and monitoring system |
KR100462886B1 (ko) * | 2002-10-15 | 2004-12-17 | 삼성전자주식회사 | 부하 분담 구조와 프라이머리/백업 구조가 혼합된 시스템 |
US8176186B2 (en) | 2002-10-30 | 2012-05-08 | Riverbed Technology, Inc. | Transaction accelerator for client-server communications systems |
US7120666B2 (en) | 2002-10-30 | 2006-10-10 | Riverbed Technology, Inc. | Transaction accelerator for client-server communication systems |
US6667700B1 (en) | 2002-10-30 | 2003-12-23 | Nbt Technology, Inc. | Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation |
US7065619B1 (en) | 2002-12-20 | 2006-06-20 | Data Domain, Inc. | Efficient data storage system |
US6928526B1 (en) | 2002-12-20 | 2005-08-09 | Datadomain, Inc. | Efficient data storage system |
WO2004061590A2 (fr) | 2003-01-02 | 2004-07-22 | Cricket Technologies Llc | Dispositif de filtrage d'archives electroniques et d'etablissement de profils, systeme, procede, et progiciel stocke par des moyens electroniques |
KR20040091392A (ko) | 2003-04-21 | 2004-10-28 | 주식회사 에트피아텍 | 웹을 이용한 원격 백업관리 시스템 및 그 시스템을 운용한백업관리 방법 |
US7143251B1 (en) * | 2003-06-30 | 2006-11-28 | Data Domain, Inc. | Data storage using identifiers |
CN1567303A (zh) * | 2003-07-03 | 2005-01-19 | 富士通株式会社 | 结构文档信息块的自动分割方法和装置 |
US7219102B2 (en) * | 2003-12-22 | 2007-05-15 | International Business Machines Corporation | Method, computer program product, and system converting relational data into hierarchical data structure based upon tagging trees |
US7412444B2 (en) * | 2004-02-11 | 2008-08-12 | Idx Systems Corporation | Efficient indexing of hierarchical relational database records |
US20060047855A1 (en) * | 2004-05-13 | 2006-03-02 | Microsoft Corporation | Efficient chunking algorithm |
US7487138B2 (en) * | 2004-08-25 | 2009-02-03 | Symantec Operating Corporation | System and method for chunk-based indexing of file system content |
US20060069733A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Detection and removal of information in files |
TWI274297B (en) * | 2004-11-19 | 2007-02-21 | Aiptek Int Inc | Method for deciding partition types of macro block |
US20060212439A1 (en) * | 2005-03-21 | 2006-09-21 | Microsoft Corporation | System and method of efficient data backup in a networking environment |
JP4682284B2 (ja) * | 2005-03-25 | 2011-05-11 | 成典 田中 | 文書差分検出装置 |
US8984636B2 (en) | 2005-07-29 | 2015-03-17 | Bit9, Inc. | Content extractor and analysis system |
US7447865B2 (en) * | 2005-09-13 | 2008-11-04 | Yahoo ! Inc. | System and method for compression in a distributed column chunk data store |
US8600948B2 (en) * | 2005-09-15 | 2013-12-03 | Emc Corporation | Avoiding duplicative storage of managed content |
US7624335B1 (en) * | 2005-10-13 | 2009-11-24 | Data Domain, Inc. | Verifying a file in a system with duplicate segment elimination using segmention-independent checksums |
WO2008048304A2 (fr) * | 2005-12-01 | 2008-04-24 | Firestar Software, Inc. | Système et procédé permettant d'échanger des informations entre des applications d'échange |
US7546321B2 (en) * | 2005-12-19 | 2009-06-09 | Yahoo! Inc. | System and method for recovery from failure of a storage server in a distributed column chunk data store |
US7472242B1 (en) * | 2006-02-14 | 2008-12-30 | Network Appliance, Inc. | Eliminating duplicate blocks during backup writes |
JP5309015B2 (ja) * | 2006-04-07 | 2013-10-09 | データ ストレージ グループ | データ圧縮技術およびデータ格納技術 |
US7949824B2 (en) | 2006-04-11 | 2011-05-24 | Emc Corporation | Efficient data storage using two level delta resemblance |
US7562186B2 (en) * | 2006-04-11 | 2009-07-14 | Data Domain, Inc. | Efficient data storage using resemblance of data segments |
US7504969B2 (en) * | 2006-07-11 | 2009-03-17 | Data Domain, Inc. | Locality-based stream segmentation for data deduplication |
US7881544B2 (en) * | 2006-08-24 | 2011-02-01 | Dell Products L.P. | Methods and apparatus for reducing storage size |
US7970216B2 (en) | 2006-08-24 | 2011-06-28 | Dell Products L.P. | Methods and apparatus for reducing storage size |
US7974478B2 (en) | 2006-08-24 | 2011-07-05 | Dell Products L.P. | Methods and apparatus for reducing storage size |
US7961959B2 (en) | 2006-08-24 | 2011-06-14 | Dell Products L.P. | Methods and apparatus for reducing storage size |
US7936932B2 (en) | 2006-08-24 | 2011-05-03 | Dell Products L.P. | Methods and apparatus for reducing storage size |
KR100834574B1 (ko) * | 2006-09-29 | 2008-06-02 | 한국전자통신연구원 | 파일 저장 시스템 및 그 시스템에서의 파일 저장 및 검색방법 |
US7733910B2 (en) | 2006-12-29 | 2010-06-08 | Riverbed Technology, Inc. | Data segmentation using shift-varying predicate function fingerprinting |
US20080243769A1 (en) * | 2007-03-30 | 2008-10-02 | Symantec Corporation | System and method for exporting data directly from deduplication storage to non-deduplication storage |
US8166012B2 (en) | 2007-04-11 | 2012-04-24 | Emc Corporation | Cluster storage using subsegmenting |
US8768895B2 (en) * | 2007-04-11 | 2014-07-01 | Emc Corporation | Subsegmenting for efficient storage, resemblance determination, and transmission |
US9930099B2 (en) | 2007-05-08 | 2018-03-27 | Riverbed Technology, Inc. | Hybrid segment-oriented file server and WAN accelerator |
US8209506B2 (en) * | 2007-09-05 | 2012-06-26 | Emc Corporation | De-duplication in a virtualized storage environment |
US8880797B2 (en) | 2007-09-05 | 2014-11-04 | Emc Corporation | De-duplication in a virtualized server environment |
US8219534B2 (en) | 2008-02-27 | 2012-07-10 | Dell Products L.P. | Multiple file compaction for network attached storage |
US8224831B2 (en) | 2008-02-27 | 2012-07-17 | Dell Products L.P. | Virtualization of metadata for file optimization |
US8516002B2 (en) | 2008-03-21 | 2013-08-20 | Dell Products L.P. | Deflate file data optimization |
US7933939B2 (en) * | 2008-04-16 | 2011-04-26 | Quantum Corporation | Apparatus and method for partitioning data blocks |
US8001329B2 (en) * | 2008-05-19 | 2011-08-16 | International Business Machines Corporation | Speculative stream scanning |
US7864083B2 (en) | 2008-05-21 | 2011-01-04 | Ocarina Networks, Inc. | Efficient data compression and decompression of numeric sequences |
WO2010080591A2 (fr) | 2008-12-18 | 2010-07-15 | Sumooh Inc. | Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données |
CN102934115B (zh) | 2010-03-12 | 2016-07-06 | 科派恩股份有限公司 | 管理数据的方法、客户端设备和系统 |
WO2011116087A2 (fr) | 2010-03-16 | 2011-09-22 | Copiun, Inc. | Déduplication de données distribuée et hautement évolutive |
-
2009
- 2009-12-18 WO PCT/US2009/068687 patent/WO2010080591A2/fr active Application Filing
- 2009-12-18 JP JP2011542475A patent/JP5468620B2/ja active Active
- 2009-12-18 US US12/642,033 patent/US8589455B2/en active Active
- 2009-12-18 AU AU2009335697A patent/AU2009335697A1/en not_active Abandoned
- 2009-12-18 EP EP09837974.6A patent/EP2361417B1/fr active Active
- 2009-12-18 US US12/642,023 patent/US7925683B2/en active Active
- 2009-12-18 CN CN200980155547.6A patent/CN102301377B/zh active Active
- 2009-12-18 CA CA2747661A patent/CA2747661A1/fr not_active Abandoned
-
2012
- 2012-06-20 HK HK12106045.8A patent/HK1165573A1/xx unknown
Also Published As
Publication number | Publication date |
---|---|
CN102301377A (zh) | 2011-12-28 |
JP2012513069A (ja) | 2012-06-07 |
EP2361417A2 (fr) | 2011-08-31 |
US7925683B2 (en) | 2011-04-12 |
WO2010080591A3 (fr) | 2010-09-30 |
EP2361417B1 (fr) | 2022-02-16 |
AU2009335697A1 (en) | 2011-08-04 |
US8589455B2 (en) | 2013-11-19 |
US20100161608A1 (en) | 2010-06-24 |
EP2361417A4 (fr) | 2016-09-07 |
WO2010080591A2 (fr) | 2010-07-15 |
US20100161685A1 (en) | 2010-06-24 |
AU2009335697A2 (en) | 2011-09-01 |
JP5468620B2 (ja) | 2014-04-09 |
CA2747661A1 (fr) | 2010-07-15 |
CN102301377B (zh) | 2015-07-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1165573A1 (en) | Methods and apparatus for content-aware data partitioning and data de- duplication | |
EP2414951A4 (fr) | Système et procédé de déduplication de données | |
GB2461803B (en) | Data access control method and data access control apparatus | |
GB2467622B (en) | Data storage apparatus | |
HK1159815A1 (en) | Method and apparatus for data categorizing | |
HK1132815A1 (en) | Methods and apparatus for improving data warehouse performance | |
EP2194463A4 (fr) | Procédé de classification de données et dispositif de classification de données | |
GB201100287D0 (en) | Methods and apparatus for processing road data | |
EP2622788A4 (fr) | Procédé et appareil permettant de commander un dispositif et support lisible par ordinateur contenant ce procédé | |
EP2545452A4 (fr) | Appareil et procédés de stockage de données | |
EP2232406A4 (fr) | Procédé et appareil pour analyser des données tridimensionnelles | |
GB2460773B (en) | Methods and apparatus for characterizing media | |
EP2235620A4 (fr) | Systeme et procede de creation de metadonnees | |
PL2403147T3 (pl) | Urządzenie i sposób przetwarzania danych | |
EP2506522A4 (fr) | Procédé et dispositif pour pousser les données | |
GB2460771B (en) | Methods and apparatus for performing moving checkshots | |
EP2108130A4 (fr) | Appareil et procédé de compression de données sismiques | |
GB0802184D0 (en) | Computer apparatus | |
EP2616948A4 (fr) | Procédé et appareil pour gérer des données | |
EP2618268A4 (fr) | Procédé et dispositif de conservation en mémoire de données | |
EP2260399A4 (fr) | Procédé et appareil pour entrer/émettre des données à l'aide d'une technique de virtualisation | |
EP2577667A4 (fr) | Appareil destiné au transfert d'informations synchrone avec la source et procédés associés | |
HK1127423A1 (en) | Method and apparatus for unshelling file | |
EP2362954A4 (fr) | Procédé et dispositif pour l'accès à des données et leur stockage | |
GB0814468D0 (en) | Methdo of and apparatus for analysing data files |