HK1165573A1 - Methods and apparatus for content-aware data partitioning and data de- duplication - Google Patents

Methods and apparatus for content-aware data partitioning and data de- duplication

Info

Publication number
HK1165573A1
HK1165573A1 HK12106045.8A HK12106045A HK1165573A1 HK 1165573 A1 HK1165573 A1 HK 1165573A1 HK 12106045 A HK12106045 A HK 12106045A HK 1165573 A1 HK1165573 A1 HK 1165573A1
Authority
HK
Hong Kong
Prior art keywords
data
duplication
methods
content
aware
Prior art date
Application number
HK12106045.8A
Other languages
English (en)
Chinese (zh)
Inventor
.加因
.喬德裡
Original Assignee
科普恩股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 科普恩股份有限公司 filed Critical 科普恩股份有限公司
Publication of HK1165573A1 publication Critical patent/HK1165573A1/xx

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • G06F3/064Management of blocks
    • G06F3/0641De-duplication techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/174Redundancy elimination performed by the file system
    • G06F16/1748De-duplication implemented within the file system, e.g. based on file segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0608Saving storage space on storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/067Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/967Peer-to-peer
    • Y10S707/968Partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/967Peer-to-peer
    • Y10S707/968Partitioning
    • Y10S707/969Horizontal partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/967Peer-to-peer
    • Y10S707/968Partitioning
    • Y10S707/97Vertical partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/971Federated
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/971Federated
    • Y10S707/972Partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/971Federated
    • Y10S707/972Partitioning
    • Y10S707/973Horizontal partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/971Federated
    • Y10S707/972Partitioning
    • Y10S707/974Vertical partitioning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Storage Device Security (AREA)
HK12106045.8A 2008-12-18 2012-06-20 Methods and apparatus for content-aware data partitioning and data de- duplication HK1165573A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13882708P 2008-12-18 2008-12-18
PCT/US2009/068687 WO2010080591A2 (fr) 2008-12-18 2009-12-18 Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données

Publications (1)

Publication Number Publication Date
HK1165573A1 true HK1165573A1 (en) 2012-10-05

Family

ID=42267562

Family Applications (1)

Application Number Title Priority Date Filing Date
HK12106045.8A HK1165573A1 (en) 2008-12-18 2012-06-20 Methods and apparatus for content-aware data partitioning and data de- duplication

Country Status (8)

Country Link
US (2) US8589455B2 (fr)
EP (1) EP2361417B1 (fr)
JP (1) JP5468620B2 (fr)
CN (1) CN102301377B (fr)
AU (1) AU2009335697A1 (fr)
CA (1) CA2747661A1 (fr)
HK (1) HK1165573A1 (fr)
WO (1) WO2010080591A2 (fr)

Families Citing this family (66)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004070568A2 (fr) 2003-01-31 2004-08-19 Viair, Inc. Recuperation asynchrone en temps reel de donnees
US8140637B2 (en) 2007-10-25 2012-03-20 Hewlett-Packard Development Company, L.P. Communicating chunks between devices
DE112007003693B4 (de) 2007-10-25 2014-05-15 Hewlett-Packard Development Co., L.P. Datenverarbeitungsvorrichtung und Verfahren zur Datenverarbeitung
US9372941B2 (en) 2007-10-25 2016-06-21 Hewlett Packard Enterprise Development Lp Data processing apparatus and method of processing data
US8782368B2 (en) * 2007-10-25 2014-07-15 Hewlett-Packard Development Company, L.P. Storing chunks in containers
DE112008003826B4 (de) 2008-04-25 2015-08-20 Hewlett-Packard Development Company, L.P. Datenverarbeitungsvorrichtung und Verfahren zur Datenverarbeitung
WO2010080591A2 (fr) * 2008-12-18 2010-07-15 Sumooh Inc. Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données
GB2472072B (en) * 2009-07-24 2013-10-16 Hewlett Packard Development Co Deduplication of encoded data
WO2011013125A1 (fr) 2009-07-27 2011-02-03 Storwize Ltd. Procede et systeme de transformation d'objets de donnees logiques a des fins de stockage
WO2011033582A1 (fr) * 2009-09-18 2011-03-24 Hitachi, Ltd. Système de stockage pour élimination de données dupliquées
US8510275B2 (en) 2009-09-21 2013-08-13 Dell Products L.P. File aware block level deduplication
CN102934115B (zh) * 2010-03-12 2016-07-06 科派恩股份有限公司 管理数据的方法、客户端设备和系统
WO2011116087A2 (fr) 2010-03-16 2011-09-22 Copiun, Inc. Déduplication de données distribuée et hautement évolutive
US20120036366A1 (en) * 2010-08-09 2012-02-09 Microsoft Corporation Secure and verifiable data handling
CN103229161B (zh) 2010-08-24 2016-01-20 科派恩股份有限公司 连续接入网关和去重数据缓存服务器
US20130179413A1 (en) * 2010-09-21 2013-07-11 Georgia Tech Research Corporation Compressed Distributed Storage Systems And Methods For Providing Same
WO2012056491A1 (fr) 2010-10-26 2012-05-03 Hitachi, Ltd. Appareil de stockage et procédé de contrôle des données
US8862876B2 (en) 2010-11-09 2014-10-14 International Business Machines Corporation Method and system for deleting data
US8438139B2 (en) 2010-12-01 2013-05-07 International Business Machines Corporation Dynamic rewrite of files within deduplication system
US8380681B2 (en) * 2010-12-16 2013-02-19 Microsoft Corporation Extensible pipeline for data deduplication
US9280550B1 (en) * 2010-12-31 2016-03-08 Emc Corporation Efficient storage tiering
US8886901B1 (en) 2010-12-31 2014-11-11 Emc Corporation Policy based storage tiering
DE102011011283A1 (de) * 2011-02-15 2012-08-16 Christmann Informationstechnik + Medien Gmbh & Co. Kg Verfahren zur Deduplizierung von auf einem Speichermedium gespeicherten Daten und Dateiserver dafür
JP5660617B2 (ja) * 2011-03-29 2015-01-28 日本電気株式会社 ストレージ装置
US8904128B2 (en) 2011-06-08 2014-12-02 Hewlett-Packard Development Company, L.P. Processing a request to restore deduplicated data
US9069477B1 (en) * 2011-06-16 2015-06-30 Amazon Technologies, Inc. Reuse of dynamically allocated memory
US8918375B2 (en) 2011-08-31 2014-12-23 Microsoft Corporation Content aware chunking for achieving an improved chunk size distribution
US8990171B2 (en) 2011-09-01 2015-03-24 Microsoft Corporation Optimization of a partially deduplicated file
US8700634B2 (en) 2011-12-29 2014-04-15 Druva Inc. Efficient deduplicated data storage with tiered indexing
US8996467B2 (en) 2011-12-29 2015-03-31 Druva Inc. Distributed scalable deduplicated data backup system
US9990347B2 (en) 2012-01-23 2018-06-05 Microsoft Technology Licensing, Llc Borderless table detection engine
CN104067293B (zh) 2012-01-23 2017-07-25 微软技术许可有限责任公司 矢量图分类引擎
EP2807602A1 (fr) * 2012-01-23 2014-12-03 Microsoft Corporation Moteur de reconnaissance de motifs
US9128616B2 (en) * 2012-04-13 2015-09-08 Hitachi, Ltd. Storage device to backup content based on a deduplication system
US10135462B1 (en) 2012-06-13 2018-11-20 EMC IP Holding Company LLC Deduplication using sub-chunk fingerprints
US8918390B1 (en) 2012-06-13 2014-12-23 Emc Corporation Preferential selection of candidates for delta compression
US9026740B1 (en) 2012-06-13 2015-05-05 Emc Corporation Prefetch data needed in the near future for delta compression
US9141301B1 (en) 2012-06-13 2015-09-22 Emc Corporation Method for cleaning a delta storage system
US8972672B1 (en) 2012-06-13 2015-03-03 Emc Corporation Method for cleaning a delta storage system
US8712978B1 (en) 2012-06-13 2014-04-29 Emc Corporation Preferential selection of candidates for delta compression
US9400610B1 (en) 2012-06-13 2016-07-26 Emc Corporation Method for cleaning a delta storage system
US9116902B1 (en) 2012-06-13 2015-08-25 Emc Corporation Preferential selection of candidates for delta compression
US9262429B2 (en) 2012-08-13 2016-02-16 Microsoft Technology Licensing, Llc De-duplicating attachments on message delivery and automated repair of attachments
CN102880671A (zh) * 2012-09-07 2013-01-16 浪潮电子信息产业股份有限公司 一种面向分布式文件系统的主动重复数据删除方法
US9626373B2 (en) * 2012-10-01 2017-04-18 Western Digital Technologies, Inc. Optimizing data block size for deduplication
US9953008B2 (en) 2013-01-18 2018-04-24 Microsoft Technology Licensing, Llc Grouping fixed format document elements to preserve graphical data semantics after reflow by manipulating a bounding box vertically and horizontally
GB2513341A (en) 2013-04-23 2014-10-29 Ibm Method and system for data de-duplication
CN104123309B (zh) 2013-04-28 2017-08-25 国际商业机器公司 用于数据管理的方法和系统
CN105637493A (zh) * 2013-07-29 2016-06-01 慧与发展有限责任合伙企业 频繁使用的去重复对象的完整性
BR112015023973B1 (pt) 2013-08-19 2021-12-14 Huawei Technologies Co., Ltd Método e aparelho de processamento de objeto de dados
US10545918B2 (en) 2013-11-22 2020-01-28 Orbis Technologies, Inc. Systems and computer implemented methods for semantic data compression
WO2015183302A1 (fr) * 2014-05-30 2015-12-03 Hitachi, Ltd. Procédé et appareil de système de stockage avec déduplication de données
US10120875B1 (en) * 2014-12-02 2018-11-06 EMC IP Holding Company LLC Method and system for detecting boundaries of data blocks for deduplication
US10177907B2 (en) * 2015-07-20 2019-01-08 Sony Corporation Distributed object routing
CN105117235A (zh) * 2015-09-18 2015-12-02 四川效率源信息安全技术股份有限公司 一种重组Office文件的方法
CN105306570B (zh) * 2015-10-27 2018-07-20 创新科软件技术(深圳)有限公司 一种集群数据的存储方法
US10078451B1 (en) 2016-01-22 2018-09-18 Red Hat, Inc. Deduplicating data based on boundary identification
US9575681B1 (en) 2016-04-29 2017-02-21 International Business Machines Corporation Data deduplication with reduced hash computations
US10209892B2 (en) 2016-11-28 2019-02-19 Hewlett Packard Enterprise Development Lp Storage of format-aware filter format tracking states
US10528342B2 (en) * 2017-10-16 2020-01-07 Western Digital Technologies, Inc. Function tracking for source code files
JP6930506B2 (ja) * 2018-08-08 2021-09-01 株式会社Jvcケンウッド データ記録送信装置、データ記録送信方法、及びデータ記録送信プログラム
US10922281B2 (en) 2018-10-25 2021-02-16 EMC IP Holding Company LLC Application aware deduplication
JP7295422B2 (ja) * 2019-09-10 2023-06-21 富士通株式会社 情報処理装置および情報処理プログラム
CN111222314B (zh) * 2020-01-03 2021-12-21 北大方正集团有限公司 版式文档的比对方法、装置、设备及存储介质
US11797220B2 (en) 2021-08-20 2023-10-24 Cohesity, Inc. Reducing memory usage in storing metadata
US11947497B2 (en) 2021-08-24 2024-04-02 Cohesity, Inc. Partial in-line deduplication and partial post-processing deduplication of data chunks

Family Cites Families (74)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2166420C (fr) * 1993-07-01 2006-03-28 James R. Woodhill Dispositif et methode de gestion de memoires reparties dans les systemes informatiques en reseau
WO1996025801A1 (fr) * 1995-02-17 1996-08-22 Trustus Pty. Ltd. Procede de decoupage d'un bloc de donnees en sous-blocs et de stockage et de communication de tels sous-blocs
US5590810A (en) * 1995-10-19 1997-01-07 Wehbi; Ali D. Order of participation control device
US6278992B1 (en) * 1997-03-19 2001-08-21 John Andrew Curtis Search engine using indexing method for storing and retrieving data
US5873104A (en) * 1997-06-26 1999-02-16 Sun Microsystems, Inc. Bounded-pause time garbage collection system and method including write barrier associated with source and target instances of a partially relocated object
US6065046A (en) * 1997-07-29 2000-05-16 Catharon Productions, Inc. Computerized system and associated method of optimally controlled storage and transfer of computer programs on a computer network
US6487556B1 (en) * 1998-12-18 2002-11-26 International Business Machines Corporation Method and system for providing an associative datastore within a data processing system
US6377953B1 (en) * 1998-12-30 2002-04-23 Oracle Corporation Database having an integrated transformation engine using pickling and unpickling of data
US6526493B1 (en) * 1999-03-30 2003-02-25 Adaptec, Inc. Method and apparatus for partitioning and formatting a storage media without rebooting by creating a logical device control block (DCB) on-the-fly
JP2000293413A (ja) * 1999-04-09 2000-10-20 Canon Inc データ保存装置および記憶媒体
JP2000315375A (ja) * 1999-04-28 2000-11-14 Matsushita Electric Ind Co Ltd ファイル入出力システムおよびプログラム記録媒体
US6959291B1 (en) * 1999-05-19 2005-10-25 International Business Machines Corporation Management of a concurrent use license in a logically-partitioned computer
US7590644B2 (en) * 1999-12-21 2009-09-15 International Business Machine Corporation Method and apparatus of streaming data transformation using code generator and translator
US6704730B2 (en) 2000-02-18 2004-03-09 Avamar Technologies, Inc. Hash file system and method for use in a commonality factoring system
US6662193B1 (en) * 2000-06-02 2003-12-09 Cg4 Solutions, Inc. Methods and systems for manipulating a database through portable data entry devices
US6810398B2 (en) * 2000-11-06 2004-10-26 Avamar Technologies, Inc. System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences
JP2002319230A (ja) * 2001-01-25 2002-10-31 Sony Computer Entertainment Inc 記録媒体、情報処理装置、コンテンツ配信サーバ、方法、プログラム、その記録媒体
US6742081B2 (en) * 2001-04-30 2004-05-25 Sun Microsystems, Inc. Data storage array employing block checksums and dynamic striping
US6934835B2 (en) * 2002-01-09 2005-08-23 International Business Machines Corporation Building block removal from partitions
US7051180B2 (en) * 2002-01-09 2006-05-23 International Business Machines Corporation Masterless building block binding to partitions using identifiers and indicators
US6941436B2 (en) * 2002-05-09 2005-09-06 International Business Machines Corporation Method and apparatus for managing memory blocks in a logical partitioned data processing system
US6976146B1 (en) * 2002-05-21 2005-12-13 Network Appliance, Inc. System and method for emulating block appended checksums on storage devices by sector stealing
US6871200B2 (en) * 2002-07-11 2005-03-22 Forensic Eye Ltd. Registration and monitoring system
KR100462886B1 (ko) * 2002-10-15 2004-12-17 삼성전자주식회사 부하 분담 구조와 프라이머리/백업 구조가 혼합된 시스템
US8176186B2 (en) 2002-10-30 2012-05-08 Riverbed Technology, Inc. Transaction accelerator for client-server communications systems
US7120666B2 (en) 2002-10-30 2006-10-10 Riverbed Technology, Inc. Transaction accelerator for client-server communication systems
US6667700B1 (en) 2002-10-30 2003-12-23 Nbt Technology, Inc. Content-based segmentation scheme for data compression in storage and transmission including hierarchical segment representation
US7065619B1 (en) 2002-12-20 2006-06-20 Data Domain, Inc. Efficient data storage system
US6928526B1 (en) 2002-12-20 2005-08-09 Datadomain, Inc. Efficient data storage system
WO2004061590A2 (fr) 2003-01-02 2004-07-22 Cricket Technologies Llc Dispositif de filtrage d'archives electroniques et d'etablissement de profils, systeme, procede, et progiciel stocke par des moyens electroniques
KR20040091392A (ko) 2003-04-21 2004-10-28 주식회사 에트피아텍 웹을 이용한 원격 백업관리 시스템 및 그 시스템을 운용한백업관리 방법
US7143251B1 (en) * 2003-06-30 2006-11-28 Data Domain, Inc. Data storage using identifiers
CN1567303A (zh) * 2003-07-03 2005-01-19 富士通株式会社 结构文档信息块的自动分割方法和装置
US7219102B2 (en) * 2003-12-22 2007-05-15 International Business Machines Corporation Method, computer program product, and system converting relational data into hierarchical data structure based upon tagging trees
US7412444B2 (en) * 2004-02-11 2008-08-12 Idx Systems Corporation Efficient indexing of hierarchical relational database records
US20060047855A1 (en) * 2004-05-13 2006-03-02 Microsoft Corporation Efficient chunking algorithm
US7487138B2 (en) * 2004-08-25 2009-02-03 Symantec Operating Corporation System and method for chunk-based indexing of file system content
US20060069733A1 (en) * 2004-09-30 2006-03-30 Microsoft Corporation Detection and removal of information in files
TWI274297B (en) * 2004-11-19 2007-02-21 Aiptek Int Inc Method for deciding partition types of macro block
US20060212439A1 (en) * 2005-03-21 2006-09-21 Microsoft Corporation System and method of efficient data backup in a networking environment
JP4682284B2 (ja) * 2005-03-25 2011-05-11 成典 田中 文書差分検出装置
US8984636B2 (en) 2005-07-29 2015-03-17 Bit9, Inc. Content extractor and analysis system
US7447865B2 (en) * 2005-09-13 2008-11-04 Yahoo ! Inc. System and method for compression in a distributed column chunk data store
US8600948B2 (en) * 2005-09-15 2013-12-03 Emc Corporation Avoiding duplicative storage of managed content
US7624335B1 (en) * 2005-10-13 2009-11-24 Data Domain, Inc. Verifying a file in a system with duplicate segment elimination using segmention-independent checksums
WO2008048304A2 (fr) * 2005-12-01 2008-04-24 Firestar Software, Inc. Système et procédé permettant d'échanger des informations entre des applications d'échange
US7546321B2 (en) * 2005-12-19 2009-06-09 Yahoo! Inc. System and method for recovery from failure of a storage server in a distributed column chunk data store
US7472242B1 (en) * 2006-02-14 2008-12-30 Network Appliance, Inc. Eliminating duplicate blocks during backup writes
JP5309015B2 (ja) * 2006-04-07 2013-10-09 データ ストレージ グループ データ圧縮技術およびデータ格納技術
US7949824B2 (en) 2006-04-11 2011-05-24 Emc Corporation Efficient data storage using two level delta resemblance
US7562186B2 (en) * 2006-04-11 2009-07-14 Data Domain, Inc. Efficient data storage using resemblance of data segments
US7504969B2 (en) * 2006-07-11 2009-03-17 Data Domain, Inc. Locality-based stream segmentation for data deduplication
US7881544B2 (en) * 2006-08-24 2011-02-01 Dell Products L.P. Methods and apparatus for reducing storage size
US7970216B2 (en) 2006-08-24 2011-06-28 Dell Products L.P. Methods and apparatus for reducing storage size
US7974478B2 (en) 2006-08-24 2011-07-05 Dell Products L.P. Methods and apparatus for reducing storage size
US7961959B2 (en) 2006-08-24 2011-06-14 Dell Products L.P. Methods and apparatus for reducing storage size
US7936932B2 (en) 2006-08-24 2011-05-03 Dell Products L.P. Methods and apparatus for reducing storage size
KR100834574B1 (ko) * 2006-09-29 2008-06-02 한국전자통신연구원 파일 저장 시스템 및 그 시스템에서의 파일 저장 및 검색방법
US7733910B2 (en) 2006-12-29 2010-06-08 Riverbed Technology, Inc. Data segmentation using shift-varying predicate function fingerprinting
US20080243769A1 (en) * 2007-03-30 2008-10-02 Symantec Corporation System and method for exporting data directly from deduplication storage to non-deduplication storage
US8166012B2 (en) 2007-04-11 2012-04-24 Emc Corporation Cluster storage using subsegmenting
US8768895B2 (en) * 2007-04-11 2014-07-01 Emc Corporation Subsegmenting for efficient storage, resemblance determination, and transmission
US9930099B2 (en) 2007-05-08 2018-03-27 Riverbed Technology, Inc. Hybrid segment-oriented file server and WAN accelerator
US8209506B2 (en) * 2007-09-05 2012-06-26 Emc Corporation De-duplication in a virtualized storage environment
US8880797B2 (en) 2007-09-05 2014-11-04 Emc Corporation De-duplication in a virtualized server environment
US8219534B2 (en) 2008-02-27 2012-07-10 Dell Products L.P. Multiple file compaction for network attached storage
US8224831B2 (en) 2008-02-27 2012-07-17 Dell Products L.P. Virtualization of metadata for file optimization
US8516002B2 (en) 2008-03-21 2013-08-20 Dell Products L.P. Deflate file data optimization
US7933939B2 (en) * 2008-04-16 2011-04-26 Quantum Corporation Apparatus and method for partitioning data blocks
US8001329B2 (en) * 2008-05-19 2011-08-16 International Business Machines Corporation Speculative stream scanning
US7864083B2 (en) 2008-05-21 2011-01-04 Ocarina Networks, Inc. Efficient data compression and decompression of numeric sequences
WO2010080591A2 (fr) 2008-12-18 2010-07-15 Sumooh Inc. Procédés et dispositif de partitionnement de données sensible au contenu et de déduplication de données
CN102934115B (zh) 2010-03-12 2016-07-06 科派恩股份有限公司 管理数据的方法、客户端设备和系统
WO2011116087A2 (fr) 2010-03-16 2011-09-22 Copiun, Inc. Déduplication de données distribuée et hautement évolutive

Also Published As

Publication number Publication date
CN102301377A (zh) 2011-12-28
JP2012513069A (ja) 2012-06-07
EP2361417A2 (fr) 2011-08-31
US7925683B2 (en) 2011-04-12
WO2010080591A3 (fr) 2010-09-30
EP2361417B1 (fr) 2022-02-16
AU2009335697A1 (en) 2011-08-04
US8589455B2 (en) 2013-11-19
US20100161608A1 (en) 2010-06-24
EP2361417A4 (fr) 2016-09-07
WO2010080591A2 (fr) 2010-07-15
US20100161685A1 (en) 2010-06-24
AU2009335697A2 (en) 2011-09-01
JP5468620B2 (ja) 2014-04-09
CA2747661A1 (fr) 2010-07-15
CN102301377B (zh) 2015-07-08

Similar Documents

Publication Publication Date Title
HK1165573A1 (en) Methods and apparatus for content-aware data partitioning and data de- duplication
EP2414951A4 (fr) Système et procédé de déduplication de données
GB2461803B (en) Data access control method and data access control apparatus
GB2467622B (en) Data storage apparatus
HK1159815A1 (en) Method and apparatus for data categorizing
HK1132815A1 (en) Methods and apparatus for improving data warehouse performance
EP2194463A4 (fr) Procédé de classification de données et dispositif de classification de données
GB201100287D0 (en) Methods and apparatus for processing road data
EP2622788A4 (fr) Procédé et appareil permettant de commander un dispositif et support lisible par ordinateur contenant ce procédé
EP2545452A4 (fr) Appareil et procédés de stockage de données
EP2232406A4 (fr) Procédé et appareil pour analyser des données tridimensionnelles
GB2460773B (en) Methods and apparatus for characterizing media
EP2235620A4 (fr) Systeme et procede de creation de metadonnees
PL2403147T3 (pl) Urządzenie i sposób przetwarzania danych
EP2506522A4 (fr) Procédé et dispositif pour pousser les données
GB2460771B (en) Methods and apparatus for performing moving checkshots
EP2108130A4 (fr) Appareil et procédé de compression de données sismiques
GB0802184D0 (en) Computer apparatus
EP2616948A4 (fr) Procédé et appareil pour gérer des données
EP2618268A4 (fr) Procédé et dispositif de conservation en mémoire de données
EP2260399A4 (fr) Procédé et appareil pour entrer/émettre des données à l'aide d'une technique de virtualisation
EP2577667A4 (fr) Appareil destiné au transfert d'informations synchrone avec la source et procédés associés
HK1127423A1 (en) Method and apparatus for unshelling file
EP2362954A4 (fr) Procédé et dispositif pour l'accès à des données et leur stockage
GB0814468D0 (en) Methdo of and apparatus for analysing data files